**gringer** · 12-11-2016, 10:37 AM

I wouldn't recommend trusting FastQC unless you have some other independent verification of the results.

When was the sequencing done? If it has been carried out in the last five years, it's most likely to be phred33. Here's my favourite reference that shows the differences:

FASTQ format - Wikipedia

https://en.wikipedia.org/wiki/Fastq#Encoding

It might be that the quality scores are so good that the file has been misdetected as phred64. In that case converting is a bad idea: Q40 values would become roughly Q9, which will mess up alignment and error correction.
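To make the ambiguity concrete, here is a minimal sketch of range-based offset guessing. The function names are made up for illustration, the thresholds come from the ASCII ranges in the Wikipedia table linked above, and the sketch assumes Illumina-style data where phred+33 scores top out around Q41. It is not how FastQC works internally.

```python
def ascii_range(quality_strings):
    """Return the (min, max) ASCII codes seen across FASTQ quality strings."""
    codes = [ord(ch) for q in quality_strings for ch in q]
    return min(codes), max(codes)


def guess_offset(quality_strings):
    """Guess the Phred offset (33 or 64) from the observed ASCII range.

    Assumption: codes below 59 cannot occur in +64 encodings, and codes
    above 74 ('J', Q41) do not occur in Illumina phred+33 data.
    Returns None when every character falls in the overlapping range.
    """
    lo, hi = ascii_range(quality_strings)
    if lo < 59:
        return 33
    if hi > 74:
        return 64
    return None  # ambiguous: uniformly high-quality reads look like either


# 'I' is Q40 under phred+33, but Q9 if the same character is read as phred+64.
q40_char = "I"
as_phred33 = ord(q40_char) - 33
as_phred64 = ord(q40_char) - 64
```

A read whose qualities never drop below `;` (ASCII 59) returns `None` here, which is why an independent check such as the Illumina pipeline version is useful.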

**biocomputer** · 12-12-2016, 08:34 AM

It's two sets of sequencing from a year or two apart, both done several years ago. I don't know exactly when, but I can try to find out. The samples from the first (older) batch are all indicated to be Illumina 1.5, while those from the second (newer) batch are all 1.9.

**gringer** · 12-12-2016, 10:20 AM

Okay, that counts as an independent reason for the difference.

It would be best to map the batches separately using different phred options in HISAT2. There may be batch effects associated with the different Illumina runs, and the process is simpler if it's done that way; the HISAT2 output can be merged post-mapping if desired (with 'samtools merge' on sorted BAM files).
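As a rough sketch of that workflow, the helpers below only build the command lines rather than running them. The index path and file names are hypothetical, and single-end reads (`-U`) are assumed; paired-end data would use `-1`/`-2` instead. `--phred33` and `--phred64` are HISAT2's input-quality options.

```python
def hisat2_command(index, fastq, out_sam, phred_offset):
    """Build a HISAT2 command for one batch with the matching quality flag."""
    if phred_offset not in (33, 64):
        raise ValueError("phred_offset must be 33 or 64")
    return [
        "hisat2",
        f"--phred{phred_offset}",  # tells HISAT2 how to read input qualities
        "-x", index,
        "-U", fastq,
        "-S", out_sam,
    ]


def sort_command(in_sam, out_bam):
    """Build a 'samtools sort' command that writes a coordinate-sorted BAM."""
    return ["samtools", "sort", "-o", out_bam, in_sam]


def merge_command(out_bam, sorted_bams):
    """Build a 'samtools merge' command for already-sorted BAM files."""
    return ["samtools", "merge", out_bam, *sorted_bams]


# Hypothetical file names: older batch is phred+64, newer batch is phred+33.
commands = [
    hisat2_command("genome_index", "old_batch.fastq", "old_batch.sam", 64),
    hisat2_command("genome_index", "new_batch.fastq", "new_batch.sam", 33),
    sort_command("old_batch.sam", "old_batch.sorted.bam"),
    sort_command("new_batch.sam", "new_batch.sorted.bam"),
    merge_command("merged.bam", ["old_batch.sorted.bam", "new_batch.sorted.bam"]),
]
```

Each list can be passed to `subprocess.run(cmd, check=True)` once the paths are real. Keeping the batches in separate BAM files until the end also makes it easy to skip the merge if batch effects turn out to matter.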

The SAM format specification requires quality scores to be written as phred+33, so when the input is declared as phred+64 the mapping process will auto-convert the scores to phred+33 in its output.
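The conversion itself is just an offset shift. The function below is an illustrative sketch of that arithmetic (the name is made up, and this is not HISAT2's actual code), with a guard against input that is not really phred+64.

```python
def phred64_to_phred33(quality):
    """Re-encode a phred+64 quality string as phred+33.

    Raises ValueError if any character decodes to a negative score,
    which suggests the input was not phred+64 in the first place.
    """
    scores = [ord(ch) - 64 for ch in quality]
    if any(score < 0 for score in scores):
        raise ValueError("negative score: input does not look like phred+64")
    return "".join(chr(score + 33) for score in scores)


# 'h' is Q40 and 'B' is Q2 under phred+64; they become 'I' and '#' under phred+33.
converted = phred64_to_phred33("hhB")
```

The guard matters for the case discussed earlier in the thread: feeding genuine phred+33 data through this shift would either fail or silently lower every score by 31.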

HISAT2 and different Illumina versions/Phred quality scores