Unconfigured Ad

**Brian Bushnell** · 11-05-2014, 09:27 AM

PacBio uses ASCII-33 encoding, but for their reads of insert / CCS, which are consensus of the same read multiple times, they routinely assign quality values way above the standard limit of 41. This breaks a lot of tools that try to auto-detect quality encoding, so it's important to manually specify the quality encoding, or else preprocess the PacBio reads to cap their quality at 41.

For BBTools, for example, you can use the "qin=33" flag. Also, Reformat can accept the "fixquality" flag which will cap all incoming qualities at 41 and write the corrected output file.

**lankage** · 11-05-2014, 09:36 AM

pac bio quality scores

So a pac bio quality string score of 80 --> "q", is for all intents and purposes equivalent to a score of 41 as far as read quality filtering is concerned?

The tool i want to use attempts to auto detect ASCII -33 or -64 offset, picks 64 offset, then throws out half the reads. Would it be appropriate to preprocess the fastq files and replace any quality characters with score > 41 with a "J".
ord("J") - 33 = 41

**Brian Bushnell** · 11-05-2014, 10:10 AM

Yes, that would be fine. The super-high quality values of PacBio reads are not accurate anyway. Q41 means under 1/10000 chance of error, and anything past that is unimportant for the purposes of quality filtering (particularly if its inaccurate).

From the BBTools package, this command will do the trick:

reformat.sh in=reads.fq out=fixed.fq qin=33 fixquality

**lankage** · 11-05-2014, 10:12 AM

Great thanks!

**sunz** · 06-18-2015, 01:05 PM

Originally posted by Brian Bushnell View Post

Yes, that would be fine. The super-high quality values of PacBio reads are not accurate anyway. Q41 means under 1/10000 chance of error, and anything past that is unimportant for the purposes of quality filtering (particularly if its inaccurate).

From the BBTools package, this command will do the trick:

reformat.sh in=reads.fq out=fixed.fq qin=33 fixquality

Hi Brian,

I installed the BBMap 35.07 and tried to run the above reformat command but got the following error:

"java -ea -Xmx200m -cp /home/sunz/bbmap/current/ jgi.ReformatReads in=test.fastq out=test_Qfixed.fq qin=33 fixquality
Executing jgi.ReformatReads [in=test.fastq, out=test_Qfixed.fq, qin=33, fixquality]

Unknown parameter fixquality
Exception in thread "main" java.lang.AssertionError: Unknown parameter fixquality
at jgi.ReformatReads.<init>(ReformatReads.java:168)
at jgi.ReformatReads.main(ReformatReads.java:45)"

Any suggestion? Thanks!

**Brian Bushnell** · 06-18-2015, 03:02 PM

Oh, I guess I took out that flag. It's now done automatically. You can specify a cutoff with the flag "maxcalledquality", which defaults to 41. So, the command would be:

reformat.sh in=reads.fq out=fixed.fq qin=33 maxcalledquality=41

...but you can leave out the "maxcalledquality=41" if you want.

**sunz** · 06-19-2015, 06:26 AM

great, thx!

Topics	Statistics	Last Post
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 36 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism by SEQadmin2 Started by SEQadmin2, 06-09-2026, 11:58 AM	0 responses 99 views 0 reactions	Last Post by SEQadmin2 06-09-2026, 11:58 AM
A New Method Makes Hantavirus Genome Analysis Faster and More Accessible by SEQadmin2 Started by SEQadmin2, 06-05-2026, 10:09 AM	0 responses 120 views 0 reactions	Last Post by SEQadmin2 06-05-2026, 10:09 AM
A New Single-Cell Method Maps DNA-Protein Interactions by SEQadmin2 Started by SEQadmin2, 06-04-2026, 08:59 AM	0 responses 113 views 0 reactions	Last Post by SEQadmin2 06-04-2026, 08:59 AM

Unconfigured Ad

Pac Bio fastq file quality score encoding

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News