Read quality filtering for long, PE runs

kmcarr

#1


07-21-2009, 07:07 AM

The default pass-filter parameters in the Illumina pipeline or RTA are close to meaningless for long, paired-end (PE) runs. Filter passing is based solely on the first 25 cycles of the run, and on a 2x76 PE run a lot can happen in cycles 26-152 to make a read worthless. As an example, a software failure at cycle 25 forced us to restart a run, which included realigning the XY position of the stage. As a result, a small fraction of clusters ended up too far out of alignment to be called in the later cycles. I have actual data where bases 26-76 of read 1 and all of read 2 are 'N's, yet the filtering algorithm still marks them as passed reads.

I have been thinking about better ways to filter reads, but I would like to hear from the community. Has anyone else here applied filtering of their own, or do most people ignore filtering entirely, throw the whole pile at the mapper/assembler, and let it figure it out?
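
To make the question concrete, here is a minimal sketch of one possible post-hoc filter: look at the full length of both reads and drop the whole pair if either mate has too many 'N' calls. This is not how the Illumina pipeline or RTA filters; the function names, the `(name, sequence, quality)` tuple layout, and the 10% threshold are all assumptions chosen for illustration.

```python
from typing import Iterable, Iterator, Tuple

# Hypothetical in-memory read record: (name, sequence, quality string).
Read = Tuple[str, str, str]


def n_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are uncalled ('N')."""
    if not seq:
        return 1.0  # treat an empty read as entirely uncalled
    return seq.upper().count("N") / len(seq)


def filter_pairs(
    pairs: Iterable[Tuple[Read, Read]],
    max_n_fraction: float = 0.1,  # arbitrary illustrative threshold
) -> Iterator[Tuple[Read, Read]]:
    """Yield only pairs in which BOTH reads are at or below max_n_fraction.

    Unlike a filter based on the first 25 cycles, this checks every
    cycle of read 1 and read 2.
    """
    for read1, read2 in pairs:
        if (n_fraction(read1[1]) <= max_n_fraction
                and n_fraction(read2[1]) <= max_n_fraction):
            yield read1, read2


if __name__ == "__main__":
    good = (("a/1", "ACGT" * 19, "I" * 76), ("a/2", "TGCA" * 19, "I" * 76))
    # Mimics the failure described above: read 1 is fine through cycle 25,
    # then bases 26-76 are 'N'; read 2 is all 'N'.
    bad = (("b/1", "ACGTA" * 5 + "N" * 51, "I" * 76), ("b/2", "N" * 76, "I" * 76))
    for r1, r2 in filter_pairs([good, bad]):
        print(r1[0], r2[0])
```

A pair-level decision keeps read 1 and read 2 files in sync for the mapper. Whether dropping the pair is better than trimming read 1 back to its last good cycle and treating it as a single-end read is exactly the kind of trade-off I would like opinions on.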