Seqanswers Leaderboard Ad

**HESmith** · 06-06-2012, 07:52 PM

1) If the PHRED scores of the UMI and post-T segments look good, you could rerun the initial scripts from CASAVA to output all of the reads (passed and failed) as FASTQs while masking the T segment and demultiplexing on the UMI.
2) Not sure, but if it's the T segment (as suspected) then the PHRED scores for those cycles are probably much lower than the flanking segments.
3) The images are not saved, so 4) deferred basecalling is not an option at this point.

**mmpillai** · 06-06-2012, 08:21 PM

Thank you, we will try those options first. I was under the impression that the PF is calculated on the "chastity scores" of the first 12-20 bases or so, does that directly correlate with the PHRED score for the base or is that a separate metric ?

**HESmith** · 06-07-2012, 02:41 AM

Chastity differs from PHRED score, and is calculated for the first 25 cycles. Per cycle PHRED scores can be visualized with Illumina's HCS or SAV software.

**mmpillai** · 06-12-2012, 08:33 PM

So as an update, illumina and our NGS core both say they cannot rerun the scripts by masking the Ts ( bps 7 to 20). We do have the CIF files saved and I am guessing using a third party base caller would be the next logical step. There seems to be several available, but would there be an advantage of one vs another ( say those with no need for training sets like AYB, naivebayescall or OnlineCall vs IBIS )? And should the Ts try to be masked with these base callers ? I remain optimistic that the dataset is usable given the intensity files "looked good" per the illumina tehnical person himself but almost certainly the base calling is being thrown off by the T stretch.

**simonandrews** · 06-13-2012, 12:43 AM

If you have a subset of bases which are causing a problem another option is to rerun the bcl conversion specifying --no-eamss. I'd also tell it to export QC filtered sequences as well (don't have the 1.8.1 manual to hand so can't remember the exact option to specify for this). You might find that the qualities of the poly-T stretch are poor, but that they recover once the low complexity sequence is over. Turning off EAMSS will allow the qualities to come back up again and you might return to usable sequence.

**GenoMax** · 06-13-2012, 04:36 AM

Originally posted by simonandrews View Post

If you have a subset of bases which are causing a problem another option is to rerun the bcl conversion specifying --no-eamss. I'd also tell it to export QC filtered sequences as well (don't have the 1.8.1 manual to hand so can't remember the exact option to specify for this).

The option referenced by Simon is "--with-failed-reads" which will include reads failing the filter in the output file.

**HESmith** · 06-13-2012, 05:03 AM

You can combine the recommendations of Simon and Genomax with the flag --use-bases-mask I6n14Y* to mask the Ts and demultiplex on the first six bases.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 30 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 32 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 28 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 53 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Low Diversity library ( 14 Ts) on HiSeq2000

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News