**GenoMax** · 12-19-2014, 05:36 AM

Just to clarify: the data in example 1 above was processed with bcl2fastq and example 2 with MiSeq Reporter?

On-board demultiplexing on MiSeq is able to keep all bases on the tag reads. It also does 1-error demultiplexing by default on tag reads (and this can't be turned off).

If the two tag reads can't be correctly assigned, based on the sample sheet you are providing to bcl2fastq, they will end up in the fastq header as a concatenated string.

Most of the discrimination happens in the first 5-7 bases on the tag with standard barcodes so the 8th position is not that critical.
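For readers unfamiliar with the term, "1-error demultiplexing" can be sketched roughly as below. This is only an illustration of the general idea (assign a tag to a sample when it is within one mismatch of exactly one barcode), not the actual MiSeq Reporter or bcl2fastq implementation; the barcodes, sample names and function names are all hypothetical.

```python
def hamming(a, b):
    """Number of positions at which two equal-length tags differ."""
    if len(a) != len(b):
        raise ValueError("tags must have the same length")
    return sum(x != y for x, y in zip(a, b))


def assign_sample(tag, sheet, max_mismatches=1):
    """Return the sample whose barcode is within max_mismatches of tag.

    Returns None when no barcode, or more than one barcode, matches,
    which is roughly what ends up as an 'Undetermined' read.
    """
    hits = [name for name, barcode in sheet.items()
            if hamming(tag, barcode) <= max_mismatches]
    return hits[0] if len(hits) == 1 else None


# Hypothetical sample sheet: sample name -> 8 bp barcode.
SHEET = {"sampleA": "ACGTACGT", "sampleB": "TGCATGCA"}

print(assign_sample("ACGTACGA", SHEET))  # one mismatch, in the 8th position
print(assign_sample("ACGTACAA", SHEET))  # two mismatches, left unassigned
```

With well-separated barcodes, a miscalled 8th base still leaves the tag within one mismatch of its barcode, which is consistent with the point that the last position is not that critical.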

**PopGenTech** · 12-19-2014, 06:00 AM

Thanks GenoMax,

Yes, that is correct: the top example is bcl2fastq, the bottom is MSR (MiSeq Reporter). Note that the reads are unrelated examples from different runs/libraries.

"On-board demultiplexing on MiSeq is able to keep all bases on the tag reads." is what I'm trying to understand.

From your previous explanation, and the fact that the last position of the index read isn't phased, I understand that there is no data to identify the 8th position base. Consequently, the only way to deduce the full 8 bp of a tag is informatically. However, if the sample sheet is purposefully fake and the sequencer software doesn't have a lookup table of indexes, how can it impute the correct tag? This is why the bcl2fastq read ID lines have a 14 bp tag, despite the 16 bp of index sequence.

"If the two tag reads can't be correctly assigned, based on the sample sheet you are providing to bcl2fastq, they will end up in the fastq header as a concatenated string."

Yes, as in the lostreads file, but how does MSR manage to construct a 16 bp tag given the same fake SampleSheet.CSV? What information was used to impute the identity of the 8th position base, or should it really be treated as an N?

Is it that on-board demuxing can use sequence run information not available to bcl2fastq in post run analysis, or is it error correction / imputation using a separate algorithm?

Thank you for your kind help and explanations of the process.

**GenoMax** · 12-19-2014, 06:58 AM

Calls for the last base are there. They are not being imputed. Bcl2fastq ignores the call, whereas the on-board software keeps it. The instrument is going to sequence as it was set up. If you absolutely need "n" bases and are planning to use bcl2fastq, then it is better to set the run up as n+1 cycles.

**PopGenTech** · 12-19-2014, 07:03 AM

Thanks GenoMax, that's what I needed to know.
Kind regards.

**PopGenTech** · 12-19-2014, 09:02 AM

Final word: Thanks to CRI UK for pointing me in this direction:

Explicitly state the base mask with --use-bases-mask to override what is derived from config.xml:

#configureBclToFastq.pl --input-dir test_demux/Data/Intensities/BaseCalls --output-dir test_output --sample-sheet test_demux/Data/Intensities/SampleSheet.csv --use-bases-mask y150n,I8,I8,y150n --no-eamss --fastq-cluster-count 0
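To make the mask easier to read, here is a small Python sketch that expands a --use-bases-mask string into one letter per cycle and counts how many index bases are kept. It assumes the usual mask syntax (comma-separated reads; y = use, I = index, n = ignore; an optional number gives the repeat count) and deliberately does not handle the `*` wildcard. The function names are made up for illustration, and the `I7n` mask in the comparison is an assumption about what the default looked like, chosen because it would give the 14 bp tags discussed earlier in the thread.

```python
import re

TOKEN = re.compile(r"([YyNnIi])(\d*)")
READ_MASK = re.compile(r"(?:[YyNnIi]\d*)+")


def expand_mask(mask):
    """Expand e.g. 'y150n,I8' into ['YYY...YN', 'IIIIIIII'] (one letter per cycle)."""
    reads = []
    for read_mask in mask.split(","):
        if not READ_MASK.fullmatch(read_mask):
            raise ValueError("unsupported mask element: %r" % read_mask)
        reads.append("".join(symbol.upper() * (int(count) if count else 1)
                             for symbol, count in TOKEN.findall(read_mask)))
    return reads


def index_bases_kept(mask):
    """Total number of index cycles that make it into the output."""
    return sum(read.count("I") for read in expand_mask(mask))


print(index_bases_kept("y150n,I8,I8,y150n"))    # mask used in the command above
print(index_bases_kept("y150n,I7n,I7n,y150n"))  # assumed default-style mask
```

The first call counts 16 index bases and the second 14, which is the 2x8 bp versus 2x7 bp difference this thread is about.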

This gets the full 2x8bp of the index in the output as shown:

(vex)[ir210@beast Sample_lane1]$ zgrep '^@' lane1_Undetermined_L001_R1_001.fastq.gz |head -n10|cut -d: -f10|sort|uniq -c|sort -nr

4 TAAGGCTCTAAGGCTC
2 CTAGTCAGATTCCGAG
1 TACATGAGTCTTTCCC
1 GCCTTAGACGATTGAC
1 GACTAGCTGGCATTGT
1 GACCGATTGATGCTGT
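The one-liner above only looks at the first 10 header lines, and `zgrep '^@'` can also pick up quality lines that happen to start with `@`. A rough Python equivalent that tallies the index field over the whole file, reading only every 4th line as a header, might look like this. It assumes the same header layout as above (index sequence in the 10th colon-separated field) and a well-formed 4-line-per-record FASTQ; the function name is made up.

```python
import gzip
from collections import Counter
from itertools import islice


def count_index_tags(fastq_gz_path, limit=None):
    """Count index tags from the header lines of a gzipped FASTQ file.

    limit: optional maximum number of records to inspect (None = all).
    """
    counts = Counter()
    with gzip.open(fastq_gz_path, "rt") as handle:
        headers = islice(handle, 0, None, 4)  # lines 1, 5, 9, ... are headers
        if limit is not None:
            headers = islice(headers, limit)
        for header in headers:
            # Same field as `cut -d: -f10` in the shell pipeline.
            counts[header.rstrip("\n").split(":")[9]] += 1
    return counts


if __name__ == "__main__":
    import sys
    # Usage: python count_tags.py lane1_Undetermined_L001_R1_001.fastq.gz
    for tag, n in count_index_tags(sys.argv[1]).most_common(10):
        print(n, tag)
```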


bcl2fastq and index length
