  • bcl2fastq fails to demultiplex: barcode collision

    bcl2fastq --barcode-mismatches 1 -o ./test --tiles s_6 --sample-sheet SampleSheet_L006_8index.csv



    2019-05-27 13:33:58 [28b0880] ERROR: bcl2fastq::common::Exception: 2019-May-27 13:33:58: Success (0): /TeamCityBuildAgent/work/556afd631a5b66d8/src/cxx/lib/layout/BarcodeCollisionDetector.cpp(187): Throw in function void bcl2fastq::layout::BarcodeCollisionDetector::handleCollision(const value_type&, const value_type&)
    Dynamic exception type: boost::exception_detail::clone_impl<bcl2fastq::layout::BarcodeCollisionError>
    std::exception::what: Barcode collision for barcodes: GACCTGAT, CAGCTGAT
    By default, bcl2fastq allows 1 mismatch in each barcode. Barcodes with too few mismatches are ambiguous ( less than 2 times the number of mismatches plus 1). To reduce the number of allowed mismatches, use the command line option: '--barcode-mismatches'. Note that particularly for barcodes with only 1 mismatch, there is the danger that some reads will be written to the wrong sample due to errors in the barcode sequence.



    What's wrong with bcl2fastq? The indexes GACCTGAT and CAGCTGAT differ at two bases, so I cannot see a collision between them.

  • #2
    The confusion comes from the use of "mismatch" both for "sequencing error" and "difference between barcodes".
    If you allow one sequencing error, then the number of differences between any two barcodes must be at least (2 * sequencing errors + 1) = 3. Otherwise, for example, if the sequencer reads CACCTGAT, the software can't tell whether this is the first barcode with one sequencing error or the second barcode with one sequencing error.

    In your case, you should allow zero mismatches (sequencing errors), i.e. run with --barcode-mismatches 0, because your barcodes are only two mismatches (differences) apart.
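To check a barcode set before running bcl2fastq, the rule quoted in the error message can be sketched in a few lines of Python. This is only an illustration of the "2 times the number of mismatches plus 1" rule, not bcl2fastq's actual implementation; the function names `hamming`, `collisions` and `max_safe_mismatches` are made up for this example.

```python
from itertools import combinations


def hamming(a: str, b: str) -> int:
    """Count the positions at which two equal-length barcodes differ."""
    if len(a) != len(b):
        raise ValueError("barcodes must have the same length")
    return sum(x != y for x, y in zip(a, b))


def collisions(barcodes, allowed_mismatches):
    """Return barcode pairs that differ at fewer than 2 * allowed_mismatches + 1 positions."""
    min_distance = 2 * allowed_mismatches + 1
    return [
        (a, b)
        for a, b in combinations(barcodes, 2)
        if hamming(a, b) < min_distance
    ]


def max_safe_mismatches(barcodes):
    """Largest mismatch setting for which no pair of barcodes collides."""
    closest = min(hamming(a, b) for a, b in combinations(barcodes, 2))
    return (closest - 1) // 2


barcodes = ["GACCTGAT", "CAGCTGAT"]

print(hamming(*barcodes))             # 2 differences between the two barcodes
print(collisions(barcodes, 1))        # the pair collides when 1 mismatch is allowed
print(collisions(barcodes, 0))        # no collision when 0 mismatches are allowed
print(max_safe_mismatches(barcodes))  # 0

# The ambiguous read from the example is one mismatch away from both barcodes.
print(hamming("CACCTGAT", "GACCTGAT"), hamming("CACCTGAT", "CAGCTGAT"))  # 1 1
```

For a real run, the barcode list would come from the index column of the sample sheet rather than being typed in by hand.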
    Last edited by r.rosati; 05-27-2019, 02:32 AM.



    • #3
      CAGCTGAT
      CACCTGAT

      GACCTGAT
      CACCTGAT

      Great help, thank you!



      • #4
        Glad to be of help!
        As an afterthought: one can't blame the software for calling both "mismatches". Neither the sequencer nor the software knows the "truth", so they can't tell whether a base is a sequencing error or not. For the software, a mismatch is a mismatch; if it were known that a called base was an error, it wouldn't have been called. Perhaps I should have phrased the two as (1) "mismatch between the read sequence and the barcode sequences" and (2) "mismatch between the two expected barcodes".
        Last edited by r.rosati; 05-27-2019, 05:29 AM.
