Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Demultiplexing softwares

    Hello everyone,

    I used to demultiplexe using bcl2fastq.pl provied by CASAVA.
    I want to try new tools because I got some trouble at the demultiplexing step.
    So I checked on google and I found several softs like TagGD, Flexbar, Sabre, deML etc. but all of them take a FASTQ file in input... And the only way I can have a FASTQ file is demultiplexing using bcl2fastq.pl plus this script removes the index sequence of all reads...
    So how can I demultiplexe something which is already demultiplexed and where the index sequence are removed ?

    I think the only thing I can do is to give an empty SampleSheet to CASAVA, then I suppose it will put all reads together as undetermined reads, but maybe there is a cleaner way to do ?

  • #2
    Can you specify what kind of "trouble" you ran into with bcl2fastq?

    Please keep in mind that if there was a problem with the index reads (multiple N's in the sequence) then no demultiplexing tool is going to help you. This run may have to be repeated.

    Comment


    • #3
      Well I got more undetermined reads that I excepted.

      I'm sure the main problem comes from the run but anyway how can I do if I want to compare these toolts ?

      Comment


      • #4
        Originally posted by ClemBuntu View Post
        Well I got more undetermined reads that I excepted.

        I'm sure the main problem comes from the run but anyway how can I do if I want to compare these toolts ?
        I suspect (not having used the tools you mentioned) that they are focused on demultiplexing user-generated barcodes instead of using the Illumina indexes and thus you will be disappointed in them. As you said you could force everything into undetermined reads but that won't give you the Illumina indexes.

        If you have more undetermined reads than expected then, like GenoMax, I think that you have problems with your indexing reads. You can alleviate some of the problems via the 'use-bases-mask' and 'num_of_mismatches' parameters to Casava.

        Comment


        • #5
          Originally posted by ClemBuntu View Post
          Well I got more undetermined reads that I excepted.

          I'm sure the main problem comes from the run but anyway how can I do if I want to compare these toolts ?
          What fraction of the reads are in the undetermined file?

          Are you recovering all the samples you expect from the run? A common problem is specifying an incorrect barcode for a sample which sends the reads for that sample(s) to the undetermined file.

          Comment


          • #6
            GenoMax has, as usual, great suggestions.

            His "common problem" of specifying an incorrect barcode (which is indeed something we do often enough) should be easy to spot because one of your samples will have close to zero reads.

            The harder problem is if many of the index reads overall have poor quality bases in them. If the problem is base-specific (e.g., occurs only at the 2nd index base) then Casava's 'use-bases-mask' parameter can be used to ignore the specific base. Otherwise 'num_of_mismatches' needs to be used. Sometimes I use both if the reads are really poor.

            Comment


            • #7
              Use the following command to find out what tags are represented in your "undetermined" file. You will generally see a great variation present. Thing to look for is if there are some tags that are way over-represented. Sometime people make mistakes in making libraries/with pooling and you don't quite have the result that you expect from a pool.

              Replace the ? with specific lane number.

              Code:
              $ zcat lane?_Undetermined_L00?_R1_001.fastq.gz | grep @HWI |cut -d: -f10 | sort | uniq -c | sort -r -n -k1

              Comment


              • #8
                The fraction of undetermined reads is not too high, so don't worry about that guys but thanks anyway....


                My question was about demultiplexing tools that take FastQ files as input... I'm just wondering why they exist since Illumina software like bcl2fastq demultiplexe and creats FastQ file.... It seem nobody here use these tools that's very strange !

                As westerman said maybe theses tools are only useful when you have a custom barecode...

                Comment


                • #9
                  Long before the index reads read-out, barcodes were inserted into the reads' sequence (inline barcoding). You had to demultiplex the Fastq file and split it into the different samples according to the first n bases. Therefore, various tools were implemented.
                  Now in times of the index reads, these tools are more or less outdated.
                  I hope that answers your question.
                  Cheers,
                  Michael

                  Comment


                  • #10
                    Oh I see, now that makes sense thanks !

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Strategies for Sequencing Challenging Samples
                      by seqadmin


                      Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                      03-22-2024, 06:39 AM
                    • seqadmin
                      Techniques and Challenges in Conservation Genomics
                      by seqadmin



                      The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                      Avian Conservation
                      Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                      03-08-2024, 10:41 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, Yesterday, 06:37 PM
                    0 responses
                    10 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, Yesterday, 06:07 PM
                    0 responses
                    10 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 03-22-2024, 10:03 AM
                    0 responses
                    51 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 03-21-2024, 07:32 AM
                    0 responses
                    67 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X