Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • CLC genomics wokbench and illumina demultiplexing

    Hi there!
    After a miseq Nextera XT run we got a lot of undetermined data (undeterminedbarcode sequences with one mismatch or more). We wouldn't like to throw away so much data , and looking for a possibility to demultiplex sequences with one or more mismatch in the barcode.
    Does CLC genomics workbench have this function? there is an option to process tagged sequences, but can the mismatched barcodes be processed?

    Thank you for any answers!

  • #2
    I am not sure CLC can help since MiSeq reporter apparently will not add the tags to the "undetermined" reads file it produces. I am going by the info provided by dsobral in a recent thread that is in the list below.

    In cases such as this you will need to de-multiplex the MiSeq data using the "Bcl2fastq" software that is available here: http://support.illumina.com/download...tware_184.ilmn. If you are not comfortable using command line tools then you will need to find someone who is reasonably proficient with linux and has access to a linux server.

    You will need:

    1. Full data folder from your MiSeq run
    2. Working install of bcl2fastq (in addition to the illumina link above look at this thread http://seqanswers.com/forums/showthread.php?t=34844) You can allow up to 2 mismatches per tag read.
    3. Example of the SampleSheet.csv you will need to create to run Bcl2fastq is in post #14 in this thread.
    Bridged amplification & clustering followed by sequencing by synthesis. (Genome Analyzer / HiSeq / MiSeq)


    NOTE: If this run was over-clustered (density > 1300-1400 clusters/mm^2 for v.3 reagents) then chances of recovering useful data are slim.

    Comment


    • #3
      Are we talking index or barcode?
      For indecies use CASAVA by Illumina
      For inline barcodes use jMHC

      Comment


      • #4
        Originally posted by GenoMax View Post
        2. Working install of bcl2fastq (in addition to the illumina link above look at this thread http://seqanswers.com/forums/showthread.php?t=34844) You can allow up to 2 mismatches per tag read.
        bcl2fastq allows, just as CASAVA before, exactly one or zero mismatches in index recognition.

        Comment


        • #5
          Originally posted by Etherella View Post
          Hi there!
          After a miseq Nextera XT run we got a lot of undetermined data (undeterminedbarcode sequences with one mismatch or more). We wouldn't like to throw away so much data , and looking for a possibility to demultiplex sequences with one or more mismatch in the barcode.
          Does CLC genomics workbench have this function? there is an option to process tagged sequences, but can the mismatched barcodes be processed?

          Thank you for any answers!
          As GenoMax has already pointed out, it is possible to get the "undetermined indices" when demultiplexing with CASAVA/bcl2fastq (no idea why Illumina does not write the index sequences in the header for the miseq undet files).

          But maybe it is enough if you just ask your sequence provider to run demultiplexing with one mismatch?

          Comment


          • #6
            To my knowledge,
            CLC does demultiplexing only for in-line barcodes, not for barcodes in separate barcode reads. CLC assumes that such de-multiplexing is being done by the Illumina system software. It is relatively easy to do demultiplexing with some scripts tolerating one (examples are already mentioned) or more mismatches (there certainly are better options, but we have some quick and dirty script if desired).
            Last edited by luc; 02-25-2014, 01:55 PM.

            Comment


            • #7
              Hi,

              Does anyone have perl or pythogn script that can pull out Reads (Forward) from R1 file and corresponding pair (Reverse) from R2 file. CLC workbench does give paired sequence, but as mentioned by luc it looks for inline barcodes.
              I want some script that works alike and tolerate some mismatch. i would also expect it looks for barcode in seperate barcode reads.

              Many thanks

              Comment


              • #8
                Hi Bioinform,

                The allPrep-8.py script out of barcode-tools set, will do what you want and more.
                When using the "-D" it will only demultiplex ( and not do adapter or quality trimming).

                Comment


                • #9
                  More options:
                  jmhc
                  fastx_barcode_splitter

                  But I have a question, is there any software that detects also insertions/deletions in the barcodes? I want to use something to repair Ion Torrent barcodes but the software above only detects mismatches

                  Comment

                  Latest Articles

                  Collapse

                  • seqadmin
                    Advancing Precision Medicine for Rare Diseases in Children
                    by seqadmin




                    Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
                    12-16-2024, 07:57 AM
                  • seqadmin
                    Recent Advances in Sequencing Technologies
                    by seqadmin



                    Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

                    Long-Read Sequencing
                    Long-read sequencing has seen remarkable advancements,...
                    12-02-2024, 01:49 PM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by seqadmin, 12-17-2024, 10:28 AM
                  0 responses
                  33 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 12-13-2024, 08:24 AM
                  0 responses
                  48 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 12-12-2024, 07:41 AM
                  0 responses
                  34 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 12-11-2024, 07:45 AM
                  0 responses
                  46 views
                  0 likes
                  Last Post seqadmin  
                  Working...
                  X