Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • recommendations on multiplex?

    Hi,
    I'm looking for recommendations on/your experiences with Illumina multiplexing. I plan to sequence 11 isolates for assembly against an existing reference genome, and would like to minimize the run cost by multiplexing them. Illumina offers a 12-sample multiplexing kit, but I've also looked at the paper by Cronn et al using 3 bp tags. The 3 bp tags used there don't seem as appealing to me because a single error can cause misidentification. Can anyone here advise on what they've found to be the best multiplexing method? Go with the kit or use custom 4-6 nt barcodes?

    I've seen the thread on the google Solexa group (http://groups.google.com/group/solex...1ba2313d65332c) , but that is dated last fall before the kit came out.

  • #2
    Recommendations on multiplex - update

    Hi Greigite -

    Our lab group has been using the same, simple 3' barcode method outlined in the Cronn et al. paper, only with slightly longer barcodes (3 barcode nucleotides + 1 terminal "T" = 4 bp), and the paired-end adapter sequences described in Bentley et al. (Nature, 2008).

    We designed 20 of the 64 possible combinations, and we routinely use them for 6-8 plex reactions. We've even run multiplexes as high as 16-plex four different times. I don't have the updated numbers, but at last check we've run over 200 separate samples at an average multiplex level of 7X. Here are the exact results from a recent multiplex:

    sample a = 16-plex; 85% reads correctly tagged
    sample b = 16-plex; 87% reads correctly tagged
    sample c = 14-plex; 86% reads correctly tagged

    We do tend to lose about 10-15% of the reads (for a variety of reasons) using the home-made kit. We have not tested the Illumina barcoding kit, but I imagine their product is sufficiently polished that this is less of an issue.

    The method definitely works for single-end reads (successful results with 36 to 60 bp reads) and paired-end reads (36 x 36).

    Comment


    • #3
      Does anyone have any advice on the bioinformatics side of dealing with barcoded data? Specifically, we are using the Illumina Pipeline 1.3.2. The obvious simple solution is to separate the reads by grepping for the code a the start of the read and feeding each separately into ELAND/MAQ/etc. Does anyone have any other suggestions or tips for modifying the pipeline to handle this? Thanks.

      Comment


      • #4
        Thanks, Treebeard. I am thinking of using the 6 bp (actually 5 bp plus the overhanging T) described in the Craig et al paper to get a better check on per-base error rates. How do you purify your oligos (desalt, HPLC)? Do you see any tag-specific underrepresentation?

        Originally posted by treebeard View Post
        Hi Greigite -

        Our lab group has been using the same, simple 3' barcode method outlined in the Cronn et al. paper, only with slightly longer barcodes (3 barcode nucleotides + 1 terminal "T" = 4 bp), and the paired-end adapter sequences described in Bentley et al. (Nature, 2008).

        We designed 20 of the 64 possible combinations, and we routinely use them for 6-8 plex reactions. We've even run multiplexes as high as 16-plex four different times. I don't have the updated numbers, but at last check we've run over 200 separate samples at an average multiplex level of 7X. Here are the exact results from a recent multiplex:

        sample a = 16-plex; 85% reads correctly tagged
        sample b = 16-plex; 87% reads correctly tagged
        sample c = 14-plex; 86% reads correctly tagged

        We do tend to lose about 10-15% of the reads (for a variety of reasons) using the home-made kit. We have not tested the Illumina barcoding kit, but I imagine their product is sufficiently polished that this is less of an issue.

        The method definitely works for single-end reads (successful results with 36 to 60 bp reads) and paired-end reads (36 x 36).

        Comment


        • #5
          Multiplexing strategy

          This is right up my alley so this seems like a good opportunity to make my first post on this board.

          An elegant way to design error correcting bar codes can be found in Hamady et al. Nature Methods v5 p235-237 2008. But this was developed for 454 pyrosequencing and uses 8 bp tags that were a bit too long for our Solexa/Illumina application.

          I opted for a simpler scheme that identified the sample in the first part of the barcode and then adds a "checksum" base that allows you to detect any single mutations in the barcode and then throw out the entire sequence rather than assigning the sequence to the wrong sample.

          I assigned A = 0, C = 1, G = 2, T = 3 and used base 4 arithmetic. For example, the sample ID 94 in base 10 becomes 01132 in base 4 (0x256 + 1x64 + 1x16 + 3x4 + 2x1 = 94) which converts to ACCTG. The sum of each of these 5 digits is 1+1+3+2 = 7 (in base 10) and the remainder when divided by 4 is 3. This remainder is converted to a base (3 = T) and added to the end of the 5 bp portion. Thus sample id 94 would be converted to ACCTGT and any single mutation to any of these 6 bases will create a checksum base that doesn't match the rest of the barcode.

          These barcode sequences can be generated by a simple python script. The barcodes can be decoded and verified using an equally simple script and then the sequences with valid barcodes can be separated by which sample they belong to.

          This 6 bp scheme can be used with up to 4^5 = 1024 samples. We have used it to great success for multiplexing 96 samples and we are planning to test it with 384 multiplexed samples. This barcode with checksum base can be shortened if you don’t need to multiplex as many samples. For example, a 3 bp barcode could multiplex up to 16 samples, a 4 bp barcode could multiplex up to 64 samples and a 5 bp barcode could multiplex up to 256 samples.

          Originally posted by treebeard View Post
          Hi Greigite -

          Our lab group has been using the same, simple 3' barcode method outlined in the Cronn et al. paper, only with slightly longer barcodes (3 barcode nucleotides + 1 terminal "T" = 4 bp), and the paired-end adapter sequences described in Bentley et al. (Nature, 2008).

          We designed 20 of the 64 possible combinations, and we routinely use them for 6-8 plex reactions. We've even run multiplexes as high as 16-plex four different times. I don't have the updated numbers, but at last check we've run over 200 separate samples at an average multiplex level of 7X. Here are the exact results from a recent multiplex:

          sample a = 16-plex; 85% reads correctly tagged
          sample b = 16-plex; 87% reads correctly tagged
          sample c = 14-plex; 86% reads correctly tagged

          We do tend to lose about 10-15% of the reads (for a variety of reasons) using the home-made kit. We have not tested the Illumina barcoding kit, but I imagine their product is sufficiently polished that this is less of an issue.

          The method definitely works for single-end reads (successful results with 36 to 60 bp reads) and paired-end reads (36 x 36).

          Comment


          • #6
            Great posts. Wraithnot and treebeard, what's been your experience with read representation between the different barcodes? Is the difference between reads per barcode within a 2-fold? 5-fold?

            Comment


            • #7
              Originally posted by mhc View Post
              Great posts. Wraithnot and treebeard, what's been your experience with read representation between the different barcodes? Is the difference between reads per barcode within a 2-fold? 5-fold?
              Our experiments use 96 separate barcodes. If you aren't careful about normalizing the concentrations of each sample when you combine them you can get wildly different amounts of each barcoded sample. For a well behaved experiment the most represented barcode was represented less than two-fold higher than the median, and the 10% percentile barcode was represented about 2-fold less than the median. The very bottom of the distribution had a discontinuity and the least represented barcode was 20-fold down from the median. This might be due to an oligo quality issue for that particular barcode.

              Comment


              • #8
                It's a bit of a side issue to this discussion, but I've gotten some different advice on necessary purification levels for home-made adapters. One core I talked to uses HPLC for the adapter with the phosphorothioate bond to ensure a higher frequency of ligation. Another person uses desalt for all adapters, and the Quail et al Sanger paper uses HPLC for all theirs. Just FYI. I have no experience yet myself.

                Comment


                • #9
                  Originally posted by wraithnot View Post
                  Our experiments use 96 separate barcodes. If you aren't careful about normalizing the concentrations of each sample when you combine them you can get wildly different amounts of each barcoded sample. For a well behaved experiment the most represented barcode was represented less than two-fold higher than the median, and the 10% percentile barcode was represented about 2-fold less than the median. The very bottom of the distribution had a discontinuity and the least represented barcode was 20-fold down from the median. This might be due to an oligo quality issue for that particular barcode.

                  Wraitnot, I am very interested in your barcoding results. I would like to talk you offline about this if possible. I've sent you a private message with my contact information.

                  Thanks,
                  Wade

                  Comment


                  • #10
                    This thread is a few weeks old and I have been away from the board for a while but I wanted to reply and make an offer for any interested labs to try out our indexing methods. In our hands, we are seeing at least 98% efficiency in the parsing of the final reads to the correct barcodes so it is working really well. We also have some software that goes with the adaptors to correctly parse the final data. We are routinely doing a 12-plex but the designs are finished for 96-plex. We just haven't had a project that needs that high a multiplex.

                    The method has all the usual features such as error correction (SECDEC to be specific). Reply or PM me if interested. It would be great to compare results with a few different methods/designs.
                    HudsonAlpha Institute for Biotechnology
                    http://www.hudsonalpha.org/gsl

                    Comment


                    • #11
                      Hi,
                      There's a demultiplexing tool, novobarcode, in the Novoalign package that's free to use. It can take a file of index tags and distance (bp differences) between tags and then demux the reads into individual files. The current program expects the index tag to be part of the read rather than Illumina format in the header which should suit your custom tags. A new version is coming that supports Illumina format.
                      Alignment of index tags and reads uses bases qualities and allows some differences depending on the distance setting and a quality threshold.
                      Colin

                      Comment


                      • #12
                        Illumina tags

                        Hello,
                        I'm new to this site, but I wanted get feedback on a couple of ideas that our lab is tossing around.
                        1) Illumina's multiplexing adapters contain ID tags on the end of the fragment. Does this lead to any significant loss of the tag or misreading of the tag during sequencing?
                        2) We wondered if anyone has ever moved the tag so that it would essentially be the first 3/4 base pairs of the read? This would cut down on the read length, but it would make tags that will not be lost and could bypass any additional cost that sequencing centers charge for multiplexing. The decoding of the tags would be done by our computational people, thus reducing the cost.

                        I'd appreciate feedback or suggestions. Thanks.

                        Comment


                        • #13
                          Multiplexing with custom adapters

                          Originally posted by PUGenomeLab View Post
                          Hello,
                          I'm new to this site, but I wanted get feedback on a couple of ideas that our lab is tossing around.
                          1) Illumina's multiplexing adapters contain ID tags on the end of the fragment. Does this lead to any significant loss of the tag or misreading of the tag during sequencing?
                          2) We wondered if anyone has ever moved the tag so that it would essentially be the first 3/4 base pairs of the read? This would cut down on the read length, but it would make tags that will not be lost and could bypass any additional cost that sequencing centers charge for multiplexing. The decoding of the tags would be done by our computational people, thus reducing the cost.

                          I'd appreciate feedback or suggestions. Thanks.
                          Hi,

                          Re. the second part of this post, various groups (e.g. Cronn et al., NAR 2008; Harismendy & Frazer, Biotechniques 2009) have used custom adapters incorporating the barcode so that it becomes part of the read at the 5' end. In a bid to get my own head around the various stages, I've adapted the excellent figure posted by greigite (in the Tech Summary: Illumina's Solexa Sequencing Technology thread) to include some details of the approach used by Cronn et al., as well as a generic strategy to follow through the PCR and sequencing steps (any errors, please let me know!) - hope this is useful.

                          Also, check out the other posts in this thread for great ideas re. barcode design and bioinformatic processing of sequences.

                          Exeplex
                          Attached Files

                          Comment


                          • #14
                            Library-free addition of index sequences in circularized MIP captured sequences?

                            I am trying to come up with a library-free way to introduce index (barcode) sequences into circularized captured genome fragments (Molecular Inverted Probes – MIP) that are about 250 nt long. The MIP designs are described in Porreca et al. 2007 and in Turner et al 2009:
                            http://www.nature.com/nmeth/journal/...nmeth1110.html
                            http://www.nature.com/doifinder/10.1038/nmeth.f.248

                            The authors ran their 16 samples in separate lanes, therefore did not need to index. The 100-mer oligo MIP designs has a 30 nt common linker sequence, so that one can do inverse pcr using Illumina paired end primers directed against this common linker and then load into flowcell directly for cluster generation.

                            I like the approach a lot, but couldn’t yet find a way to append index sequences without the standard library prep using adapters described in Cronn et al. or Craig et al. mentioned in this thread.

                            Any thoughts?

                            Comment


                            • #15
                              Library-free addition of index sequences in circularized MIP captured sequences?

                              Originally posted by ShiveringFire View Post
                              I am trying to come up with a library-free way to introduce index (barcode) sequences into circularized captured genome fragments (Molecular Inverted Probes – MIP) that are about 250 nt long. The MIP designs are described in Porreca et al. 2007 and in Turner et al 2009:
                              http://www.nature.com/nmeth/journal/...nmeth1110.html
                              http://www.nature.com/doifinder/10.1038/nmeth.f.248

                              The authors ran their 16 samples in separate lanes, therefore did not need to index. The 100-mer oligo MIP designs has a 30 nt common linker sequence, so that one can do inverse pcr using Illumina paired end primers directed against this common linker and then load into flowcell directly for cluster generation.

                              I like the approach a lot, but couldn’t yet find a way to append index sequences without the standard library prep using adapters described in Cronn et al. or Craig et al. mentioned in this thread.

                              Any thoughts?
                              I haven't tried this, but in principle you should be able to carry out direct indexed sequencing from MIP-derived amplicons by incorporating a barcode into the middle of the reverse primer (between the linker-specific sequence and the flowcell oligo annealing sequence). You can then mimic the Illumina multiplex/indexing strategy by carrying out a 2nd short sequence read from the reverse primer to read the index - see the attached PDF for details (you might need to lengthen the common linker sequence slightly so as to make a longer primer for the index read).
                              The example shown is based on the strategy of Turner et al, who used single-end sequencing. The indexing strategy should work equally well for paired-end reads (carrying out the index read after the first sequencing reaction, before 'flipping' the product on the PE module and performing the reverse read of the captured sequence, as in the Illumina multiplex protocol), but you would just need to bear in mind that the flow cell oligos C & D on the PE flow cell are slightly longer, so you would need to adjust the adapter/primer tails accordingly.
                              Attached Files

                              Comment

                              Latest Articles

                              Collapse

                              • seqadmin
                                Strategies for Sequencing Challenging Samples
                                by seqadmin


                                Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                                03-22-2024, 06:39 AM
                              • seqadmin
                                Techniques and Challenges in Conservation Genomics
                                by seqadmin



                                The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                                Avian Conservation
                                Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                                03-08-2024, 10:41 AM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by seqadmin, Yesterday, 06:37 PM
                              0 responses
                              8 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, Yesterday, 06:07 PM
                              0 responses
                              8 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 03-22-2024, 10:03 AM
                              0 responses
                              49 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 03-21-2024, 07:32 AM
                              0 responses
                              66 views
                              0 likes
                              Last Post seqadmin  
                              Working...
                              X