Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Strange Illumina library artifact.

    We ran some "RAD" (Restriction site Associated DNA) tag libraries on our Illumina HiScanSQ. The libraries were not constructed by us, but the protocol identifies them as:

    Adapted from “Sequenced RAD Markers for Rapid SNP Discovery and Genetic Mapping”, Paul D. Etter (University of Oregon) and modified by Michael R. Miller (University of Oregon)
    Briefly genomic DNA is digested with a restriction enzyme (SbfI, an 8-cutter that generates a 4 base 3' overhang). "F" adapters are ligated to the ends of the digested DNA utilizing the 3' overhang. Then the DNA is sonicated to a reasonable size, end-polished, A-tailed then ligated to the "R" adapter. Enrichment PCR, then gel size selection.

    The "F" adapter contains a 5 bp, in-line, index -- read prior to the remnant of the SbfI site. Since these are Illumina libraries it is important to get those 1st 5 bases randomized, so by pooling libraries that is accomplished.

    Now the artifact. See below:



    Lanes 2 and 3 show this artifact, whereas it is absent or much diminished in lanes 1 and 4. Note that the 6 remaining bases of the SbfI site are clearly visible in lanes 1 and 4, whereas the sequence upstream and downstream is nicely randomized. However, only about 1/2 the reads appear to share the SbfI site in lanes 2 and 3. The marauding sequence is identical to the 20 nt "reverse" primer used during enrichment PCR, CGTATGCCGTCTTCTGCTTG. So that would make it, what, a flow cell oligo?

    Of note: these samples cut from an agarose gel. They were 600-750 bp. But upon denaturation and running on an Agilent pico RNA chip, modest amounts of smaller fragments are visible. (See figure above.) The inset has a zoomed-in view of the putative culprits. Of note is that they are not present at high concentrations compared to the main peak. But nevertheless, in two of the samples (1083 and 1084 in lanes 2 and 3), look to be consuming about 1/2 of the sequence being generated.

    An earlier run had larger amounts of these and was nearly ruined by them. qPCR gave low estimates of their molar concentration (because of their low molecular weight) and so the lanes were tending towards over-clustering. Worse, the dimer predominated enough that it appeared to be interfering in cluster registration -- leading to low pass filter percentages.

    Just an FYI, really. Primer dimers, once they show up during enrichment PCR can be the devil to get rid of. They can anneal to the main library amplicons and thwart double-stranded size selection.

    --
    Phillip

  • #2
    Hi Phillip,

    We've been experiencing this issue too when we did RNAseq (lower input sequencing). We can clearly read primer dimers in base-pair composition display. But a bit surprise that you did "cutting out" your fragments.

    I also reckoned that although there's relatively lower concentration primer-dimer, it would affect the run significantly - so, I simply thought lower but smaler fragments has higher molarity thereby resulting in more severe than I expected. After this issue, we regularly perform dissociation analysis in qPCR to find whether samples include primer dimers or not. Sometimes, they are not shown in bioA clearly or even if one or two bioA wells miss them, there would be no way to sort them out.

    Comment


    • #3
      A couple of points:

      (1) Just to be clear, the main issues is that primer dimers share substantial amounts of sequence with the adapters of your library molecules. So one strand of a primer dimer molecule can anneal to the adapter of a good amplicon after a denaturation step. Once annealed to a longer molecule, they won't be removed by cutting a band of the correct size out of a gel: they will be included in the "correct size" fraction. Just a down-side to running DNA double-stranded. Nor will they be visible to dsDNA assays, probably.

      (2) Do dissociation assays really allow you to detect primer dimers? I thought that assay was designed for more traditional qPCR experiments where a single gene is being amplified. A single gene would have a particular dissociation profile and should produce a single peak.

      However libraries comprise a vast variety of molecules and, one could imagine situations in which they would be bimodal.

      --
      Phillip

      Comment


      • #4
        I think we are having a similar problem. The library size distribution looks great on the Tapestation and then we end up sequencing lots of the PCR primer dimers (or short inserts in general). Did you ever track down the cause of this?
        * Too high concentration of input primers?
        * Too many rounds of PCR (overamplified)?
        * Too little input to the enrichment PCR?

        Any solutions that you have come up with? Thanks!

        Comment


        • #5
          Bioanalyser trace indicates presence of more primer/adapter-dimers (fragments <200 bp) for samples in lanes 2 and 3 and accordingly they have more clusters with dimers. They also mask the restriction site overhang sequences for those samples which are obvious for samples in lanes 1 and 4. High percentage of A residue in lanes 2 and 3 is another indicator that short fragments have run into flow cell oligo lawn. General wavy shape of sequencers in all lanes is indicator of low diversity.

          @ipeikon, If you use D1000 ScreenTape you may not see the low concentration of dimers, try running them on HSD1000. Unused primers should be cleaned properly after PCR. If there is a large amount of dimers then two clean ups should reduce numbers significantly. To find a specific reason for your libraries issue more information is needed.

          Comment


          • #6
            If libraries are overamplified and the primer concentration drops then you can get interactions between adapters that make the fragment analyzer results look like there are not short fragments. One way to check is to take an aliquot of your PCR product and do a new PCR reaction for a cycle and check that product on the fragment analyzer as well. If you now see short fragments then primer exhaustion plus adapter interactions is the reason and you need to cut back on the cycle number or do a quick second round of PCR.
            Providing nextRAD genotyping and PacBio sequencing services. http://snpsaurus.com

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM
            • seqadmin
              Strategies for Sequencing Challenging Samples
              by seqadmin


              Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
              03-22-2024, 06:39 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            30 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            32 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 09:21 AM
            0 responses
            28 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-04-2024, 09:00 AM
            0 responses
            52 views
            0 likes
            Last Post seqadmin  
            Working...
            X