Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Miseq Paired-End Cluster Identification

    Hello All,

    I have been scouring the internet to try to find the answer to my question, but to no avail. We are attempting to sequence multiple barcoded 16s paired-end amplicons on the Miseq and to get around the low-diversity issue we plan to stagger the sequencing start position for our runs by 0-5bp (incorporated into our amplicon oligos).

    The question that I have concerns the cluster identification. If the read 1 sequence has the first 4bp at a ratio of 1:1:1:1 for A:T:G:C and the identification of clusters is good, are the values obtained here used for read 2 as well? Or does the same process occur on read 2 with cluster identification... I would assume not since the location on the flow-cell determines the clusters, and doing this twice may give two completely different cluster location "pictures".

    Any insight here would be greatly appreciated! Thanks!

  • #2
    Originally posted by retroant View Post
    Hello All,

    I have been scouring the internet to try to find the answer to my question, but to no avail. We are attempting to sequence multiple barcoded 16s paired-end amplicons on the Miseq and to get around the low-diversity issue we plan to stagger the sequencing start position for our runs by 0-5bp (incorporated into our amplicon oligos).

    The question that I have concerns the cluster identification. If the read 1 sequence has the first 4bp at a ratio of 1:1:1:1 for A:T:G:C and the identification of clusters is good, are the values obtained here used for read 2 as well? Or does the same process occur on read 2 with cluster identification... I would assume not since the location on the flow-cell determines the clusters, and doing this twice may give two completely different cluster location "pictures".

    Any insight here would be greatly appreciated! Thanks!
    Warning--more than just the first four bases of read one are critical, but not for cluster finding. We tried the game of putting four random bases at the start of read 1 based on what we knew about cluster finding. Clusters are found, and are used in read 2, but the calculation of phasing/prephasing depends on bases 1-12 of both reads. Unless you are okay with routinely spiking in 30% OR MORE (current recommendation from Illumina is 60% on Miseq), you wil have to design primers so that the first 12 bases read from both ends are high complexity. Illumina is working on a software fix. What we observed was a good read 1 (with 9 bases of randomness by design at start) and a failed read 2--okay to ~125 bases of 250 and then a precipitous drop off. We had ~20% PhiX included.

    Comment


    • #3
      retroant: See this thread. Check posts #47 onwards. http://seqanswers.com/forums/showthr...t=20455&page=3

      Using a hard-coded matrix (from a good phiX run) seems to help mitigate the problem to some extent. If you are using an upgraded MiSeq (running MCS v.2.0.5) then you can work with your FAS to see if this workaround helps. The quality values are still degraded but you may be willing to accept those specially if you have a good overlap between the two reads from the 2 x 250 bp run.
      Last edited by GenoMax; 10-16-2012, 05:02 AM.

      Comment


      • #4
        Originally posted by GenoMax View Post
        Using a hard-coded matrix (from a good phiX run) seems to help mitigate the problem to some extent. If you are using an upgraded MiSeq (running MCS v.2.0.5) then you can work with your FAS to see if this workaround helps. The quality values are still degraded but you may be willing to accept those specially if you have a good overlap between the two reads from the 2 x 250 bp run.
        Hi GenoMax, could you explain what you mean by degraded quality values? Did you analyze whether the Q values with hardcoded matrix/phasing still are correct?

        Comment


        • #5
          Originally posted by Vinz View Post
          Hi GenoMax, could you explain what you mean by degraded quality values? Did you analyze whether the Q values with hardcoded matrix/phasing still are correct?
          The quality values still dropped rapidly after ~130 cycles on both reads (2 x 250 bp) while using the hard-coded matrix/phasing (average Q30 was about 61%). No phiX was included in samples since this was a custom primer run.

          We have not been able to do a direct comparison of running a set of samples with and without the hard-coded matrix as yet. We should have that data by next week.

          We were not concerned about the quality for run I refer to since we knew that MiSeq has a problem with samples having reduced nucleotide diversity. We did get very good overlap with the two reads in the middle as expected with no errors (disclaimer: based on a few read pairs I checked, this data was for someone else).
          Last edited by GenoMax; 10-19-2012, 03:57 AM.

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Essential Discoveries and Tools in Epitranscriptomics
            by seqadmin




            The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
            04-22-2024, 07:01 AM
          • seqadmin
            Current Approaches to Protein Sequencing
            by seqadmin


            Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
            04-04-2024, 04:25 PM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, Today, 08:47 AM
          0 responses
          10 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-11-2024, 12:08 PM
          0 responses
          60 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 10:19 PM
          0 responses
          57 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 09:21 AM
          0 responses
          53 views
          0 likes
          Last Post seqadmin  
          Working...
          X