Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • plumb
    Member
    • Sep 2009
    • 15

    pyrotag process

    I am new to pyrosequencing. We got some 16s rRNA data from sequencing center. It's fasta format file. they give me two set of the data:

    prepared 2 sets of files (zips attached);
    1) reads for each amplicon starting from the FLXA side (> 200 bp)
    2) reads for each amplicon starting from the FLXB side (> 200 bp)


    What's does that mean?

    We are using that to estimate the biodiversity in the sample. which data set I should use?


    They didn't give me quality file either. Is the quality file very important in the quality control step of the data process?
  • krobison
    Senior Member
    • Nov 2007
    • 734

    #2
    I don't have experience with this sort of data, but there is a general approach you can take to puzzle this out and which will help you understand what you have.

    If you have two files of sequences from a sequencing experiment, there are three probable relationships between them

    A) They are uncorrelated
    B) They are both derived from the same input sequence with an unread spacer separating them and in opposite orientation
    C) They are both derived from the same input sequence with an unread spacer separating them, but in the same orientation

    First, take about 20 reads from the top of each file (easy to write a program in your favorite language) and put them in new files.

    Okay, are the identifiers in the two files related to each other in a pairwise fashion? If so, then either B or C is likely.

    Next, use BLASTN to identify best matches for all of these in a database such as nr. Now look at the alignments. Are there pairs of sequences aligning to the same hit? In such pairs, are the hits in the same orientation or opposite orientation?

    Even if you did think you knew what the files SHOULD contain, these steps are a useful QC step.

    Comment

    • plumb
      Member
      • Sep 2009
      • 15

      #3
      figured that out. one set of file is from the forward reading and another is from reverse reading. I need to combine these sequence properly. Otherwise, we may double count the organism.

      Any suggestion for that?

      Comment

      • kmcarr
        Senior Member
        • May 2008
        • 1181

        #4
        Plumb,

        In 454 Amplicon sequencing, the PCR product is created with primers which have two different 454 specific sequences, one called 'A' and one called 'B'. When the PCR products are then attached to the beads for emulsion PCR you have the choice of attaching either by the A primer or the B primer. Which you choose will determine which end of your PCR product will be sequenced. You CAN NOT sequence both ends of the same DNA molecule using the 454 Amplicon method. It appears that your sequencing facility prepared two different emulsion PCR reactions, one using kit A and one using kit B and then sequenced each of those. The reads in these two files are not pairs.

        Comment

        • plumb
          Member
          • Sep 2009
          • 15

          #5
          Dear kmcarr, Thank you for your reply

          In our case, we have one forward and one reverse primers, but these were split into two sets with the
          454 A and B adaptors switched so that sequencing would be done in both directions.

          So, we think one dataset we got starts from the forward primer and one starts from the
          reverse primer. Any suggestion based on those information. Thank you!

          Comment

          • plumb
            Member
            • Sep 2009
            • 15

            #6
            forget my last post.

            n our case, we have one forward and one reverse primers, but these were split into two sets with the
            454 A and B adaptors switched so that sequencing would be done in both directions.

            Based on the above information, how should I merge the data together?

            Comment

            • kmcarr
              Senior Member
              • May 2008
              • 1181

              #7
              Plumb,

              First of all, what is purpose of your experiment? What is the source of your input 16S RNA? Was it a population of a single species for variation analysis or was it an environmental sample for a metagenomics experiment? The purpose of your experiment will define the processing pipeline.

              Comment

              • plumb
                Member
                • Sep 2009
                • 15

                #8
                Originally posted by kmcarr View Post
                Plumb,

                First of all, what is purpose of your experiment? What is the source of your input 16S RNA? Was it a population of a single species for variation analysis or was it an environmental sample for a metagenomics experiment? The purpose of your experiment will define the processing pipeline.

                this is the environmental sample for metagenomics experiment. we want to measure species diversity and detailed community profile.

                Comment

                • kmcarr
                  Senior Member
                  • May 2008
                  • 1181

                  #9
                  Plumb,

                  For this type of experiment (metagenomics by 16S sequencing) there is no point to sequencing the library from both the A and B primers. Since the method does not sequence the same molecule from both ends you can not combine two reads together to come up with one longer, more informative read. Look over the 16S pyrosequencing pipeline information at the Ribosomal Database Project (http://pyro.cme.msu.edu/pyro/help.jsp). They sequence their 16S V4 amplicons from just one end and then use that to assign the read based on alignment to a database of 16S genes. I'm not sure what region of the 16S you amplified but the principle is the same.

                  In your case I would treat the A and B read sets as if they were two different amplicons generated from the same environmental sample and see if the results from the two are consistent in terms of the community profile observed.

                  Comment

                  • HMorrison
                    Senior Member
                    • May 2009
                    • 121

                    #10
                    Dear Plumb and kmcarr,

                    A heads up--anyone who is doing pyrotag sequencing for community composition studies of this sort should let Roche know that they only want to sequence from one or the other end--the company is going to try to package Titanium reagents for bidirectional sequencing only, WHICH IS TOTALLY POINTLESS for this protocol since the reads can't be matched to a starting template molecules (unlike Solexa PE reads). I'm at the MBL, where the original work was done (Sogin et al., 2006), so I am familiar with this protocol and variations. As kmcarr mentions in the last reply, the only value of having an A and a B set would be as consistency check.

                    Comment

                    • 454Sequencing
                      Member
                      • Apr 2009
                      • 22

                      #11
                      Dear HMorrison,

                      I think I can help clarify things. It is true that the upcoming GS FLX Titanium series Amplicon emPCR kits will contain all reagents needed for generating enriched beads for bidirectional sequencing. However, the components supplied include separate 'A' and 'B' side reagents (i.e. independent A and B capture bead tubes and associated primers). This means that the kit will support a variety of sequencing strategies including the unidirectional approach of interest to you. There are sufficient reagents in a single kit to achieve this employing various experimental designs/run formats.

                      If you do not have a copy already, there is a Technical Bulletin on the topic of amplicon sequencing available on the my454 customer site. If you require further assistance please feel free to send a private message or contact your local Roche representative.

                      Best regards,
                      Jason

                      Technical Product Manager
                      454 Life Sciences, a Roche company
                      Technical Product Manager
                      454 Life Sciences, A Roche Company

                      Comment

                      • HMorrison
                        Senior Member
                        • May 2009
                        • 121

                        #12
                        Bidirectional sequencing

                        That isn't what I've been hearing from Tim Harkins. Even if true, we are buying and wasting components we don't need OR we have to completely revise our experimental design.

                        Comment

                        Latest Articles

                        Collapse

                        • SEQadmin2
                          From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                          by SEQadmin2


                          Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                          The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                          ...
                          Yesterday, 10:05 AM
                        • SEQadmin2
                          Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                          by SEQadmin2


                          With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                          Introduction

                          Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                          05-22-2026, 06:42 AM
                        • SEQadmin2
                          Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                          by SEQadmin2

                          Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                          Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                          05-06-2026, 09:04 AM

                        ad_right_rmr

                        Collapse

                        News

                        Collapse

                        Topics Statistics Last Post
                        Started by SEQadmin2, Yesterday, 12:03 PM
                        0 responses
                        17 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, Yesterday, 11:40 AM
                        0 responses
                        13 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 05-28-2026, 11:40 AM
                        0 responses
                        29 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 05-26-2026, 10:12 AM
                        0 responses
                        31 views
                        0 reactions
                        Last Post SEQadmin2  
                        Working...