  • Lots of chimeric transcripts Ion PGM®

    Hello Bioinformatic community,

    We recently sequenced a bacterial transcriptome on an Ion Torrent 316 chip (1.5 million sequences). After filtering low-quality data and trimming adapters, we noticed that only 51.33% of the sequences mapped to the reference genome. Looking at the unmapped sequences, we can see that most of them are chimeric transcripts, which makes them impossible to map and also biases the results. In addition, many of the unmapped sequences lack homology in the first 20% of the sequence.

    I would like to know your opinion about it.
    Should I move to 454 or Illumina? Our sequencing department has no idea why we have so many chimeras.

    Thank you, Bernardo

  • #2
    Is the read length distribution for the chimeras smooth, or does it have many spikes?



    • #3
      Hi sulfobus, and sorry for the delay in responding,

      The read length distribution could be smoother (see the attached picture).

      The problem shows up in the other FastQC analyses (see the link for the complete FastQC report on the unmapped sequences).

      https://docs.google.com/folder/d/0B8...EzTGVPZFU/edit


      Also, I have overrepresented sequences.

      Bernardo
      Last edited by bernardo_bello; 10-30-2012, 08:52 AM. Reason: Forgot something



      • #4
        We've had chimeric sequences, but they were in distinct groups with the same lengths. From your distribution it looks like most are spread out, although there are two peaks. In our case it was a PCR artefact, stemming from sequence similarity allowing chimeras to form during the plateau phase of the PCR (we used fusion primers to attach adapters). We solved it by running fewer cycles.

        Our application appears to be quite different from yours though, so I don't know if that's of any help.
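
One rough way to answer the "smooth or spiky" question from post #2 is to tabulate read lengths directly from the FASTQ file and flag lengths that are much more common than their neighbours. The sketch below assumes a plain four-line-per-record FASTQ; the function names and the `min_fold` threshold are illustrative choices, not part of any Ion Torrent tool.

```python
from collections import Counter


def read_lengths(fastq_lines):
    """Yield the sequence length of each record in a 4-line-per-record FASTQ."""
    for i, line in enumerate(fastq_lines):
        if i % 4 == 1:  # the sequence is the second line of every record
            yield len(line.strip())


def length_spikes(lengths, min_fold=5.0):
    """Return (length, count) pairs whose count is at least min_fold times
    the mean count of the two adjacent lengths (hypothetical threshold)."""
    counts = Counter(lengths)
    spikes = []
    for length, n in sorted(counts.items()):
        neighbours = (counts.get(length - 1, 0) + counts.get(length + 1, 0)) / 2
        # Floor the neighbour mean at 1 so isolated singletons are not flagged.
        if n >= min_fold * max(neighbours, 1.0):
            spikes.append((length, n))
    return spikes


# Example usage (the file name is a placeholder):
# with open("unmapped.fastq") as handle:
#     print(length_spikes(read_lengths(handle)))
```

Distinct groups of identical lengths, as described in post #4, would show up as flagged lengths; a broad distribution with no flagged lengths points away from that kind of PCR artefact.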



        • #5
          Thank you, sulfobus. I will let you know how this story ends. Ion Torrent technical support is reviewing my data.

          Bernardo



          • #6
            chimeric reads PGM

            I have exactly the same problem with Ion Torrent reads for RNA-Seq: many chimeric reads after bioinformatic analysis with Bowtie2. Did you get an answer from Ion Torrent? I ask because we did not have this problem with Illumina.

            Thank you, Veronique




            • #7
              Hi Veronique,

              When I asked LifeTech NGS Support (http://ioncommunity.lifetechnologies.com/welcome) about this problem, they told me that fusion transcripts can be caused by poor library preparation.

              They also said that BWA is not the recommended mapper for Ion Torrent data. So I tried the recommended one, TMAP https://github.com/iontorrent/TMAP (you can also find it at https://test.g2.bx.psu.edu/), and when I repeated the mapping step I got a magical result: from 30% of reads mapped to 99%. At this point I am happy with this result, but a bit worried about possible bias introduced by the artificial mapping: as I read at https://github.com/iontorrent/TMAP, TMAP splits fusion reads and maps the split reads, but keeps only the longest part of each read and discards the shortest, if I understood correctly.
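
To put a number on that worry, one could count how many mapped reads have a large part of their sequence left unaligned. The sketch below reads SAM text and reports the mapped reads in which at least 20% of the bases are clipped. It assumes the mapper records the discarded part of each read as soft or hard clipping in the CIGAR string, which should be checked against your TMAP settings; the function names and the `clip_threshold` default are illustrative.

```python
import re

CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")


def clipped_fraction(cigar):
    """Fraction of the original read that is soft- (S) or hard- (H) clipped."""
    ops = [(int(n), op) for n, op in CIGAR_OP.findall(cigar)]
    # M, I, S, = and X consume read bases; H bases were part of the original read.
    read_len = sum(n for n, op in ops if op in "MIS=XH")
    clipped = sum(n for n, op in ops if op in "SH")
    return clipped / read_len if read_len else 0.0


def summarize_sam(sam_lines, clip_threshold=0.2):
    """Count primary records, mapped records, and heavily clipped mapped records."""
    total = mapped = heavily_clipped = 0
    for line in sam_lines:
        if line.startswith("@"):  # header line
            continue
        fields = line.rstrip("\n").split("\t")
        flag, cigar = int(fields[1]), fields[5]
        if flag & 0x900:  # skip secondary (0x100) and supplementary (0x800) records
            continue
        total += 1
        if flag & 0x4:  # read is unmapped
            continue
        mapped += 1
        if clipped_fraction(cigar) >= clip_threshold:
            heavily_clipped += 1
    return {"total": total, "mapped": mapped, "heavily_clipped": heavily_clipped}


# Example usage (the file name is a placeholder):
# with open("tmap_output.sam") as handle:
#     print(summarize_sam(handle))
```

If most of the reads that account for the jump to 99% fall in the heavily clipped group, the high mapping rate mainly reflects partial alignments of chimeric reads rather than clean transcripts.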


              We are now sequencing more samples. If I get the same strange result, I will write to LifeTech NGS Support to ask whether these fusion transcripts are biasing my results.

              Regards, Bernardo
              Last edited by bernardo_bello; 01-11-2013, 09:40 AM.



              • #8
                Depending on your library prep method, the percentage of chimeric library fragments will depend strongly on the adapter:insert ratio. Because insert molecules are 5'-phosphorylated, they can be ligated to one another to form concatemers, which is what would happen in the absence of adapters. It is only the relatively high concentration (ratio) of adapters versus inserts that prevents multiple insert molecules from being ligated to one another.

                In our experience, the total molar concentration of adapter (A and P1) relative to insert is optimal at ~10:1. Much lower than 5:1 and you start to see an increase in insert fusions (chimeras). Too high, and you lose ligation efficiency and struggle to remove all unligated adapter in the post-ligation cleanups.
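
The ratio described above can be estimated before ligation from the insert mass and mean fragment length. This is a minimal sketch: it uses the common approximation of about 660 g/mol per base pair of double-stranded DNA, and the function names and example quantities are illustrative rather than taken from any kit protocol.

```python
AVG_BP_MASS = 660.0  # approximate g/mol per base pair of dsDNA


def dsdna_pmol(mass_ng, length_bp):
    """Convert a dsDNA mass in ng to pmol, given the mean fragment length in bp."""
    return mass_ng * 1000.0 / (length_bp * AVG_BP_MASS)


def adapter_insert_ratio(adapter_a_pmol, adapter_p1_pmol, insert_ng, insert_bp):
    """Total adapter (A + P1) to insert molar ratio."""
    return (adapter_a_pmol + adapter_p1_pmol) / dsdna_pmol(insert_ng, insert_bp)


# Hypothetical example: 100 ng of 200 bp insert with 4 pmol each of A and P1.
ratio = adapter_insert_ratio(4.0, 4.0, insert_ng=100.0, insert_bp=200.0)
print(f"adapter:insert = {ratio:.1f}:1")
```

A result well below 5:1 falls in the range where the post above reports an increase in insert fusions.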



                • #9
                  chimeric data

                  Hi Bernardo,

                  Using TMAP is not always a solution: if you need to assemble the data, you can't do it with chimeric reads. The difference in read quality compared with HiSeq data is very large.

                  I also found that 20% of reads map to rRNA even with an mRNA-specific protocol (polyA selection). Did you see the same problem?

                  bye, Veronique

