Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Chimeric reads

    Hello, my previous post seems very quiet. Here I raise this question in a different way.

    I use Bowtie 2 to find many 454 reads are mapped >1 times. I believe these are chimeric reads due to the use of multiple stand displacement for initial amplification of minute amount DNA template. Is anyone aware of some tools that can split chimeric reads into single non-chimeric reads? Thanks a lot.

  • #2
    Doesn't it just mean that the read comes from a region of the genome that is repeated? So it aligns to all copies of the repeat?

    Comment


    • #3
      Thanks. In fact, I know the problem is reads rather than the reference genome. I believe most reads are chimeric.

      Comment


      • #4
        UCHIME claims to be able to do some de novo chimera detection, or use bits of your known reference:

        Comment


        • #5
          Trost, Thanks your point. I made a search prior to the post, which found UCHIME. I didn't investigate it in detail because I guess UCHIME may need a high coverage, such as PCR amplicons. However, my RNA-Seq data is from human blood that is complicated with RNA from human genes, virus and fungi etc at low coverage.

          One way I thought about is to split reads into 2 parts and then see if they turn to be single mapped reads. This spit can be done easily with current scripts but I worry about potential lose of information. To my knowledge, there is really no way to handle this issue.

          I am using 454 and I think I should switch to Illumina. Short reads might be helpful.

          Comment


          • #6
            We have recently sequenced a bacterial transcriptome with 316 chip from IonTorrent (1.5 million sequences). After filtering low quality data and trimming adapters we noticed that only 51.33% sequences were mapped on reference genome. Looking for the unmapped sequences we can see that most of them are chimeric transcripts, so impossible mapping for them an also causing bias on results. Also many of the unmapped are sequences lacking homology in 20% of the starting sequence.

            I would like to know your opinion about it.
            Should I have to move to 454 or Illumina? Our Sequencing Department have no idea of why we have so many chimeras.

            Thank you, Bernardo

            Comment


            • #7
              To answer your question, I need to know 1) is there any amplification step prior to the sequencing? 2) what's the aligner you used for mapping.

              Comment


              • #8
                Hello fanx,

                Thank you for your response.

                1) Yes, there is a PCR step. We have used the hole transcriptome procedure described here 'Ion Total RNA-Seq Kit v2'

                2) I used BWA for Illumina in Galaxy with default settings.

                Comment


                • #9
                  1, If there is a PCR step, chimeric reads are not unexpected. Many polymerases, especially those assuming high fidelity, have strand displacement activity. The occurrance of chimeric reads depends on both polymerases and protocols.

                  2, Some chimeric reads may not be authenic ones. In this situation, I usually increase mapping stringency and found many of "chimeric" reads became single-hit ones.

                  3, For remaining "true" chimeric reads, there are 2 ways to go. One is just to discard them (as shown in many previous publications where this issue is largely ignored). If your data has a profound depth, I don't think this will affect/bias your final result. The other way is to extract these chimeric reads only and do some trimming, re-aligned to see whether if they become single-hit reads, and finally combine all single hit reads for downstream analysis.

                  4, Finally, I assume you already done quality control prior to the align.

                  Comment


                  • #10
                    >1, If there is a PCR step, chimeric reads are not unexpected. Many polymerases, especially those assuming high fidelity, have strand displacement activity. The occurrence of chimeric reads depends on both polymerases and protocols.

                    Ok, I think I'm wasting money.

                    >2, Some chimeric reads may not be authentic ones. In this situation, I usually increase mapping stringency and found many of "chimeric" reads became single-hit ones.

                    I'm mapping only >Q20 reads. I've seen were and how they are mapping and they have 100% similarity in both hits. Sometimes there are three hits for one read.

                    >If your data has a profound depth, I don't think this will affect/bias your final result.

                    I have 800.000 sequences mapped to a 2 Mb prokaryotic genome. My mean read length is about 150 bp.

                    >The other way is to extract these chimeric reads only and do some trimming

                    I would like to finish my PhD, trimming is not feasible! So many chimeras.

                    >4, Finally, I assume you already done quality control prior to the align.

                    Of course, only >Q20 and trimming low quality 3' region.

                    Thank you, Bernardo

                    Comment


                    • #11
                      Sorry, I forgot to say that I have finally 25% mapping sequences (of 3 million). My reference sequence is a draft genome in 47 segments.

                      Comment


                      • #12
                        FASTQC of unmapped reads

                        If you give me your email I can send you FASTQC output of unmapped reads to know your opinion.


                        Bernardo

                        Comment


                        • #13
                          aligner from sanger

                          This aligner seem to address the issue of chimeric reads: http://www.sanger.ac.uk/resources/software/smalt/
                          I personally didnt try this yet but will try it next time I need to align something.

                          Comment


                          • #14
                            Originally posted by jshaik View Post
                            This aligner seem to address the issue of chimeric reads: http://www.sanger.ac.uk/resources/software/smalt/
                            I personally didnt try this yet but will try it next time I need to align something.
                            Thanks, seems there is not an associated publication for SMALT. There is?

                            Comment


                            • #15
                              No there is no publication associated with it. But people have compared it with other aligners in their works.

                              Comment

                              Latest Articles

                              Collapse

                              • seqadmin
                                Current Approaches to Protein Sequencing
                                by seqadmin


                                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                                04-04-2024, 04:25 PM
                              • seqadmin
                                Strategies for Sequencing Challenging Samples
                                by seqadmin


                                Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                                03-22-2024, 06:39 AM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by seqadmin, 04-11-2024, 12:08 PM
                              0 responses
                              30 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-10-2024, 10:19 PM
                              0 responses
                              32 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-10-2024, 09:21 AM
                              0 responses
                              28 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-04-2024, 09:00 AM
                              0 responses
                              52 views
                              0 likes
                              Last Post seqadmin  
                              Working...
                              X