Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • 3' Bias in RNA-Seq

    Would anyone happen to know what the specific factor in cDNA fragmentation is that causes the 3' bias? It seems I've read a few papers that mention the bias, but don't go into explaining why? Thank you!

    J

  • #2
    Moving to RNA-seq.

    Comment


    • #3
      If you are doing PolyA enrichment that would be the source of 3' bias. I don't know if there should be bias if not doing this (say, ribodepletion or dsn from total RNA).

      Comment


      • #4
        Originally posted by irit View Post
        If you are doing PolyA enrichment that would be the source of 3' bias. I don't know if there should be bias if not doing this (say, ribodepletion or dsn from total RNA).
        Hi, irit. No doubt that polyA enrichment sounds like a perfect culprit. What are your thoughts on cDNA fragmentation though? From "RNA-Seq: a revolutionary tool for transcriptomics":

        "Conversely, cDNA fragmentation is usually strongly biased towards the identification of sequences from the 3' ends of transcripts, and thereby provides valuable information about the precise identity of these ends."

        Comment


        • #5
          In our protocol, the cDNA isn't fragmented, the RNA is.

          Comment


          • #6
            I'd have thought polyA selection would have been the main source of 3' bias. Maybe also fragmentation of cDNA. On 454 at least, cDNAs of a certain size range don't nebulize well. This would bias reads of those fragments towards the transcript ends. If I remember correctly, this is no longer a problem with newer more rapid methods for non-normalized cDNA sequencing. For full-length, normalized cDNA sequencing (on 454), coligation (before nebulization) probably helps reduce the bias. Not sure if fragmentation produces bias on illumina as I'm not familiar with the library prep. I do know that we also have our cDNA coligated before nebulization for subsequent RNAseq on HiSeq2000.

            someone correct me if I'm wrong.

            Comment


            • #7
              Originally posted by VanessaS View Post
              In our protocol, the cDNA isn't fragmented, the RNA is.
              Hi, Vanessa. So, would you say it's definitely a result of the poly(A) selection?? Thanks!

              Comment


              • #8
                Originally posted by JohnK View Post
                Hi, Vanessa. So, would you say it's definitely a result of the poly(A) selection?? Thanks!
                I'm not qualified to say anything is definite, other than its not from shearing the cDNA. Maybe someone can comment on the kinds of biases, if any, introduced from enzymatic shearing of the RNA? We use RNAse III. I thought it was non-specific so no bias?

                Comment


                • #9
                  Originally posted by VanessaS View Post
                  I'm not qualified to say anything is definite, other than its not from shearing the cDNA. Maybe someone can comment on the kinds of biases, if any, introduced from enzymatic shearing of the RNA? We use RNAse III. I thought it was non-specific so no bias?
                  Hey, Vanessa. Not so much the fragmentation of the RNA, but the poly(A) purification step, which you might expect to definitely generate a 3' bias as someone stated above... What do you think of that?

                  Comment


                  • #10
                    Consider RNA stability issues too. Combined with poly(A) selection, this can result in a dramatic enrichment of terminal fragments.

                    Comment


                    • #11
                      Another contributor (depending on how you isolate and prepare your RNA) would be priming with oligo dT for the reverse transcription.

                      Comment


                      • #12
                        Also, the 5' cap on mRNA stabilizes the 5' end of RNA over the 3' end.

                        Comment


                        • #13
                          I was wondering if other people were also noticing a 3' bias in their Illumina prepped samples for RNA-Seq using the 8-sample bead-based poly-A selection kit. For shorter transcripts (< 6 kb or so) I do not notice any 3' bias but for longer transcripts there is definitely a fairly severe bias towards the 3' end of the transcript. I also see a peak at the 5' end of the longer transcripts as well which makes me think it is not due to degradation-- is that a reasonable thought? My total RNA looked great as assayed on a Bioanalyzer, but I'm not sure if that is true of the mRNA step, I wasn't sure how to check that. I visualized the size of the fragmented RNA by converting to cDNA and running on a gel and saw a fairly broad smear, so at least I know the entire mRNA library was not degraded. I've done hundreds of total RNA preps without RNAse contamination, it's hard for me to imagine that somehow I am introducing RNAse during the poly-A selection but this 3' bias sure does look like that is exactly what has happened.

                          Could the 3' bias in longer transcripts, as people have suggested above in the thread, simply be a byproduct of the poly-A selection? There are a couple of questionable vortexing steps in the Illumina protocol; I'm not worried that vortexing alone would shear the RNA since I do it as a standard part of my total RNA prep, but they do have you vortex the RNA-bound beads briefly. Could that be shearing the RNA? The flopping ends of the RNA banging into those beads, would that shear the longer transcripts? If so, why do I see a peak at the 5' end too? The 5' peak is about half the size of the 3' peak.

                          I visualized the 3'-5' bias using Simon Andrews' excellent SeqMonk visualizer, using view->probe trend plot on a probe list of mRNA. The 3' bias is there if I look at a probe list of the CDS as well. It is not there if I cut the annotations up into single exons and run. It is also much, much less pronounced (think about 10% difference) if I look at single exons which are > 5kb and there is no spike at both ends of the probe list. Am I maybe not understanding what the probe-trend plot is showing me?

                          Comment


                          • #14
                            Originally posted by roryk View Post
                            I visualized the 3'-5' bias using Simon Andrews' excellent SeqMonk visualizer, using view->probe trend plot on a probe list of mRNA. The 3' bias is there if I look at a probe list of the CDS as well. It is not there if I cut the annotations up into single exons and run. It is also much, much less pronounced (think about 10% difference) if I look at single exons which are > 5kb and there is no spike at both ends of the probe list. Am I maybe not understanding what the probe-trend plot is showing me?
                            Rory - glad to hear you're liking SeqMonk!

                            If you've put probes over mRNA features and then done a trend plot then the peak you see at the end might not be due to true 3' bias.

                            Different transcripts will have exons at different places along their length. Therefore the trend plot for any individual transcript will go up and down as you pass in and out of an exon. If you average over all transcripts then you'll see the combined signals from all of the transcripts doing this which will even itself out for the most part - however the only places you're guaranteed to be in an exon are at the beginning and end of each transcript, so a trend plot over all transcripts will probably show a peak at each end because of the higher probability of being in an exon. Since 3' exons are generally larger than 5' exons you'll probably also see a bigger peak at the 3' end.

                            What you'd need for a true view of the trend over a spliced transcript would be to concatenate the exons for each transcript together and do a trend plot over those - missing out the introns. This could actually be a good addition to the program so I'll look at adding that in the a future release.

                            This same problem wouldn't apply to trend plots over exons where you would expect the signal from the reads to be continuous.

                            Comment


                            • #15
                              Originally posted by simonandrews View Post
                              Rory - glad to hear you're liking SeqMonk!

                              If you've put probes over mRNA features and then done a trend plot then the peak you see at the end might not be due to true 3' bias.

                              Different transcripts will have exons at different places along their length. Therefore the trend plot for any individual transcript will go up and down as you pass in and out of an exon. If you average over all transcripts then you'll see the combined signals from all of the transcripts doing this which will even itself out for the most part - however the only places you're guaranteed to be in an exon are at the beginning and end of each transcript, so a trend plot over all transcripts will probably show a peak at each end because of the higher probability of being in an exon. Since 3' exons are generally larger than 5' exons you'll probably also see a bigger peak at the 3' end.

                              What you'd need for a true view of the trend over a spliced transcript would be to concatenate the exons for each transcript together and do a trend plot over those - missing out the introns. This could actually be a good addition to the program so I'll look at adding that in the a future release.

                              This same problem wouldn't apply to trend plots over exons where you would expect the signal from the reads to be continuous.
                              Ahh-- that makes complete sense. I had it in my head that when I was putting probes over the mRNA features I was putting them on an introns spliced out stitched-together version of the mRNA, thank you so much for the clarification.

                              Thanks again, Simon. I say again because you have answered about 15 other questions I have had while looking at my data when you were answering forum posts of other people!

                              Comment

                              Latest Articles

                              Collapse

                              • seqadmin
                                Strategies for Sequencing Challenging Samples
                                by seqadmin


                                Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                                03-22-2024, 06:39 AM
                              • seqadmin
                                Techniques and Challenges in Conservation Genomics
                                by seqadmin



                                The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                                Avian Conservation
                                Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                                03-08-2024, 10:41 AM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by seqadmin, Yesterday, 06:37 PM
                              0 responses
                              8 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, Yesterday, 06:07 PM
                              0 responses
                              8 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 03-22-2024, 10:03 AM
                              0 responses
                              49 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 03-21-2024, 07:32 AM
                              0 responses
                              67 views
                              0 likes
                              Last Post seqadmin  
                              Working...
                              X