Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • question about RNA-seq

    Hi everyone,

    I have a question about RNA-seq.

    "Some manipulations during library construction also complicate the analysis
    of RNA-Seq results. For example, many shorts reads that are identical to each
    other can be obtained from cDNA libraries that have been amplified. These could be a genuine reflection of abundant RNA species, or they could be PCR artefacts. One way to discriminate between these possibilities is to determine whether the same sequences are observed in different biological replicates.(Nat Rev Genet. 2009 Jan;10(1):57-63)"

    Is there any other way in the above situation to discriminate between a genuine reflection of abundant RNA or PCR artefacts?

    Thanks,

    Hai-Ri Li

  • #2
    If your library has been randomly fragmented then it should be possible to look for fragments which appear way more frequently than would be expected by chance. In even a short mRNA you should have a range of different possible fragments and if this is an abundant transcript most or all of these should appear more frequently. PCR artefacts usually affect only a small subset of all possible fragments and would produce a very uneven distribution of fragments over the transcript.

    Comment


    • #3
      Thank you for interpretation. You mean we can ignore PCR artefacts because they affect only a small subsets of fragments and it will not affect final analysis results. However, sometimes we found so many tags hit the same positions so that we cannot ingore them.

      Again, is there any other way in the above situation to discriminate between a genuine reflection of abundant RNA or PCR artefacts?

      Thanks,

      Hai-Ri Li

      Comment


      • #4
        Originally posted by lihairi View Post
        You mean we can ignore PCR artefacts because they affect only a small subsets of fragments and it will not affect final analysis results.
        No, that's not what I'm saying. What I was trying to say was that you can normally distinguish PCR artefacts from expression changes because expression changes normally involved the even enrichment of a large number of different fragments over the expressed region, whereas PCR artefacts usually take only a small number of fragments and amplify them to an unnatural degree when viewed in the context of the surrounding fragments.

        We usually filter our data by measuring the percentage of reads in a region which come from exact overlaps. If this value is above 5-10% then we reject it as a likely PCR artefact. I'm intending to move this to an observed/expected calculation though as this is less prone to errors in very short regions with high coverage.

        Comment


        • #5
          Last time you mentioned "We usually filter our data by measuring the percentage of reads in a region which come from exact overlaps". Here region must mean a window, 100 base? 200 base?

          How to do observed/expected calculation?

          Thanks.

          Comment


          • #6
            Originally posted by lihairi View Post
            Last time you mentioned "We usually filter our data by measuring the percentage of reads in a region which come from exact overlaps". Here region must mean a window, 100 base? 200 base?
            In our case region is pretty generic - sometimes we use fixed size windows (with a size which depends normally on our data density), in other cases we construct contigs from sets of overlapping reads, or we might design probes over particular classes of annotation feature (genes, exons, microRNAs, whatever). These things change depending on what kind of experiment you're running.


            Originally posted by lihairi View Post
            How to do observed/expected calculation?
            We're actually not using a proper O/E calculation at the moment (though it would be nice to move to that). Our filter calculates what percentage of reads which overlap a particular region come from exact overlaps, with the same start and end position. For randomly placed reads this value is usually very low (below 5%), but in some cases you will see towers of exactly duplicated reads which usually indicate a mapping or PCR problem, and we filter these out. You can also get high values from low absolute numbers of reads, so you either need to account for this, or ignore it if you're going to filter those regions anyway.

            I hope that makes things a bit clearer.

            Comment


            • #7
              Recently I downloaded a lot of RNA-seq data from NCBI and mapped to Reference RNA using eland_25. I found around 30-40% of tags were mapped to the exactly same postions as others (even though removing the effect of RNA isorforms), much higher than 5%. I do not know how to interpretate this results.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Strategies for Sequencing Challenging Samples
                by seqadmin


                Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                03-22-2024, 06:39 AM
              • seqadmin
                Techniques and Challenges in Conservation Genomics
                by seqadmin



                The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                Avian Conservation
                Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                03-08-2024, 10:41 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, Yesterday, 06:37 PM
              0 responses
              8 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, Yesterday, 06:07 PM
              0 responses
              8 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 03-22-2024, 10:03 AM
              0 responses
              49 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 03-21-2024, 07:32 AM
              0 responses
              67 views
              0 likes
              Last Post seqadmin  
              Working...
              X