Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How would you calculate the coverage for RNAseq?

    I am thinking that the most important quantification parameter must be how many reads are mapped within annotated regions?

    Hence it is not mapped reads per whole genome, but rather mapped reads per annotated region, like so:
    coverage = n_mapped_reads * read_length / (n_annotated_regions * average_length_of_annotated_regions)

    This way we get average number of times each annotated nucleotide was sequenced?

    L
    Last edited by LeonDK; 06-18-2015, 01:18 AM.

  • #2
    So if a region is not annotated you are going to consider reads mapping there not important/real?

    Comment


    • #3
      Originally posted by GenoMax View Post
      So if a region is not annotated you are going to consider reads mapping there not important/real?
      Hmm... No and yes... Of course if something maps it (in theory) means that a transcript originated from there. But if I am to use the annotations for GRCh38 for analysis of DEG, than the reads mapping in non-annotated regions are not considered?

      Comment


      • #4
        If you are explicitly interested in the "annotated" regions then that is a limitation you are imposing. Reads mapping elsewhere will still be real.

        Take a look at these: http://bedtools.readthedocs.org/en/l.../coverage.html http://bedtools.readthedocs.org/en/l...genomecov.html

        Comment


        • #5
          Why do you want to calculate coverage in the first place? If you're using it for gene-expression profiling, RNA-seq is a read-counting application, not a base-counting application, so for most purposes the read count is a more appropriate measure of sequencing depth.

          Comment


          • #6
            Originally posted by jwfoley View Post
            Why do you want to calculate coverage in the first place? If you're using it for gene-expression profiling, RNA-seq is a read-counting application, not a base-counting application, so for most purposes the read count is a more appropriate measure of sequencing depth.
            Good point!

            So in your opinion, it is sufficient to report the number of mapped reads? E.g. "we included samples with at least 10 mio. mapped reads" or something similar?

            Comment


            • #7
              Originally posted by LeonDK View Post
              Good point!

              So in your opinion, it is sufficient to report the number of mapped reads? E.g. "we included samples with at least 10 mio. mapped reads" or something similar?
              Yes, that is the single number that wraps it up. For reference, ENCODE's old standard for gene-expression profiling was 30 million paired-end reads (for human samples), but that's not the greatest recommendation anyway.

              Comment


              • #8
                Originally posted by LeonDK View Post
                So in your opinion, it is sufficient to report the number of mapped reads? E.g. "we included samples with at least 10 mio. mapped reads" or something similar?
                What kind of "report" are we referring to here? How do you decide on a number for cut-off? Is the distribution of mapped reads assured to be equally representative for all samples, if you are only going to report a total number for mapped reads? And so on ...
                Last edited by GenoMax; 06-18-2015, 09:15 AM.

                Comment


                • #9
                  Ok, so of course I know very well, that there is no magic number. In fact pretty much every cut-off we use is arbitrary, especially the infamous 0.05 - Anyhoo...

                  For publication-purposes it is necessary to account for how 'bad' samples were excluded.

                  Hart et al. (10.1089/cmb.2012.0283) states that: "In other words, a sequencing depth of 10 million reads will ensure that approximately 90% of all genes will be covered by at least 10 reads."

                  That's why a suggested 10 mio. as a 'nice' arbitrary number

                  Comment

                  Latest Articles

                  Collapse

                  • seqadmin
                    Current Approaches to Protein Sequencing
                    by seqadmin


                    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                    04-04-2024, 04:25 PM
                  • seqadmin
                    Strategies for Sequencing Challenging Samples
                    by seqadmin


                    Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                    03-22-2024, 06:39 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by seqadmin, 04-11-2024, 12:08 PM
                  0 responses
                  30 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 10:19 PM
                  0 responses
                  32 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 09:21 AM
                  0 responses
                  28 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-04-2024, 09:00 AM
                  0 responses
                  52 views
                  0 likes
                  Last Post seqadmin  
                  Working...
                  X