  • Should one combine normalization methods in RNA-seq?

    Does anyone know if RPKM normalization and quantile normalization of RNA-seq read counts can or should be combined?

    For normalization of paired-end reads in RNA-seq, is it valid to quantile-normalize the raw read counts and then calculate the RPKM value for each transcript from the quantile-normalized values?

    It seems that RPKM normalization using the total millions of mapped reads is somewhat redundant with an initial quantile normalization of read counts. Are these two methods mutually exclusive or should they be combined? Any recommendations?
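For reference, here is a minimal sketch of the RPKM calculation the question refers to (reads per kilobase of transcript per million mapped reads). The function name, the toy counts, and the transcript lengths are hypothetical, chosen only for illustration. It shows why the depth scaling can look redundant: doubling every count leaves the RPKM values unchanged.

```python
def rpkm(counts, lengths_bp, total_mapped_reads=None):
    """Reads per kilobase of transcript per million mapped reads.

    counts and lengths_bp are parallel lists (one entry per transcript).
    If total_mapped_reads is not given, the sum of counts is used,
    which is an assumption made for this toy example.
    """
    if total_mapped_reads is None:
        total_mapped_reads = sum(counts)
    per_million = total_mapped_reads / 1e6
    return [c / (length / 1e3) / per_million
            for c, length in zip(counts, lengths_bp)]


# Hypothetical toy sample: three transcripts, one million mapped reads.
counts = [500_000, 300_000, 200_000]
lengths_bp = [2_000, 1_000, 4_000]

values = rpkm(counts, lengths_bp)

# The same library sequenced twice as deep: every count doubles.
values_deeper = rpkm([2 * c for c in counts], lengths_bp)
```

In this toy case `values` and `values_deeper` are identical, because the per-million factor already divides out the total number of mapped reads.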

    thanks!

  • #2
    It's a bit late for this answer, but maybe future users will benefit from it:
    When you use FPKM/RPKM as a unit, you already have a normalized measure that accounts for gene length (and, through the per-million scaling, for the total number of mapped reads). However, when you have many samples, they are often sequenced to different depths, so the same underlying expression level can still show up as different values in different samples. To account for that, a global scaling or a full quantile normalization will help. In fact, after doing it, all the different "lanes" end up with the same statistical properties, which removes the sequencing-depth problem.

    So, in conclusion, use RPKM/FPKM and then apply a full quantile normalization across all the samples; it works and is good practice.
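As a rough illustration of the "full quantile" step described in this post, here is a minimal pure-Python sketch. It assumes the input values are already RPKM/FPKM, one list per sample with genes in the same order; the function name and the toy numbers are hypothetical. Ties are broken by gene order here, which is a simplification (established implementations usually average tied ranks).

```python
def quantile_normalize(samples):
    """Full quantile normalization across samples.

    samples: list of equal-length lists, one list per sample,
    one value per gene (same gene order in every sample).
    Returns new lists in which every sample has the same set of values.
    """
    n_genes = len(samples[0])
    n_samples = len(samples)

    # Sort each sample, then average across samples at each rank.
    sorted_samples = [sorted(s) for s in samples]
    rank_means = [sum(s[i] for s in sorted_samples) / n_samples
                  for i in range(n_genes)]

    normalized = []
    for s in samples:
        # Gene indices ordered from lowest to highest value in this sample.
        order = sorted(range(n_genes), key=lambda g: s[g])
        out = [0.0] * n_genes
        for rank, gene in enumerate(order):
            out[gene] = rank_means[rank]
        normalized.append(out)
    return normalized


# Hypothetical RPKM values for four genes in two samples.
sample_a = [5.0, 2.0, 3.0, 4.0]
sample_b = [4.0, 10.0, 6.0, 8.0]

norm_a, norm_b = quantile_normalize([sample_a, sample_b])
```

After this step both samples contain exactly the same values (3.0, 4.5, 6.0, 7.5), assigned according to each gene's rank within its own sample, which is the sense in which the lanes end up with "the same statistical properties".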


    • #3
      A better solution would be to never use RPKM/FPKM, since it tends to kill your statistical power: that power depends on the raw counts, and the information they carry gets normalized away by the RPKM transformation.
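One way to picture this point is the sketch below. It assumes simple Poisson sampling noise, which is an assumption of this example and not something stated in the post; the gene sizes, counts, and function names are hypothetical. Two genes can have the same RPKM while being supported by very different numbers of reads, and the RPKM value alone no longer tells you which one is measured more precisely.

```python
import math


def rpkm(count, length_bp, total_mapped_reads):
    """Reads per kilobase of transcript per million mapped reads."""
    return count / (length_bp / 1e3) / (total_mapped_reads / 1e6)


def poisson_cv(count):
    """Coefficient of variation of a Poisson count (assumed noise model)."""
    return 1.0 / math.sqrt(count)


total_mapped_reads = 10_000_000  # hypothetical library size

# Hypothetical genes: same read density, different lengths.
short_gene = {"count": 100, "length_bp": 1_000}
long_gene = {"count": 1_000, "length_bp": 10_000}

short_rpkm = rpkm(short_gene["count"], short_gene["length_bp"], total_mapped_reads)
long_rpkm = rpkm(long_gene["count"], long_gene["length_bp"], total_mapped_reads)

short_cv = poisson_cv(short_gene["count"])
long_cv = poisson_cv(long_gene["count"])
```

Both genes come out at the same RPKM, yet under the Poisson assumption the short gene's count is relatively noisier than the long gene's. Count-based methods can use that difference; a table of RPKM values does not retain it.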
