Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • immatos
    Junior Member
    • Jul 2012
    • 1

    Log ratios and/or ratios of the log's

    Hello all.
    I have a very basic question about the analisys of my RNA-seq data...can some one help me out?

    Usually we work with the log ratio of RPKM’s, log2(X1 RPKM/ X2 RPKM).
    I would like to know if instead of using log ratios I can aply log to all my RPKM data sets and from then use this log2(rpkm) values for the rest of the analysis. So, basically istead of doing log ratios, do the ratio between log2(RPKM). Example log2(X1 RPKM)= 5 and log2(X2 RPKM) = 5 so ratio=1 meaning equal expression. This is completely wrong and I have to use the log ratios or can I do this way also?


    Thanks
  • ffinkernagel
    Senior Member
    • Oct 2009
    • 110

    #2
    log(a / b) = log(a) - log(b),

    so you can do what you're proposing, but you have to substract the two logs, not divide them.

    Comment

    • aprice67
      Member
      • Nov 2012
      • 49

      #3
      Originally posted by ffinkernagel View Post
      log(a / b) = log(a) - log(b),

      so you can do what you're proposing, but you have to substract the two logs, not divide them.

      I don't understand how that changes the problem. Log(0) is undefined. In that case should I just use 0? If so how will the log(25) - 0 results be different from a result like log(25) - log(x!=0).

      Comment

      • Richard Finney
        Senior Member
        • Feb 2009
        • 701

        #4
        This mabye a dumb question and I needs me some splainin', but ...

        I know folks always did log transforms on old school array data (because it better reflected the known inputs in spike in data).

        But ... why do people do log transforms on RPKMs from RNA seq data ? Isn't 2X reads on a gene really mean 2X expression?

        Comment

        • aprice67
          Member
          • Nov 2012
          • 49

          #5
          Originally posted by Richard Finney View Post
          This mabye a dumb question and I needs me some splainin', but ...

          I know folks always did log transforms on old school array data (because it better reflected the known inputs in spike in data).

          But ... why do people do log transforms on RPKMs from RNA seq data ? Isn't 2X reads on a gene really mean 2X expression?

          I'm not doing a typical experiment. I'm using RNA-Seq to predict transcriptome secondary structure in bacteria by the PARS method outlined in the 2010 nature paper that can be found here: http://genie.weizmann.ac.il/pubs/PARS10/index.html

          I run two protocols, one with a digestion that cuts at single stranded positions and one with a digestion that cuts at double stranded positions.

          To say if any position is in a secondary structure requires that I have counts of the how many reads start at each position, so it isn't measured in RPKM or any other measure usually used for differential expression. Now I have cleaned, aligned, and produced files containing counts for number of reads starting at each position.

          What I measure is the log(protocol1/protocol2) counts at each position to determine if there is secondary structure at each point. However when there is a case that is log(25 single strand/0 double strand) i cant accurately compute a score for that position.

          I still don't have a workable solution to this. I've been in the literature and found that there are some statistical methods that can apply to DE on a per gene basis that I may be able to apply to a per position basis if i tweak the statistics a bit, but I'd rather go with something that has been peer reviewed and tested if ya know what I mean.

          Comment

          • aprice67
            Member
            • Nov 2012
            • 49

            #6
            What I plan to try, is to get distributions of read counts for both protocols and use that to compute if a 0 is significant or not.

            Comment

            Latest Articles

            Collapse

            • SEQadmin2
              Nine Things a Sample Prep Scientist Thinks About Before Sequencing
              by SEQadmin2


              I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

              Here are nine questions we think about, in roughly the order they matter, before...
              06-18-2026, 07:11 AM
            • SEQadmin2
              From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
              by SEQadmin2


              Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


              The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
              ...
              06-02-2026, 10:05 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by SEQadmin2, Yesterday, 05:37 AM
            0 responses
            6 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-26-2026, 11:10 AM
            0 responses
            16 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-17-2026, 06:09 AM
            0 responses
            51 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-09-2026, 11:58 AM
            0 responses
            110 views
            0 reactions
            Last Post SEQadmin2  
            Working...