Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • metheuse
    Member
    • Jan 2013
    • 84

    FPKM=RPKM in single-read RNA-seq?

    Sorry this may be a silly old question.
    I saw some places say "FPKM is equivalent to RPKM in single-end RNA-seq". But I thought FPKM is based on the counts of fragments (reads extended by the fragment length, default to 200bp as the mean for single-end reads). So I don't understand how FPKM could be equal to RPKM for single-end reads.
  • westerman
    Rick Westerman
    • Jun 2008
    • 1104

    #2
    I think that you may be mis-interpreting the statement. It says "FPKM is equivalent" not "equal". "Equivalent" in this case -- as far as I interpret it -- is in the way of thinking about the problem. In other words if you are coming from a single-end-read background and use to thinking about Reads-Per-Kilobase_exon-per-Million_mapped_reads then once you are in the paired-end world and having to deal with pairs that can map to multiple places with multiple overall lengths thus making FPKM more useful, well, all you have to do is substitute the word "FPKM" into places where you use to think "RPKM".

    Comment

    • metheuse
      Member
      • Jan 2013
      • 84

      #3
      Originally posted by westerman View Post
      I think that you may be mis-interpreting the statement. It says "FPKM is equivalent" not "equal". "Equivalent" in this case -- as far as I interpret it -- is in the way of thinking about the problem. In other words if you are coming from a single-end-read background and use to thinking about Reads-Per-Kilobase_exon-per-Million_mapped_reads then once you are in the paired-end world and having to deal with pairs that can map to multiple places with multiple overall lengths thus making FPKM more useful, well, all you have to do is substitute the word "FPKM" into places where you use to think "RPKM".
      Thanks for the answer.
      Yes, I understand FPKM is more different from RPKM in paired-end RNA-seq, because one read does not necessarily correspond to one fragment.
      But in single-end, fragment is basically extended read, so one read corresponds to one fragment absolutely. So counts of fragments would tend to be larger than counts reads, since the extended reads definitely cover larger space. Is this right?

      Comment

      • westerman
        Rick Westerman
        • Jun 2008
        • 1104

        #4
        Well, I wasn't going to discuss your interpretation of FPKM as applied to single-end reads ...

        reads extended by the fragment length, default to 200bp as the mean for single-end reads
        ... since I do not think that it is correct. Cufflinks, with which I am most familiar, does not calculate FPKM in that manner -- as far as I understand the program. But, heck, probably someone somewhere uses that definition. If you could provide a reference to your interpretation of FPKM then perhaps the rest of us can provide more clarification.

        Comment

        • chadn737
          Senior Member
          • Jan 2009
          • 392

          #5
          One potential issue is defining FPKM for discordantly mapped read paired-end reads, however in a single-end context, this would not be an issue.

          The way Cufflinks calculates it, multi-mapping reads are divided up amongst all locations. As a result, one cannot directly calculate the number of reads mapping to a locus simply by multiplying by the locus length and number of reads.

          Comment

          • metheuse
            Member
            • Jan 2013
            • 84

            #6
            Originally posted by westerman View Post
            Well, I wasn't going to discuss your interpretation of FPKM as applied to single-end reads ...



            ... since I do not think that it is correct. Cufflinks, with which I am most familiar, does not calculate FPKM in that manner -- as far as I understand the program. But, heck, probably someone somewhere uses that definition. If you could provide a reference to your interpretation of FPKM then perhaps the rest of us can provide more clarification.
            Thanks.
            I was asking if my understanding is correct. I got the impression that fragments are extended reads from some tutorial slides for RNA-seq, but it's not for Cufflinks. So I may mess it up.
            I was trying to find the definition in the cufflinks paper, but it mainly talks about paired-end data. For single-end, honestly I don't know how it gets fragments from reads.
            If you can tell me your interpretation, that's very appreciated.

            Comment

            • chadn737
              Senior Member
              • Jan 2009
              • 392

              #7
              Originally posted by metheuse View Post
              Thanks.
              I was asking if my understanding is correct. I got the impression that fragments are extended reads from some tutorial slides for RNA-seq, but it's not for Cufflinks. So I may mess it up.
              I was trying to find the definition in the cufflinks paper, but it mainly talks about paired-end data. For single-end, honestly I don't know how it gets fragments from reads.
              If you can tell me your interpretation, that's very appreciated.
              Its simply the name. RPKM was used in the original Mortazavi paper. This calculation is relatively strait-forward where as the Cufflinks method attempts to rescue multi-mapping reads by dividing it up amongst each location. I think part of the motivation then is to distinguish this calculation from the original RPKM calculation.

              Comment

              • metheuse
                Member
                • Jan 2013
                • 84

                #8
                Originally posted by chadn737 View Post
                One potential issue is defining FPKM for discordantly mapped read paired-end reads, however in a single-end context, this would not be an issue.

                The way Cufflinks calculates it, multi-mapping reads are divided up amongst all locations. As a result, one cannot directly calculate the number of reads mapping to a locus simply by multiplying by the locus length and number of reads.
                Thanks for the explanation.
                I still don't understand how cufflinks get "fragments" from "reads" exactly, in a single-end case? There are some parameters in the program to control the estimation of fragment length. I don't know how these are used to get to "fragment counts".

                Comment

                • metheuse
                  Member
                  • Jan 2013
                  • 84

                  #9
                  Let me just ask one question:

                  What does "fragments" exactly mean, in single-end case? (parts of reads? combination of reads? Extension of reads? Extension of parts of reads? Or something else?)

                  Comment

                  • chadn737
                    Senior Member
                    • Jan 2009
                    • 392

                    #10
                    Originally posted by metheuse View Post
                    Let me just ask one question:

                    What does "fragments" exactly mean, in single-end case? (parts of reads? combination of reads? Extension of reads? Extension of parts of reads? Or something else?)
                    Its a name. I would be more concerned with understanding the calculation of it rather than getting hung up over the difference between a read and a fragment.

                    Comment

                    • SrCardgage
                      my other car is a limozeen
                      • Feb 2012
                      • 23

                      #11
                      ontology is important

                      Originally posted by chadn737 View Post
                      Its a name. I would be more concerned with understanding the calculation of it rather than getting hung up over the difference between a read and a fragment.
                      I understand your sentiment. However, I have come across the use of the work "fragment" without knowing exactly what it means. If I don't know what it means, I can't completely wrap my head around the definition that I happen to be reading at the time.

                      I guess what I'm really asking is: does "fragment" have no standard definition when it comes to NGS data? If so, that is very bad.

                      Thanks your all your help.

                      Comment

                      • chadn737
                        Senior Member
                        • Jan 2009
                        • 392

                        #12
                        Originally posted by SrCardgage View Post
                        I understand your sentiment. However, I have come across the use of the work "fragment" without knowing exactly what it means. If I don't know what it means, I can't completely wrap my head around the definition that I happen to be reading at the time.

                        I guess what I'm really asking is: does "fragment" have no standard definition when it comes to NGS data? If so, that is very bad.

                        Thanks your all your help.
                        It does have a definition. Are you familiar with how libraries are prepared? At some step in the actual bench work, the DNA or RNA is fragmented into smaller segments and only fragments of a certain length are then used in sequencing.

                        A read is the portion of the fragment that has been sequenced. Illumina sequencers can sequence from one end of the fragment, giving a single read per fragment, or from both ends in a paired-end fashion, giving two reads per fragment. When you try to assess gene expression from paired-end data, you count the two reads from either end as one count, i.e. one fragment.

                        In single-end data, the read is synonymous with the fragment, however, what you have to remember is that how RPKM, FPKM, etc are calculated differs even with single-end data. So that it can become a bit confusing if you don't understand how these things are being calculated.

                        Comment

                        Latest Articles

                        Collapse

                        • seqadmin
                          Pathogen Surveillance with Advanced Genomic Tools
                          by seqadmin




                          The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
                          03-24-2025, 11:48 AM
                        • seqadmin
                          New Genomics Tools and Methods Shared at AGBT 2025
                          by seqadmin


                          This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

                          The Headliner
                          The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
                          03-03-2025, 01:39 PM

                        ad_right_rmr

                        Collapse

                        News

                        Collapse

                        Topics Statistics Last Post
                        Started by seqadmin, 03-20-2025, 05:03 AM
                        0 responses
                        49 views
                        0 reactions
                        Last Post seqadmin  
                        Started by seqadmin, 03-19-2025, 07:27 AM
                        0 responses
                        57 views
                        0 reactions
                        Last Post seqadmin  
                        Started by seqadmin, 03-18-2025, 12:50 PM
                        0 responses
                        50 views
                        0 reactions
                        Last Post seqadmin  
                        Started by seqadmin, 03-03-2025, 01:15 PM
                        0 responses
                        200 views
                        0 reactions
                        Last Post seqadmin  
                        Working...