
**westerman** · 05-01-2013, 10:21 AM

I think that you may be misinterpreting the statement. It says "FPKM is equivalent", not "equal". "Equivalent" in this case -- as far as I interpret it -- refers to the way of thinking about the problem. In other words, if you are coming from a single-end-read background and are used to thinking in terms of Reads Per Kilobase of exon per Million mapped reads, then once you are in the paired-end world, where pairs can map to multiple places with multiple overall lengths (which is what makes FPKM more useful), all you have to do is substitute "FPKM" wherever you used to think "RPKM".

**metheuse** · 05-01-2013, 10:46 AM

Originally posted by westerman:

I think that you may be misinterpreting the statement. It says "FPKM is equivalent", not "equal". "Equivalent" in this case -- as far as I interpret it -- refers to the way of thinking about the problem. In other words, if you are coming from a single-end-read background and are used to thinking in terms of Reads Per Kilobase of exon per Million mapped reads, then once you are in the paired-end world, where pairs can map to multiple places with multiple overall lengths (which is what makes FPKM more useful), all you have to do is substitute "FPKM" wherever you used to think "RPKM".

Thanks for the answer.
Yes, I understand that FPKM differs more from RPKM in paired-end RNA-seq, because one read does not necessarily correspond to one fragment.
But in single-end data, a fragment is basically an extended read, so one read corresponds to exactly one fragment. So fragment counts would tend to be larger than read counts, since the extended reads cover more space. Is this right?

**westerman** · 05-01-2013, 11:19 AM

Well, I wasn't going to discuss your interpretation of FPKM as applied to single-end reads ...

reads extended by the fragment length, default to 200bp as the mean for single-end reads

... since I do not think that it is correct. Cufflinks, with which I am most familiar, does not calculate FPKM in that manner -- as far as I understand the program. But, heck, probably someone somewhere uses that definition. If you could provide a reference to your interpretation of FPKM then perhaps the rest of us can provide more clarification.

**chadn737** · 05-01-2013, 11:49 AM

One potential issue is defining FPKM for discordantly mapped paired-end reads; in a single-end context, however, this would not be an issue.

The way Cufflinks calculates it, multi-mapping reads are divided up amongst all locations. As a result, one cannot directly calculate the number of reads mapping to a locus simply by multiplying the FPKM by the locus length and the total number of reads.
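To make the point about divided multi-mapping reads concrete, here is a minimal sketch that splits each read uniformly across the loci it aligns to. The function name, the input layout, and the uniform 1/n weighting are assumptions chosen for illustration; this is not Cufflinks code, and Cufflinks' actual weighting may differ.

```python
from collections import defaultdict


def fractional_counts(alignments):
    """Split each read evenly across every locus it aligns to.

    alignments: dict mapping a read ID to the list of loci it aligns to
    (a hypothetical input layout, used here only for illustration).
    """
    counts = defaultdict(float)
    for read_id, loci in alignments.items():
        if not loci:
            continue  # unaligned read contributes nothing
        weight = 1.0 / len(loci)  # uniform split: an assumption of this sketch
        for locus in loci:
            counts[locus] += weight
    return dict(counts)


# Toy data: r2 and r4 are multi-mapping reads.
alignments = {
    "r1": ["geneA"],
    "r2": ["geneA", "geneB"],
    "r3": ["geneB"],
    "r4": ["geneB", "geneC"],
}
print(fractional_counts(alignments))
# {'geneA': 1.5, 'geneB': 2.0, 'geneC': 0.5}
```

In this toy data three reads align to geneB, but its effective count is 2.0. Back-calculating from a normalized value would therefore recover the fractional count, not the number of reads that actually touch the locus.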

**metheuse** · 05-01-2013, 11:51 AM

Originally posted by westerman:

Well, I wasn't going to discuss your interpretation of FPKM as applied to single-end reads ...

... since I do not think that it is correct. Cufflinks, with which I am most familiar, does not calculate FPKM in that manner -- as far as I understand the program. But, heck, probably someone somewhere uses that definition. If you could provide a reference to your interpretation of FPKM then perhaps the rest of us can provide more clarification.

Thanks.
I was asking if my understanding is correct. I got the impression that fragments are extended reads from some tutorial slides for RNA-seq, but those were not specific to Cufflinks. So I may have mixed things up.
I was trying to find the definition in the Cufflinks paper, but it mainly talks about paired-end data. For single-end, honestly I don't know how it gets fragments from reads.
If you can tell me your interpretation, that would be much appreciated.

**chadn737** · 05-01-2013, 11:55 AM

Originally posted by metheuse:

Thanks.
I was asking if my understanding is correct. I got the impression that fragments are extended reads from some tutorial slides for RNA-seq, but those were not specific to Cufflinks. So I may have mixed things up.
I was trying to find the definition in the Cufflinks paper, but it mainly talks about paired-end data. For single-end, honestly I don't know how it gets fragments from reads.
If you can tell me your interpretation, that would be much appreciated.

It's simply the name. RPKM was used in the original Mortazavi paper. That calculation is relatively straightforward, whereas the Cufflinks method attempts to rescue multi-mapping reads by dividing them up amongst each location. I think part of the motivation, then, is to distinguish this calculation from the original RPKM calculation.
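For reference, the straightforward RPKM calculation can be sketched as follows: reads on a feature, divided by the feature length in kilobases and by the total mapped reads in millions. The function and parameter names are hypothetical, and the numbers in the example are made up.

```python
def rpkm(read_count, feature_length_bp, total_mapped_reads):
    """Reads per kilobase of feature per million mapped reads.

    Equivalent to read_count / (feature_length_bp / 1e3) / (total_mapped_reads / 1e6).
    """
    if feature_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("feature length and total mapped reads must be positive")
    return read_count * 1e9 / (feature_length_bp * total_mapped_reads)


# Made-up example: 500 reads on a 2,000 bp transcript, 10 million mapped reads.
print(rpkm(500, 2000, 10_000_000))  # 25.0
```

Passing a fragment count instead of a read count to the same formula gives an FPKM-style value. Any difference between tools then comes from how that count is obtained, not from the normalization itself.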

**metheuse** · 05-01-2013, 11:58 AM

Originally posted by chadn737:

One potential issue is defining FPKM for discordantly mapped paired-end reads; in a single-end context, however, this would not be an issue.

The way Cufflinks calculates it, multi-mapping reads are divided up amongst all locations. As a result, one cannot directly calculate the number of reads mapping to a locus simply by multiplying the FPKM by the locus length and the total number of reads.

Thanks for the explanation.
I still don't understand exactly how Cufflinks gets "fragments" from "reads" in the single-end case. There are some parameters in the program to control the estimation of fragment length, but I don't know how these are used to arrive at "fragment counts".

**metheuse** · 05-01-2013, 12:06 PM

Let me just ask one question:

What exactly does "fragments" mean in the single-end case? (Parts of reads? Combinations of reads? Extensions of reads? Extensions of parts of reads? Or something else?)

**chadn737** · 05-01-2013, 12:08 PM

Originally posted by metheuse:

Let me just ask one question:

What exactly does "fragments" mean in the single-end case? (Parts of reads? Combinations of reads? Extensions of reads? Extensions of parts of reads? Or something else?)

It's a name. I would be more concerned with understanding how it is calculated than with getting hung up on the difference between a read and a fragment.

**SrCardgage** · 07-12-2013, 01:11 PM

ontology is important

Originally posted by chadn737:

It's a name. I would be more concerned with understanding how it is calculated than with getting hung up on the difference between a read and a fragment.

I understand your sentiment. However, I have come across the use of the word "fragment" without knowing exactly what it means. If I don't know what it means, I can't completely wrap my head around the definition that I happen to be reading at the time.

I guess what I'm really asking is: does "fragment" have no standard definition when it comes to NGS data? If so, that is very bad.

Thanks for all your help.

**chadn737** · 07-12-2013, 01:21 PM

Originally posted by SrCardgage:

I understand your sentiment. However, I have come across the use of the word "fragment" without knowing exactly what it means. If I don't know what it means, I can't completely wrap my head around the definition that I happen to be reading at the time.

I guess what I'm really asking is: does "fragment" have no standard definition when it comes to NGS data? If so, that is very bad.

Thanks for all your help.

It does have a definition. Are you familiar with how libraries are prepared? At some step in the actual bench work, the DNA or RNA is fragmented into smaller segments and only fragments of a certain length are then used in sequencing.

A read is the portion of the fragment that has been sequenced. Illumina sequencers can sequence from one end of the fragment, giving a single read per fragment, or from both ends in a paired-end fashion, giving two reads per fragment. When you try to assess gene expression from paired-end data, you count the two reads from either end as one count, i.e. one fragment.

In single-end data, the read is synonymous with the fragment. What you have to remember, however, is that the way RPKM, FPKM, etc. are calculated differs even with single-end data, so it can become a bit confusing if you don't understand how these values are being calculated.
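The read-versus-fragment distinction described above can be sketched in a few lines. This assumes that both mates of a pair share one read name and map concordantly to the same locus; the function name and record layout are hypothetical, and discordant pairs and multi-mapping reads are ignored.

```python
def count_fragments(read_records):
    """Count fragments per locus from (read_name, locus) records.

    Mates of a pair are assumed to share the same read_name, so each
    (read_name, locus) combination is counted once, i.e. as one fragment.
    """
    fragments = set(read_records)  # collapses the two mates of a pair
    counts = {}
    for _name, locus in fragments:
        counts[locus] = counts.get(locus, 0) + 1
    return counts


# Paired-end toy data: every fragment contributes two read records.
paired = [("f1", "geneA"), ("f1", "geneA"),
          ("f2", "geneA"), ("f2", "geneA"),
          ("f3", "geneB"), ("f3", "geneB")]
print(sorted(count_fragments(paired).items()))
# [('geneA', 2), ('geneB', 1)]

# Single-end toy data: one read record per fragment.
single = [("f1", "geneA"), ("f2", "geneA"), ("f3", "geneB")]
print(sorted(count_fragments(single).items()))
# [('geneA', 2), ('geneB', 1)]
```

In the paired-end case, six read records collapse to three fragments. In the single-end case, the fragment count equals the read count, so under this simple counting scheme the two normalizations would agree.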

FPKM=RPKM in single-read RNA-seq?
