SEQanswers

Go Back   SEQanswers > Applications Forums > RNA Sequencing



Similar Threads
Thread Thread Starter Forum Replies Last Post
Pca tpm fpkm RikyJKD Bioinformatics 0 09-05-2016 04:51 AM
Zero fpkm estimation using cuffquant/cuffnorm (Cufflinks 2.2.1) Rainbird Bioinformatics 2 02-17-2015 04:46 PM
Cuffnorm no cds or tss jake13 Bioinformatics 0 01-19-2015 03:35 PM
TPM vs RPKM Benh Bioinformatics 3 09-03-2014 09:42 AM
RNASeq TPM vs. FPKM - What's the point if differential expression tools use counts? ndovu9 Bioinformatics 1 11-25-2013 10:37 AM

Reply
 
Thread Tools
Old 04-16-2017, 09:34 AM   #1
farri
Junior Member
 
Location: Bielefeld

Join Date: Nov 2015
Posts: 4
Default Cuffnorm's FPKM - advisable to perform TPM conversion?

Hi,

I performed a detailed time-course mRNA-seq experiment without any sequencing replicates (the input RNA, however, was a pool of equal amounts of total RNA from biological replicates). As I don't have any replicate information for my sequencing data, I refrained from doing any statistical testing.

I used the Tuxedo protocol for data analysis (Trapnel et al, 2012), i.e. read mapping with TopHat2, transcript assembly by Cufflinks for each individual time point, executed Cuffmerge afterwards, and - instead of Cuffdiff, due to the absence of replicates - I ran Cuffquant and Cuffnorm, the later with default settings.

As a result, Cuffnorm outputs sample-normalized FPKM values. It was brought to my attention by Wagner et al (2012), that TPM values are preferable to FPKM values. Converting my FPKM data to TPM values according to https://haroldpimentel.wordpress.com...ression-units/ changes data analysis partly significantly.

I wonder if it is "allowed" to convert cuffnorm-normalized FPKM values to TPM?
And does it make any sense to convert cuffnorm-FPKM values to TPM?
Or is the cuffnorm-normalization already sufficient to enable robust sample-wise comparison of changes in transcript abundances, hence FPKM to TPM conversion would not be necessary?
farri is offline   Reply With Quote
Old 04-24-2017, 10:00 PM   #2
Dario1984
Senior Member
 
Location: Sydney, Australia

Join Date: Jun 2011
Posts: 155
Default

Of course it makes a substantial difference. TPM doesn't normalise by the length of the gene, FPKM does. Are you comparing between genes, such as by plotting the gene expression values in a heatmap? Then, you need to use FPKM. Otherwise, the heatmap may be misleading. Or, are you doing a differential expression analysis, such as calculating the fold changes of each gene between two timepoints? Then, TPM is fine, since you are comparing the same gene which has the same length between two timepoints (assuming that no alternative splicing is occurring).
Dario1984 is offline   Reply With Quote
Old Today, 04:44 AM   #3
farri
Junior Member
 
Location: Bielefeld

Join Date: Nov 2015
Posts: 4
Default

The focus of my study is on fold change(FC)-based evaluation of transcriptional changes, thus intra-gene comparison. I am interested in genes which are strongly differentially regulated between two conditions. However, I also do some inter-gene comparison, such as comparing the transcript abundance of transcript A with transcript B, in order to determine which transcript is more abundant. For instance, if transcript A is up-regulated 10-fold, but from FPKM 1 to FPKM 10, yet transcript B is up-regulated 3-fold from FPKM 1000 to FPKM 3000, I would conclude that the up-regulation of transcript B might be metabolically more relevant. However, for most cases, I stick only to FC-values.

My key concern is that the use of TPM is advocated over FPKM. FPKM can be easily converted to TPM by dividing the each FPKM value by the sum of all FPKM values of the respective sample, and multiplying this by 1e6. This yields TPM.

However, Cufflinks internally has complicated statistics to compute FPKM values. Therefore I wonder if I violate any rules by converting FPKM to TPM? Does this particularily apply as I ran Cuffnorm, which already performs a normalization?

What do you recommend? Stick to Cuffnorm's FPKM values? Or perform the simple FPKM-to-TPM conversion from above and use TPM values instead?
farri is offline   Reply With Quote
Old Today, 05:00 PM   #4
Dario1984
Senior Member
 
Location: Sydney, Australia

Join Date: Jun 2011
Posts: 155
Default

I would use the FPKM you obtained from Cufflinks. Before you do, check the range of FPKM values calculated by Cufflinks. We find some are theoretically impossible values by considering how many reads the sample had and assuming that they all mapped to only the gene with the highest FPKM value determined by Cufflinks, then ask "what would the theoretical maximum FPKM be"? Cufflinks sometimes calculates FPKMs that are unrealistically large. If, however, for your dataset, Cufflinks calculates FPKMs that seem reasonable, then just use those. Although TPM is more popular these days, there's no reason why it's more valid than FPKM, if you are confident that the FPKM values are being calculated correctly by the software you're using.
Dario1984 is offline   Reply With Quote
Reply

Tags
cuffnorm, fpkm, tpm

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off




All times are GMT -8. The time now is 09:17 PM.


Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2017, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO