Seqanswers Leaderboard Ad

**dpryan** · 04-24-2015, 04:11 AM

tldr: No one should ever do what they did.

Longer reply:
What they did makes no sense. It's a sad critique of peer review that this even got accepted, since likely no one that knew anything about data analysis actually reviewed the paper...only pure wet-lab people.

So people often have cases where they need to use some sort of expected counts rather than pure integer values, often due to only having assembled transcriptomes or needing to do transcript-level analyses. The better method to deal with this is to get expected counts (e.g., with eXpress, or rsem, or ...) and then use things like limma/voom or even edgeR with those (you could use DESeq2 in theory, but it'll throw an error).

Edit: Heck, you're even better of with rounded expected counts than rounded 10xFPKMs. The former has less precision loss.

Edit2: Is it sad that I quickly checked to ensure that I don't work directly with any of the authors before I posting?

**SylvainL** · 04-24-2015, 05:40 AM

Originally posted by dpryan View Post

Edit2: Is it sad that I quickly checked to ensure that I don't work directly with any of the authors before I posting?

No, I had the same first reflex... I think this kind of paper will not be accepted in a short term future. Personally, I was already asked twice in a month to specifically review the Data analysis part, at the second stage of revision... Hope it will be soon automatic!!

**frymor** · 04-27-2015, 03:38 AM

Originally posted by dpryan View Post

tldr: No one should ever do what they did.

yes, this is exactly what I thought.
The paper is "relatively" old and I don't think something like that will be accepted nowadays (I hope so).

Originally posted by dpryan View Post

So people often have cases where they need to use some sort of expected counts rather than pure integer values, often due to only having assembled transcriptomes or needing to do transcript-level analyses. The better method to deal with this is to get expected counts (e.g., with eXpress, or rsem, or ...) and then use things like limma/voom or even edgeR with those (you could use DESeq2 in theory, but it'll throw an error).

This I don't understand.
Why can't I just use htseq-count or featureCounts to get the read counts and than run DESeq like a normal work flow?
Why can I run edgeR but not DESeq?

thanks
Assa

**dpryan** · 04-27-2015, 03:43 AM

DESeq2 is explicitly written to throw an error if you try to do this. That's the only reason. You could change the code to allow this and it'll be just as reliable as edgeR.

**frymor** · 04-28-2015, 03:39 AM

Originally posted by dpryan View Post

DESeq2 is explicitly written to throw an error if you try to do this.

Do you mean here "working with expected counts"?

Can edgeR work with them?

**dpryan** · 04-28-2015, 03:46 AM

Yes, or anything else that isn't an integer.

Yes, edgeR doesn't throw an error (at least the last time I looked), so it'll work. I'm personally a bit more comfortable with limma/voom for this sort of thing, but that's personal preference.

**hartmaier** · 07-16-2015, 10:03 AM

Originally posted by dpryan View Post

tldr: No one should ever do what they did.

Longer reply:
What they did makes no sense. It's a sad critique of peer review that this even got accepted, since likely no one that knew anything about data analysis actually reviewed the paper...only pure wet-lab people.

Wow! So now that we (i.e. those on this forum) know there is a likely catastrophic flaw in the RNAseq analysis in this paper (which is a major focus of the study), is there a responsibility to notify the journal? This is in PNAS. After my quick read of the paper, it looks like 3/4 figures directly use the results from this flawed analysis, so it likely has more than a trivial impact on the study's conclusions.

**fanli** · 07-16-2015, 01:14 PM

From the paper:

...analyzed using state-of-the-art methods.

**dpryan** · 07-17-2015, 04:08 AM

Originally posted by hartmaier View Post

Wow! So now that we (i.e. those on this forum) know there is a likely catastrophic flaw in the RNAseq analysis in this paper (which is a major focus of the study), is there a responsibility to notify the journal? This is in PNAS. After my quick read of the paper, it looks like 3/4 figures directly use the results from this flawed analysis, so it likely has more than a trivial impact on the study's conclusions.

I suppose that one could try, but I wouldn't hold my breath that that would get a reply. What might be more worthwhile is to redo the analysis properly and see if the results change drastically. If so, then it'd be useful to notify the authors/journal. If not, maybe post a comment on pubmed central noting that so others don't need to redo the analysis to see if the results actually hold up.

**hartmaier** · 07-17-2015, 07:52 AM

Originally posted by dpryan View Post

What might be more worthwhile is to redo the analysis properly and see if the results change drastically. If so, then it'd be useful to notify the authors/journal.

Yeah, that's what I was thinking as well. Something to do on a rainy weekend I guess.

**pashu912** · 04-29-2016, 12:25 PM

Hi,

I am doing cross species study and found a paper about similar work. I think the data analysis in the paper is not appropriate and decided to ask here!
They have done differential gene expression analysis of FPKM data consisting of different species as follows:

1. They generate FPKM data with trinity.
2. Then they Normalize the FPKM data to account for length difference in orthologs.
3. They scale the normalized FPKM data by a common factor such that the lowest expressed gene’s value becomes 1
4. Then they round the values to the nearest integer and use edgeR.

Will the above approach give sensible results? I doubt because I don't think scaling the FPKM data makes it any similar to raw count data in terms of mean-variance relationship!

**dpryan** · 04-29-2016, 12:29 PM

They may have gotten lucky and gotten sensible results with that method, but I suspect that they got mostly gibberish results.

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Today, 11:49 AM	0 responses 12 views 0 likes	Last Post by seqadmin Today, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Yesterday, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin Yesterday, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

DESeq with FPKM values

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News