
**crazyhottommy** · 07-15-2014, 09:13 AM

Reading this paper may help: http://genomebiology.com/2013/14/9/R95

**mbblack** · 07-15-2014, 09:35 AM

First of all, it sounds like you do not have very many differentially expressed genes (DEGs) no matter how you slice it. Given that, no, I'm not surprised that the different analyses gave such different results. In a study where any one analysis yields a hundred or so DEGs at most, I would be surprised if different analyses did not give quite different gene lists.

I would NOT suggest using different thresholds for some analyses than for others just to increase the numbers and the degree of overlap; that smacks of cherry-picking your stringency just to get the lists to align the way you want. Whatever cutoff you choose needs to be applied uniformly for the results to be comparable. As always, you will get your most reliable DGE results by simultaneously applying a statistical threshold (e.g. FDR < 0.05) and a fold-change threshold (e.g. absolute fold change > 1.5 or > 2.0).

In your case, that will end up just reducing your lists even further.
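As a rough illustration of applying one cutoff uniformly and then checking the overlap, here is a minimal Python sketch. The gene names and values are made up, `third_method` is a placeholder for whatever the third analysis was, and each result is assumed to have already been reduced to a log2 fold change and an adjusted p-value per gene.

```python
import math

def select_degs(results, fdr_cutoff=0.05, fc_cutoff=1.5):
    """Return the genes passing BOTH the FDR and the absolute fold-change cutoff.

    `results` maps gene -> (log2 fold change, adjusted p-value).
    """
    min_abs_log2fc = math.log2(fc_cutoff)  # fold change 1.5 -> ~0.585 on the log2 scale
    return {
        gene
        for gene, (log2fc, padj) in results.items()
        if padj is not None and padj < fdr_cutoff and abs(log2fc) > min_abs_log2fc
    }

# Toy, made-up results for three analyses of the same dataset (hypothetical values).
cuffdiff = {"geneA": (1.2, 0.001), "geneB": (0.3, 0.01), "geneC": (-1.0, 0.04)}
deseq2 = {"geneA": (1.1, 0.02), "geneB": (0.2, 0.20), "geneC": (-0.9, 0.06)}
third_method = {"geneA": (1.3, 0.004), "geneC": (-1.1, 0.03)}

# The same cutoffs are applied to every analysis before comparing the lists.
deg_lists = {
    name: select_degs(res)
    for name, res in [("cuffdiff", cuffdiff), ("deseq2", deseq2), ("third_method", third_method)]
}
shared = set.intersection(*deg_lists.values())

for name, genes in deg_lists.items():
    print(name, sorted(genes))
print("shared by all three:", sorted(shared))
```

Because the cutoffs are arguments to a single function, changing the stringency changes it for all three lists at once, which keeps the comparison fair.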

It sounds to me like you simply have very little DGE actually going on in your treatments. Either that, or you lack the biological replicates and/or read depth to adequately detect the differences that do exist. If these are all fairly low expressors, then read depth may be your single limiting factor, as it takes much higher read depth to detect subtle changes (at least with any real statistical rigor) than it does to detect large-scale gene expression changes.

Just how many biological replicates did you run? Do you still have library material left that you could use to increase read depth? Low expressors are low-count features, which also have the highest variance within your mapped read count data. They will inherently be the most difficult to detect reliably and with statistical confidence, and both increasing biological replication and increasing read depth can help identify them.

**JuliaS** · 07-15-2014, 11:22 AM

Thank you both for your helpful responses. I find that article interesting because they found that cuffdiff has a lot more false positives than other methods, so maybe the larger number of DEGs in my cuffdiff results is actually just false positives...
As for biological replicates, I have 5 females and 5 males per diet group. I analyzed males and females separately as well as together.
Our average read depth is about 18 million reads per sample, and this is single-end sequencing, so that read depth should be sufficient, right?
I understand your point about using the same cutoff for all analyses, but would it be OK to stick with my cuffdiff output cutoff (q < 0.05), and then pick a different cutoff to use for all of the DESeq2 analyses? DESeq2 reports adjusted p-values, not q-values, anyway. Can you suggest a particular cutoff or how I should determine the cutoff?
Thanks for your help!
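On the adjusted p-value versus q-value question: DESeq2's `padj` column is Benjamini-Hochberg adjusted by default, and, as far as I know, cuffdiff's `q_value` is likewise an FDR-adjusted p-value, so a 0.05 cutoff on either one targets the same false discovery rate. The sketch below shows what a Benjamini-Hochberg adjustment does to a list of raw p-values. It is a plain-Python illustration with made-up numbers, not the code either tool runs.

```python
def bh_adjust(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    n = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(n), key=lambda i: pvalues[i])
    adjusted = [0.0] * n
    running_min = 1.0
    # Walk from the largest p-value down so each adjusted value can never
    # exceed the adjusted value of the next-larger p-value.
    for rank in range(n, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * n / rank)
        adjusted[i] = running_min
    return adjusted

# Made-up raw p-values for four genes.
raw = [0.01, 0.04, 0.03, 0.005]
adj = bh_adjust(raw)
for p, q in zip(raw, adj):
    print(f"raw p = {p:.3f} -> BH-adjusted = {q:.3f}")
```

Calling a gene significant when its adjusted value is below 0.05 means accepting an expected 5% of false discoveries among the genes called, whichever tool produced the column.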


Finding overlap in 3 RNA-Seq analyses of same dataset
