Hi, I am using cuffdiff on single end illumina data. I naturally get a lot more significantly differentially expressed genes if I lower my threshold from 500 to 300, for example. When I use the default 500, much of my data comes out with NOTEST. What is an acceptable -c value to use to still find significantly differentially expressed genes?
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
I'm not going to claim to be an expert on just how to manipulate the code Cuffdiff uses, but it seems to me 500 is very high unless you have some ridiculous coverage. I had the same issues even with relatively large amounts of total RNA (5-10 ug) used and with genes I know have good expression levels from other experiments. So I cut the threshold down to 250. The P-values for most called differences where still way below .05. I know you run into alpha error inflation, because you're running these test 20000 time or more, depending on which genome you're working with, but you have to balance those false positive reportings with the false negatives for having the cutoff too high.
Anyway, I'm betting you're going to have to do replicates somehow regardless, so I rather set the cuttoff too low for RNA-seq, then shrink that list down with what ever kind of validation you're doing.
-
Thank you very much for your response. I have actually changed my -c option to 0, seeing that there are genes with a smaller amount of reads that still can be differentially expressed. Can anyone comment to this approach, or if I am getting a lot of false values?
Thanks again!
Comment
-
-c 0
I have also set this parameter to zero and lowered my FDR to reduce false discovery. I believe the key to analyzing this data is to realize that there are no absolutes and that we are creating a model and fitting the data as best as possible to this model. Be aware of your assumptions and the pitfalls of those assumptions and get into the data and work with it. Patterns will emerge and from that you can develop testable hypotheses.
Comment
Latest Articles
Collapse
-
by seqadmin
Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...-
Channel: Articles
03-22-2024, 06:39 AM -
-
by seqadmin
The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.
Avian Conservation
Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...-
Channel: Articles
03-08-2024, 10:41 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Yesterday, 06:37 PM
|
0 responses
10 views
0 likes
|
Last Post
by seqadmin
Yesterday, 06:37 PM
|
||
Started by seqadmin, Yesterday, 06:07 PM
|
0 responses
9 views
0 likes
|
Last Post
by seqadmin
Yesterday, 06:07 PM
|
||
Started by seqadmin, 03-22-2024, 10:03 AM
|
0 responses
49 views
0 likes
|
Last Post
by seqadmin
03-22-2024, 10:03 AM
|
||
Started by seqadmin, 03-21-2024, 07:32 AM
|
0 responses
67 views
0 likes
|
Last Post
by seqadmin
03-21-2024, 07:32 AM
|
Comment