  • Tryptamine
    Junior Member
    • Mar 2016
    • 1

    Differential expression of small subset of genes from RNASeq dataset

    Hello all - apologies in advance if this is a statistically naive question!

    We've recently been looking at some of the abundant RNASeq data from the NIH's Cancer Genome Atlas (TCGA) to find differentially expressed genes in biological replicates from paired normal/tumour samples, different tumours, etc. We've had success using DESeq2 and SAMSeq on raw count data, and the large sample numbers have been giving low FDRs/q-values.

    We're only interested in whether a small set (~30) of genes of interest varies between conditions. I was wondering, then, what are the statistical pitfalls of excluding the other genes before input to DESeq2, SAMSeq, etc.? I'm aware that these tools apply a normalisation which takes into account reads across all genes. Is there any other part of the DE analysis that might be thrown off by this? On the other hand, is there anything to be said for reducing the number of multiple comparisons being performed? Or is this just generally a bad idea?
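    To make the normalisation concern concrete, here is a minimal sketch in plain Python of a median-of-ratios size factor calculation, the approach DESeq2 documents as its default. It is not DESeq2's actual code, and the counts are invented toy values: sample B is sequenced twice as deeply as sample A, and the three "genes of interest" are additionally 4x higher in B.

```python
import math
from statistics import median


def size_factors(counts):
    """Median-of-ratios size factors.

    counts: list of gene rows, each a list of per-sample raw counts.
    Genes with a zero count in any sample are skipped, because their
    geometric mean across samples is zero.
    """
    n_samples = len(counts[0])
    log_rows = [[math.log(c) for c in row]
                for row in counts if all(c > 0 for c in row)]
    # Log of the per-gene geometric mean across samples.
    log_geo_means = [sum(row) / n_samples for row in log_rows]
    factors = []
    for j in range(n_samples):
        log_ratios = [row[j] - g for row, g in zip(log_rows, log_geo_means)]
        factors.append(math.exp(median(log_ratios)))
    return factors


# Hypothetical toy data, columns are [sample A, sample B].
background = [[100, 200], [50, 100], [30, 60], [400, 800],
              [10, 20], [75, 150], [20, 40]]          # unchanged genes
interest = [[10, 80], [25, 200], [5, 40]]             # truly 4x up in B

full_sf = size_factors(background + interest)
subset_sf = size_factors(interest)

# Depth ratio B/A implied by each set of size factors.
full_ratio = full_sf[1] / full_sf[0]
subset_ratio = subset_sf[1] / subset_sf[0]
print(f"full matrix: {full_ratio:.2f}, subset only: {subset_ratio:.2f}")
```

    With the full matrix the implied depth ratio is about 2, so the 4x change survives normalisation. With only the genes of interest it is about 8, which absorbs the real change and makes those genes look flat. One way around this, if subsetting is still wanted, would be to estimate the size factors on the full matrix and carry them over to the subset.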

    (There are some peripheral benefits to excluding the genes, including easier data extraction and less processing time over hundreds of samples, at least in DESeq2.)
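    On the multiple-comparisons side of the question, the sketch below shows how a Benjamini-Hochberg adjustment depends on the number of tests. The p-values are made up for illustration (one gene of interest at p = 0.002, evenly spaced "null" p-values for the rest), and `bh_adjust` is a hand-written helper, not a function from DESeq2 or SAMSeq.

```python
def bh_adjust(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical p-values: 30 genes of interest, one with a small p-value.
interest_p = [0.002] + [0.5] * 29
# Hypothetical remaining genes behaving like nulls (evenly spread p-values).
n_background = 19970
background_p = [(k + 0.5) / n_background for k in range(n_background)]

# Adjust across the 30 pre-specified genes only.
subset_adj = bh_adjust(interest_p)
# Adjust across every gene tested.
full_adj = bh_adjust(interest_p + background_p)

print(f"adjusted over 30 genes:    {subset_adj[0]:.3f}")
print(f"adjusted over 20000 genes: {full_adj[0]:.3f}")
```

    In this toy case the same raw p-value adjusts to 0.06 over 30 tests but to something close to 1 over 20,000. The smaller correction is only legitimate if the gene list was fixed in advance, independently of the data being tested. Under that assumption, one option is to fit the model on all genes and then adjust only the p-values of the pre-specified set.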

    Thanks in advance! Jon
