**feralBiologist** · 11-07-2013, 11:34 AM

It is better to have a single count table. This should increase statistical power: you are likely to get a somewhat higher number of differentially expressed genes, as DESeq will be able to tease out more signal from the noise.

**rndouglas** · 11-07-2013, 11:59 AM

This is what I had been doing all along (running everything in one count table), but then this morning I thought I'd try A vs. B in a separate count table.

I was surprised to find almost double the loci with padj < 0.05 compared to when I ran everything in one count table (and hence my new-found concern).

**dpryan** · 11-07-2013, 12:17 PM

Have a look at the size factors. If one of them in the full dataset is very different from the others, that can cause this sort of result.
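For anyone following along who has not looked at size factors before: in DESeq they can be read off with `sizeFactors(cds)` once `estimateSizeFactors` has been run. The block below is only a rough Python sketch of the median-of-ratios idea behind them, not DESeq's own code. The function name `size_factors` and the toy counts are made up for illustration. It also shows why the factors for A and B can differ between a 1v1 table and the full table: the per-locus reference (the geometric mean) is computed over whichever samples are in the table.

```python
import math
from statistics import median


def size_factors(counts):
    """Median-of-ratios size factors.

    counts: list of rows, one per locus; each row is a list of raw counts,
    one per sample. Returns one size factor per sample.
    """
    n_samples = len(counts[0])
    # Only loci with a nonzero count in every sample have a finite
    # log geometric mean, so the rest are skipped.
    usable = [row for row in counts if all(c > 0 for c in row)]
    if not usable:
        raise ValueError("no locus has nonzero counts in every sample")
    # Per-locus reference: log of the geometric mean across samples.
    log_geo_means = [sum(math.log(c) for c in row) / n_samples for row in usable]
    factors = []
    for j in range(n_samples):
        # Log ratio of this sample's count to the per-locus reference.
        log_ratios = [math.log(row[j]) - ref for row, ref in zip(usable, log_geo_means)]
        factors.append(math.exp(median(log_ratios)))
    return factors


# Toy table (invented numbers): the third sample is sequenced twice as deep.
toy_counts = [
    [10, 10, 20],
    [100, 100, 200],
    [50, 50, 100],
]
full_factors = size_factors(toy_counts)

# Same first two samples, but in a table of their own.
pair_factors = size_factors([row[:2] for row in toy_counts])
```

In the toy table the third sample ends up with a size factor twice that of the other two, and the first two samples get different factors depending on whether the third sample is in the table. Comparing the factors from the full table against those from each 1v1 table is a quick way to see whether one library is pulling the normalisation.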

**feralBiologist** · 11-07-2013, 01:14 PM

Originally posted by rndouglas:

This is what I had been doing all along (running everything in one count table), but then this morning I thought I'd try A vs. B in a separate count table. I was surprised to find almost double the loci with padj < 0.05 compared to when I ran everything in one count table (and hence my new-found concern).

This is really strange and counter-intuitive. Do you find more loci for all the contrasts? I can imagine this being the case for a small subset of the contrasts where there happens to be less variation within the sample groups, but I find it hard to believe that you would get, in total, a much higher number of differentially expressed features by splitting the count table. What sort of normalisation do you do? Can you give more details about your bioinformatics workflow?

**rndouglas** · 11-08-2013, 06:54 AM

I just realized I'm only seeing this in my small RNA libraries (adapters removed, t/rRNA removed, size-selected for 20-25 nt in sRNA Workbench).

I map the reads using bowtie (-v 0).

I generate read counts with htseq-count, then build my count table(s).

I run DESeq following section 3.1 of the vignette.

So far, every 1v1 count table I've looked at (7 of the possible 15) has called more significantly changed loci (padj < 0.05) than I get for the exact same comparison using a count table that includes all 30 of my bio-reps.

The biggest 'jump' was from 76 to 372 loci for one comparison.
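To make the bookkeeping concrete, here is a small Python sketch of how the 1v1 tables relate to the full table. It assumes six conditions with five bio-reps each, which is only inferred from the "15 possible comparisons" and "30 bio-reps" figures above; the actual design may be different. Only the labels A and B come from the thread, the other labels and the helper `columns_for_pair` are hypothetical.

```python
from itertools import combinations

# A and B are mentioned in the thread; C-F are placeholder labels (assumption).
conditions = ["A", "B", "C", "D", "E", "F"]

# Every unordered pair of conditions is one possible 1v1 comparison.
pairs = list(combinations(conditions, 2))

# Assumed layout of the full count table: five bio-reps per condition,
# columns grouped by condition.
sample_conditions = [cond for cond in conditions for _ in range(5)]


def columns_for_pair(sample_conditions, pair):
    """Return the column indices of the full table that belong to a 1v1 table."""
    return [i for i, cond in enumerate(sample_conditions) if cond in pair]


ab_columns = columns_for_pair(sample_conditions, ("A", "B"))
```

Under these assumptions each 1v1 table is just a 10-column slice of the 30-column table, so any difference in the number of loci called comes from how normalisation and dispersion are estimated on the slice versus the whole, not from the counts themselves.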

What to include in my count table(s) for DESeq

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News