Seqanswers Leaderboard Ad

**KristenC** · 12-03-2013, 05:40 PM

No one has any ideas?

I'm at loss here...

**Skiaphrene** · 07-14-2014, 08:45 PM

Hi Kristen!

I have a few comments that may help, even if they don't actually answer your 2 questions per se.

First, if you like DESeq, then I strongly recommend "upgrading" to DESeq2, which has many improvements over its predecessor - for example, extended support for more complex experimental designs (Bioconductor page: http://www.bioconductor.org/packages...ml/DESeq2.html).

Second, to my understanding the base DESeq methodology assumes that most genes are not DE only in the way it calculates its scaling factors (DESeq: http://genomebiology.com/2010/11/10/R106 ; Normalisation method review: http://bib.oxfordjournals.org/content/14/6/671.full), i.e. the median-of-ratios approach.
Regarding gene expression dispersion estimation, it only assumes that most genes are not DE when there are no replicates - section "Working without replicates" in the DESeq paper. I'm going to assume you have several samples per condition...
If you expect most/all of your genes to be DE, as one might expect when doing targeted sequencing, then as you pointed out this would be a problem. However, DESeq (and DESeq2) offer the possibility of using supplied scaling factors, rather than the DESeq ones, so if you can come up with alternative scaling factors that don't rely conceptually on most genes being non-DE, then you can use those instead.
Which leads me to my third remark!

Third, rather than use "reference genes", you could consider using ERCC RNA spike-ins as internal controls (Publications that may be of interest: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3166838/ ; http://www.plosone.org/article/info%...l.pone.0041356 ; Life Technologies page: http://www.lifetechnologies.com/orde...roduct/4456739). The general idea is to add in RNA molecules with known but varying concentrations into your sample. This allows one to derive several sequencing QC checks, helps make data comparable across experiments, and more - especially relevant here, they can be used for sample normalisation, either by normalising to one or more ERCCs, or by letting DESeq calculate its scaling factors on just the ERCCs (see the 2nd reply to this reply of this topic: https://www.biostars.org/p/81803/#81817). Either way it should be more robust that "housekeeping" genes.

Obviously I may be biased as we're a) using ERCC spike-ins in our 3 sample-per-condition RNA seq project and b) using DESeq2 to analyse the expression estimates, and we're pretty happy with the results!

Let me know what you think!

Hope this helps,

-- Alex

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Today, 11:49 AM	0 responses 13 views 0 likes	Last Post by seqadmin Today, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Yesterday, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin Yesterday, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

DE Analysis of targeted RNA sequencing with many DE genes

Comment

Comment

Latest Articles

ad_right_rmr

News