Seqanswers Leaderboard Ad

**sdriscoll** · 11-08-2012, 09:18 AM

This is by design. I think it would be wiser and more science like to use the following type of pipeline:

Align reads once for expression analysis.
Quantify gene expression once as either read counts, cufflinks estimates or try RSEM
Use a DE tool like DESeq, edgeR or the more recent EBSeq which seems to have improved on previous methods a bit.

It doesn't make sense to get multiple expression estimates from the same data even though it makes sense from the computation logic side of things.

I think RSEM might be my new favorite except that it doesn't run well on my Mac system. If you have a strong Linux system then it should be good. If you run RSEM multiple times on the same data you will see some variation in its isoform level assignment of expression however that's a result of variable/random behavior of the aligner. The gene level estimates are more stable. This only exposes the fact that we have all probably been dealing with this extra uncertainty in expression values all along. Their pipeline requires bowtie and they run it in a specific way for good quantification estimates. So if you want to try the RSEM pipeline you only need to do that and you can skip the initial alignments because RSEM provides them for you. You would run it once per sample and then merge the data for DE analysis. They recommend EBSeq, in fact they package it with their software.

**sdriscoll** · 11-08-2012, 05:45 PM

today I discovered eXpress (http://bio.math.berkeley.edu/eXpress/overview.html) which uses the same basic algorithm as RSEM but is much faster and it produces more verbose output. so far I like it and I've seen that it's expression estimates correlate very highly (r > 0.8) with 'true' expressions from synthetic data analysis. someone shared a slideshow with me outlining an evaluation of current possible pipelines using the BEERS pipeline (http://www.cbil.upenn.edu/BEERS/). cufflinks wasn't even on the map with count estimates correlating 0 < r < 0.2 - or in other words the expression estimates looked like random noise compared to the true values.

**hlwright** · 11-09-2012, 04:51 AM

Thanks sdriscoll - I have been getting more and more frustrated with cufflinks / cuffdiff so will explore these other options.

Really appreciate you replying to my post.

Helen

Topics	Statistics	Last Post
Telomere Maintenance by PARP1: A New Perspective in Cancer Research by seqadmin Started by seqadmin, 05-07-2024, 06:57 AM	0 responses 12 views 0 likes	Last Post by seqadmin 05-07-2024, 06:57 AM
Enhanced Neoantigen Detection: Introducing NeoHunter by seqadmin Started by seqadmin, 05-06-2024, 07:17 AM	0 responses 16 views 0 likes	Last Post by seqadmin 05-06-2024, 07:17 AM
A Close Examination at Probiotic-Related Bacteremia by seqadmin Started by seqadmin, 05-02-2024, 08:06 AM	0 responses 21 views 0 likes	Last Post by seqadmin 05-02-2024, 08:06 AM
Expanded Genetic Insights into Blood Pressure Regulation by seqadmin Started by seqadmin, 04-30-2024, 12:17 PM	0 responses 24 views 0 likes	Last Post by seqadmin 04-30-2024, 12:17 PM

Seqanswers Leaderboard Ad

Announcement

Different RPKM values in same dataset using Cufflinks or Cuffdiff (v1.3.0)

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News