Seqanswers Leaderboard Ad

**sdriscoll** · 04-10-2012, 01:27 PM

i guess the biggest and most important step in DE testing is the normalization step. second to that is the variance modeling and finally the statistical test used. if the normalization isn't done properly then every is going to be off. I've had some trouble with cuffdiff's normalization myself though not exactly what you're saying happened to you.

item 1:
i think the "normal" in this case depends on the samples. check the log2 fold change values between the two outputs. If you're seeing wildly different fold changes then I'd go with the -N output. this might indicate that read count normalization is introducing some heavy skew in your samples. the quartile normalization is slightly more robust, or so it has been shown (http://www.biomedcentral.com/1471-2105/11/94/)

2. the gene expression information that comes out of cufflinks wouldn't be used if you are continuing on to use cuffdiff for DE. you'll only use the expression values from cufflinks if you intend to do something with each sample's expressions (like sample clustering). in the Nature pipeline I think the cufflinks step is mostly about getting the "transcripts.gtf" files so you can use cuffmerge and create a customized GTF for your samples.

3. I think regardless of the normalization you'll see similar gene expression densities since cuffdiff uses some unknown scaling factor post -N normalization to "correct" the FPKM values to make them look like FPKM's calculated "per million mapped reads" style. If you calculate FPKM's "per upper quartile" then you get expressions several orders of magnitude larger than "per million mapped reads". so...i wouldn't expect those density plots to look different.

As a final check between the normalizations I'd make a scatter plot for each and have a look at those (the csScatter command). Have a look at those plots and make sure the main body of the scatter is more or less centered on the diagonal line that would indicate 1-fold change. If the data is noticeably pulled to one side or the other that would indicate some normalization problems. For an example see my post where I found cuffdiff to blow the normalization if one of my samples has low read depth: http://seqanswers.com/forums/showthread.php?t=19104

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 25 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 28 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 24 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 52 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Cuffdiff with and without Quantile Normalization

Comment

Latest Articles

ad_right_rmr

News