
**dpryan** · 03-11-2016, 12:09 PM

I suspect that, had you not disabled outlier removal, you would have gotten the expected results. The lfcMLE variance for the gene of interest is going to be pretty high, which is why there's such a discrepancy between the lfcMLE and the shrunken lfc.

BTW, if you haven't already done so, do look at a PCA plot of that data. From the two examples you showed, I suspect that it will be telling.

**Jane M** · 03-14-2016, 01:27 AM

Thank you dpryan for your answer.

Originally posted by dpryan

I suspect that, had you not disabled outlier removal, you would have gotten the expected results. The lfcMLE variance for the gene of interest is going to be pretty high, which is why there's such a discrepancy between the lfcMLE and the shrunken lfc.

To me, there is no outlier among the 12 expression values of my gene of interest. The 12 values are in the same range, with one value in the second group that stands out within its own group. I guess this value has no chance of being flagged as an outlier, since the outlier detection is based on all the samples, if I am right.

I reran the analysis without disabling outlier removal:

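Code:

library("DESeq2")

# Sample table: sample names, HTSeq-count file names, and condition (HIST).
# Nom, Fichier and HIST are vectors defined earlier (not shown here).
sampleTable <- data.frame(Nom, Fichier, HIST)

###############################################
### Model ###
###############################################
dds_raw <- DESeqDataSetFromHTSeqCount(sampleTable, "/home/Results/Data/26022016", design = ~HIST)
str(colData(dds_raw)$HIST)

# Use Cond1 as the reference level of the comparison.
dds_raw$HIST <- relevel(dds_raw$HIST, ref = "Cond1")
dds <- DESeq(dds_raw)

#####################################
### Differential expression Tests ###
#####################################
# cooksCutoff = TRUE keeps the default Cook's distance outlier filtering.
resMLE <- results(dds, addMLE = TRUE, cooksCutoff = TRUE, alpha = 0.05)
summary(resMLE)
resMLEOrdered <- resMLE[order(resMLE$padj), ]

# Export the ordered results and the normalized counts as tab-separated files.
write.table(resMLEOrdered, file = "ResMLETable", quote = FALSE, row.names = TRUE, col.names = TRUE, sep = "\t")
write.table(counts(dds, normalized = TRUE), file = "NormalizedTable", quote = FALSE, row.names = TRUE, col.names = TRUE, sep = "\t")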

I got the exact same statistics for both genes.

Originally posted by dpryan

BTW, if you haven't already done so, do look at a PCA plot of that data. From the two examples you showed, I suspect that it will be telling.

I performed PCA on my 12 samples based on the 2000 genes showing the highest variability, as performed by default with plotPCA(). I am not sure what you meant: PCA on this single gene?

Any other suggestions?

**dpryan** · 03-14-2016, 06:05 AM

No, not PCA on a single gene, that wouldn't make any sense. The results of plotPCA on 2000 genes should suffice.

Your 10th sample appears to be an outlier in both examples you posted; presumably the default cutoff isn't catching it (I would expect this to be group-based). Does the 10th sample cluster correctly with the others?
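For context on the default cutoff mentioned here: as far as I know, the DESeq2 documentation describes the default Cook's distance threshold used by results() as the 0.99 quantile of an F(p, m - p) distribution, where p is the number of model coefficients and m the number of samples. The following is only a minimal base-R sketch of that number, assuming the 12 samples and two-level ~HIST design from this thread; the variable names and the `gene_of_interest` row name are placeholders, not part of the original analysis.

```r
# Assumed from the thread: 12 samples, design ~HIST with two levels,
# i.e. p = 2 coefficients (intercept + condition effect).
m <- 12  # number of samples
p <- 2   # number of model coefficients

# Default Cook's distance cutoff as described in the DESeq2 documentation:
# the 99% quantile of the F(p, m - p) distribution.
default_cooks_cutoff <- qf(0.99, df1 = p, df2 = m - p)
print(default_cooks_cutoff)

# With a fitted object `dds` from DESeq(), the per-sample Cook's distances of
# one gene could then be compared against this value (requires DESeq2;
# "gene_of_interest" is a placeholder row name):
# assays(dds)[["cooks"]]["gene_of_interest", ] > default_cooks_cutoff
```

A sample that merely sits apart from its group can stay well below such a threshold, which would be consistent with the filter not catching it.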

**Jane M** · 03-14-2016, 06:36 AM

Originally posted by dpryan

Does the 10th sample cluster with the others correctly?

In the default plotPCA() plot, this 10th sample is indeed the only outlier among its group of 6 samples. In the second condition, the 6 samples show more variance.
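The PCA check discussed above boils down to selecting the most variable genes and running a PCA with samples as observations. Below is a minimal base-R sketch of that idea, assuming a transformed expression matrix with genes in rows; the toy matrix, the helper name `top_var_pca` and its `ntop` value are illustrative only and are not the data or code from this thread. In DESeq2 itself, plotPCA() exposes a similar `ntop` argument and is normally applied to a transformed object (for example from rlog() or varianceStabilizingTransformation()).

```r
# Toy data only: 1000 hypothetical genes x 12 samples, NOT the thread's data.
set.seed(1)
mat <- matrix(rnorm(1000 * 12), nrow = 1000,
              dimnames = list(paste0("gene", 1:1000), paste0("sample", 1:12)))

# Select the `ntop` rows with the highest variance, then run PCA with
# samples as observations (hence the transpose).
top_var_pca <- function(mat, ntop = 500) {
  rv <- apply(mat, 1, var)
  select <- order(rv, decreasing = TRUE)[seq_len(min(ntop, length(rv)))]
  prcomp(t(mat[select, , drop = FALSE]))
}

pca <- top_var_pca(mat, ntop = 200)

# Percentage of variance explained by each principal component.
percent_var <- 100 * pca$sdev^2 / sum(pca$sdev^2)

# Sample coordinates on the first two components, as one would plot them.
print(head(pca$x[, 1:2]))
```

A sample that lies far from the rest of its group in the PC1/PC2 coordinates is a candidate outlier at the whole-sample level, which is a different question from whether a single gene's count is flagged.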

**dpryan** · 03-14-2016, 06:44 AM

OK, so if you remove that one, do you get the expected results? If so, it's likely that the defaults for outlier removal just aren't working ideally for you.

**Jane M** · 03-14-2016, 07:33 AM

Originally posted by dpryan

OK, so if you remove that one, do you get the expected results? If so, it's likely that the defaults for outlier removal just aren't working ideally for you.

Thank you for your help dpryan.

After removing this sample (that is 6 vs 5 samples):

Code:

dds <- DESeq(dds_raw)
resMLE <- results(dds, addMLE=TRUE, cooksCutoff=FALSE,alpha=0.05)

I got the following normalized counts and stats:

91.6129056374 0 0 2.3943405175 0 0 949.8886211299 409.1870751597 674.1357056464 1044.6794670421 443.478082265

| baseMean | log2FoldChange | lfcMLE | lfcSE | stat | pvalue | padj |
| --- | --- | --- | --- | --- | --- | --- |
| 328.6705633998 | -1.1765941557 | -5.4877708718 | 0.5332537169 | -2.2064434215 | 0.0273529676 | 0.2734306821 |

And using:

Code:

dds <- DESeq(dds_raw, minReplicatesForReplace=5)
resMLE <- results(dds, addMLE=TRUE, cooksCutoff=TRUE,alpha=0.05)

I got very similar statistics. That does not surprise me because, to me, there is no outlier in this gene.
Nevertheless, the gene is still not significant, which surprises me a lot!
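As a sanity check on the statistics posted above for the 6 vs 5 comparison, the numbers can be partly reproduced by hand in base R. This is only a sketch: it assumes the first six normalized counts belong to one group and the last five to the other, and the ratio of group means is a rough stand-in for the lfcMLE, not the GLM fit DESeq2 actually performs. All variable names are illustrative.

```r
# Normalized counts for the gene as posted above (11 samples after removal).
# Assumption: the first six values are one group, the last five the other.
norm_counts <- c(91.6129056374, 0, 0, 2.3943405175, 0, 0,
                 949.8886211299, 409.1870751597, 674.1357056464,
                 1044.6794670421, 443.478082265)
group_a <- norm_counts[1:6]
group_b <- norm_counts[7:11]

base_mean <- mean(norm_counts)                    # compare with baseMean
naive_lfc <- log2(mean(group_a) / mean(group_b))  # compare with lfcMLE

# Statistics copied from the posted results.
log2FoldChange <- -1.1765941557
lfcSE <- 0.5332537169

wald_stat <- log2FoldChange / lfcSE               # compare with stat
p_value <- 2 * pnorm(-abs(wald_stat))             # compare with pvalue

print(c(base_mean = base_mean, naive_lfc = naive_lfc,
        wald_stat = wald_stat, p_value = p_value))
```

The posted numbers are consistent with the Wald statistic being the shrunken log2FoldChange divided by lfcSE, so the test reflects the shrunken estimate rather than the much larger lfcMLE. The raw p-value is about 0.027, and padj is larger because it is adjusted for multiple testing across all genes.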



DESeq2: Why is my gene not differentially expressed?
