SEQanswers

SEQanswers (http://seqanswers.com/forums/index.php)
-   Bioinformatics (http://seqanswers.com/forums/forumdisplay.php?f=18)
-   -   DEXSeq questions on power and counting bins (http://seqanswers.com/forums/showthread.php?t=31569)

alittleboy 07-02-2013 11:00 AM

DEXSeq questions on power and counting bins
 
I originally post this in the Bioconductor mailing list, but I hope that more colleagues can see it and help me clarify the questions. If you've seen this in the mailing list, please ignore this one ;-)

I am using DEXSeq for testing differential exon usage (DEU) between two conditions, each having 3 biological replicates. The design matrix contains a covariate called "subject", i.e. the same subject had both the control and treatment. I have three questions:

(1) In the vignette of DEXSeq, in Figure 3 the author compared the adjusted p-values from two tests: without the batch effect (x-axis) and with the batch effect (y-axis). Clearly from the plot, the p-values are smaller when the batch effect is accounted for. However, I don't know if we can conclude from such a plot that "with the type-aware analysis, detection power for DEU due to condition is improved"? It is from a real data analysis, so how do we know the significant genes are really true positives? BTW, in my analysis, after accounting for the subject effect, the number of genes with DEU increases from 14 to 24, and the plot for comparing the p-values are similar to Figure 3. Will properly accounting for covariates in DEXSeq always lead to such a conclusion (i.e. increased detection power)?

(2) I got the HTML outputs using DEXSeqHTML() in R. For each gene with DEU, I can see different plot options: counts, expression, splicing and transcripts. By only looking at the plot with differentially expressed exon(s) in color, it seems that the conclusion is only based on "splicing" as I can see the "distance" between the two conditions, but such distance is very small in other plot options. In the vignette, the author also recommend specifying "splicing=TRUE". Can I know what are the differences among those options, and which one is preferred to use (making more biological sense)?

(3) DEXSeq's inference is based on "counting bins", i.e. not real exons but exonic regions redefined from GTF file. My question is, once I obtain a gene with DEU (ENSG00000056558), how can I know which *real exon* are deferentially used? From Ensembl website, this gene has 3 transcripts (splice variants), but in the DEXSeqHTML output, it has 11 exonic regions, and E010 shows DEU -- how can I tell E010 in this case correponds to which real exon?

Thank you so much for your suggestions! :)


All times are GMT -8. The time now is 10:47 PM.

Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2020, vBulletin Solutions, Inc.