

11-24-2013, 02:00 PM   #1
Junior Member
Location: St Louis

Join Date: Nov 2013
Posts: 1
edgeR spliceVariants: gene- and exon-level dispersion


I'm trying to detect alternative splicing between 2 experimental conditions
using edgeR's spliceVariants (and DE(X)Seq).

For each gene, spliceVariants uses a single dispersion calculated by
estimateExonGeneWiseDisp, which simply aggregates all exon counts
within a gene and calculates a per-gene dispersion based on those
aggregated counts. This seems highly anti-conservative (i.e., gives
extremely low dispersions). The counts being fit are exon-level counts--
i.e., smaller numbers with larger dispersions. Am I missing some theoretical
or intuitive justification for this choice? Wouldn't a less severely
anti-conservative choice be the minimum dispersion across all exons within
the gene (still larger than the one provided by estimateExonGeneWiseDisp),
and an intuitively conservative choice be the maximum?
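
To make the concern concrete, here is a minimal sketch of why a dispersion computed from aggregated counts can fall below every exon-level dispersion. It assumes the usual negative binomial relation variance = mean + dispersion * mean^2 and, as a simplifying assumption, that exon counts within a gene vary independently of each other. It uses a plain method-of-moments calculation for illustration only; it is not how estimateExonGeneWiseDisp computes its estimate, and the function names and numbers are hypothetical.

```python
def nb_variance(mean, dispersion):
    """Negative binomial variance: mean + dispersion * mean^2."""
    return mean + dispersion * mean ** 2


def moment_dispersion(mean, variance):
    """Method-of-moments dispersion implied by a mean and a variance."""
    return (variance - mean) / mean ** 2


def aggregated_dispersion(means, dispersions):
    """Dispersion implied for the summed (gene-level) count.

    Assumption: exon counts are independent, so means and variances add.
    """
    total_mean = sum(means)
    total_var = sum(nb_variance(m, d) for m, d in zip(means, dispersions))
    return moment_dispersion(total_mean, total_var)


# Hypothetical gene with four exons, each with mean count 100 and dispersion 0.1.
exon_means = [100.0, 100.0, 100.0, 100.0]
exon_dispersions = [0.1, 0.1, 0.1, 0.1]

gene_dispersion = aggregated_dispersion(exon_means, exon_dispersions)
print(gene_dispersion)
```

With these made-up numbers the gene-level value is 0.025, a quarter of the per-exon 0.1, so it is smaller than even the minimum exon dispersion. If the exons instead shared all of their biological variation, the aggregated dispersion would stay at 0.1, so the size of the gap depends on how correlated the exons are.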

If I understand this statistical framework correctly, I should be able to use
a per-exon dispersion--clearly this is possible if I take my tags to be exons,
but in theory it should also be possible in the spliceVariants scenario in
which the tags are genes, though the counts represent exons. DEXSeq
appears to be doing this. Is there a straightforward means of doing this within edgeR? The interface to glmFit seems to preclude it.

Thank you,
Brian
11-25-2013, 03:19 AM   #2
Devon Ryan
Location: Freiburg, Germany

Join Date: Jul 2011
Posts: 3,480

You might have to ask this on the Bioconductor mailing list so someone from Gordon Smyth's group can reply. I would be rather hesitant to rely on spliceVariants() for the reasons that you list.
11-27-2013, 02:49 AM   #3
Location: Berlin

Join Date: Oct 2010
Posts: 71

Hi Brian,

I am also interested in this topic, so could you please keep this thread updated in case you find your answer?

rboettcher
09-11-2014, 03:47 PM   #4
Gordon Smyth
Location: Melbourne, Australia

Join Date: Apr 2011
Posts: 91

We are most of the way through a major overhaul and improvement of edgeR's spliceVariants() function. We haven't made the new version public yet -- will do when it is stable.

In the meantime, you might try the diffSplice() function in the limma package, which is very fast and controls the false discovery rate conservatively.
