![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
variant annotation | Balat | Bioinformatics | 3 | 03-05-2013 02:59 AM |
variant annotation for yeast (saccer) | christophpale | Bioinformatics | 7 | 01-31-2012 02:59 PM |
dbSNP updates and the effect on variant annotation results | warrenemmett | Bioinformatics | 0 | 10-18-2011 01:31 PM |
New version of GAMES (V1.3) for annotation and interpretation of variant from NGS | m_elena_bioinfo | Bioinformatics | 0 | 05-25-2011 09:12 AM |
Annotation tools for NGS data | groody | Vendor Forum | 3 | 01-26-2011 11:24 PM |
![]() |
|
Thread Tools |
![]() |
#1 |
Senior Member
Location: Palo Alto Join Date: Apr 2009
Posts: 213
|
![]()
Hi all,
I'm looking for suggestions of variant annotation tools for large data sets. For example, I've called variants using Samtools pileup and now I want to go from a huge list of variants to a list of annotations and a simple method for filtering them. Any thoughts on things I might try?
__________________
Mendelian Disorder: A blogshare of random useful information for general public consumption. [Blog] Breakway: A Program to Identify Structural Variations in Genomic Data [Website] [Forum Post] Projects: U87MG whole genome sequence [Website] [Paper] |
![]() |
![]() |
![]() |
#2 |
Member
Location: Valencia, Spain Join Date: Aug 2009
Posts: 70
|
![]()
We've done similar things at my lab. We haven't dealt with very large datasets, just a couple of illumina and 454 together. If you want to take a look at our documentation you can. We also did a small tutorial session on the topic.
I hope that could serve you as inspiration. |
![]() |
![]() |
![]() |
#3 |
Member
Location: India Join Date: Oct 2008
Posts: 36
|
![]()
You can try VarScan...takes pileup as input
|
![]() |
![]() |
![]() |
#4 | |
Senior Member
Location: Palo Alto Join Date: Apr 2009
Posts: 213
|
![]()
Thanks for the feedback.
I'm going to try VarScan because I've already done the variant calling and have the pileup files. Does anyone have suggestions on annotation and filtering programs downstream of VarScan for annotation? For example, going from the list of variants to coding consequences (marking whether and how variants affect coding sequences), and parsing by type of variant (indels vs SNVs) or coverage/quality? I'm actually also having trouble getting VarScan to work, actually: I used samtools 0.1.7-5 (r528) to generate pileup using the -c -a -f hg18.fa -r 0.0000007 options. When I tried running one of the "pileup2" commands in VarScan, this is happening: Quote:
I'm also wondering what the possible Options are when running each command in VarScan. I don't see a list on the site (and if it's in the code, I'm afraid I may not be savvy enough to figure that out myself so assistance is appreciated). For example, can I play with "min avg qual" and such? Thanks.
__________________
Mendelian Disorder: A blogshare of random useful information for general public consumption. [Blog] Breakway: A Program to Identify Structural Variations in Genomic Data [Website] [Forum Post] Projects: U87MG whole genome sequence [Website] [Paper] Last edited by Michael.James.Clark; 06-18-2010 at 02:45 PM. |
|
![]() |
![]() |
![]() |
#5 |
Senior Member
Location: Germany Join Date: May 2010
Posts: 101
|
![]()
The VarScan manual site says that it cannot process pileup created with the -c option:
"Do NOT use the -c parameter. It generates consensus format, which is different from pileup format. The next release of VarScan will recognize both formats. Note, to save disk space and file I/O, you can redirect pileup output directly to VarScan with a "pipe" command. For example: samtools pileup -f reference.fasta myData.bam | java -jar VarScan.v2.1.jar pileup2snp" c stands for consensus and it looks just as the parsing exception was caused by that consensus "A". So you should run pileup without -c to use it for VarScan. Or wait for the promised next release/someone to do a clever hack to the code ... |
![]() |
![]() |
![]() |
#6 |
Member
Location: India Join Date: Oct 2008
Posts: 36
|
![]()
try samtools pileup -vcf
gives only varients |
![]() |
![]() |
![]() |
#7 |
Senior Member
Location: Palo Alto Join Date: Apr 2009
Posts: 213
|
![]()
Great, thanks guys. I think last week I was only seeing the "Documentation" not the "Manual" from the site. The Manual describes just what I wanted to know.
Rao, the -c option's consensus output appears to be the issue. Can still potentially use -v to only output variants, though.
__________________
Mendelian Disorder: A blogshare of random useful information for general public consumption. [Blog] Breakway: A Program to Identify Structural Variations in Genomic Data [Website] [Forum Post] Projects: U87MG whole genome sequence [Website] [Paper] Last edited by Michael.James.Clark; 06-21-2010 at 10:36 AM. |
![]() |
![]() |
![]() |
#8 | |
Senior Member
Location: Palo Alto Join Date: Apr 2009
Posts: 213
|
![]()
Alright, that worked and I got output. It looks believable to me, but I've encountered some issues.
For one thing, I can't get the "filter" command to report anything. No matter what settings I use, it reports 0 variants passing filter, and for the other, when I delve into the variant file, I can find variants that should pass filter. Has anyone gotten it to work? I also tried the somatic command, and it looks like it worked, but I've got some curiosities in it as well. Example output: Quote:
__________________
Mendelian Disorder: A blogshare of random useful information for general public consumption. [Blog] Breakway: A Program to Identify Structural Variations in Genomic Data [Website] [Forum Post] Projects: U87MG whole genome sequence [Website] [Paper] |
|
![]() |
![]() |
![]() |
#9 | |
Member
Location: Nashville Join Date: Oct 2009
Posts: 14
|
![]() Quote:
You could try SVA in DUKE (http://people.genome.duke.edu/~dg48/sva/index.php). I think this big guy can satisfy your request if you have a big computer. Wu |
|
![]() |
![]() |
![]() |
#10 | |
Nils Homer
Location: Boston, MA, USA Join Date: Nov 2008
Posts: 1,285
|
![]() Quote:
|
|
![]() |
![]() |
![]() |
#11 |
Member
Location: Bonn Join Date: Feb 2010
Posts: 30
|
![]()
You might want to try www.gene-talk.de
|
![]() |
![]() |
![]() |
#12 |
Junior Member
Location: SF bay area Join Date: Nov 2011
Posts: 7
|
![]()
Is there an update to this post recommending tools for variant annotation and analysis? I'm trying to use R's VariantAnnotation package but the learning curve is frustrating me and I'm not sure it's worth the effort...
|
![]() |
![]() |
![]() |
#13 |
Member
Location: United States Join Date: May 2011
Posts: 26
|
![]()
I believe the two most commonly used tools are annovar and SNPEff. Annovar handles many types of annotations and is built for filtering. SNPEff produces some nice html files for your web-viewing enjoyment in addition to text files.
|
![]() |
![]() |
![]() |
#14 |
Member
Location: Bonn Join Date: Feb 2010
Posts: 30
|
![]()
I agree annovar and SNPeff seem to be most widely used for variant annotation. For variant analysis there are e.g. ingenuity (commercial), annotate-it and www.gene-talk.de. We are using GeneTalk at the institute for medical genetics at Berlin Charité and are collaborating with the R&D. The platform seems to be rather commonly used now. We have currently about one hundred single exomes analyzed per day by about 500 unique users. The annotation is based on annovar. The filtering and interpretation tools are codeveloped by us but it is generally a project open to any kind of collaboration. We just added a new filter for compound heterozygous filtering so if this is something you are interested in, just try it out,...
|
![]() |
![]() |
![]() |
#15 |
Member
Location: St Louis, USA Join Date: Nov 2011
Posts: 14
|
![]()
I agree with Annotation and snpEff being widely used. I had a chance to use SeattleSeq annotation recently when I had to calculate some Grantham scores - http://snp.gs.washington.edu/SeattleSeqAnnotation137/ You could check it out if you'd prefer a web interface to submit jobs to. Galaxy does a bit of annotation as well ( I've used Galaxy for obtaining PhyloP scores).
|
![]() |
![]() |
![]() |
#16 |
Junior Member
Location: SF bay area Join Date: Nov 2011
Posts: 7
|
![]()
thanks for the suggestions.
|
![]() |
![]() |
![]() |
Tags |
annotation, variants |
Thread Tools | |
|
|