Seqanswers Leaderboard Ad

**Lluc** · 08-22-2012, 01:56 AM

I recently succeeded annotating the variants of a non-model species using annovar. It is not straight forward, and I am still checking that I got it right, but I'm satisfied for the moment.

In addition to the list of variants, I prepared 3 files. First, I converted the gff3 annotation to a table like knownGenes from UCSC (name it ending in "_knownGene.txt"). This may be the most difficult step. I wrote a script of my own, that I could share... Second, I generated a dummy file similar to the kgXref from UCSC in which I only filled up the two first columns (known gene ID, and mRNA ID). Name it with the same prefix as before, and ending in "_kgXref.txt". And third, I used the script retrieve_seq_from_fasta.pl, provided with annovar, and the reference genome of my organism in fasta format to generate the fasta file of transcript sequences. Name it with the same prefix and ending in "_knownGeneMrna.fa".

Once you have those three files in the same directory (e.g., /home/database), you run annovar like this:

annotate_variation.pl --geneanno --outfile <outfile> --dbtype knownGene --buildver <prefix_of_your_files> <list_of_variants> /home/database/

Don't forget the final slash in the last argument, indicating the directory where your "database" is.

Good luck.

**chrishah** · 08-23-2012, 02:28 PM

Hi Lluc,

THank you very much for your answer and for sharing your approach!!! I appreciate it! I ll have to farmiliarize me a little bit with annovar and will then try your method (might come back to you with another question..).

For now, I have found another approach that might be interesting for you also, if you want to test an alternative to Annovar: snpEFF in fact also enables you to use your own database, which can be created from your draft assembly. The few steps are explained on this website: http://snpeff.sourceforge.net/supportNewGenome.html. After that you just run snpEFF like here: http://snpeff.sourceforge.net/examples.html#ex3. For me it worked right away. I am just trying to assess the results now.

Maybe you wanna try it..

Thanks again for your help! Good luck! much obliged!

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Today, 11:49 AM	0 responses 12 views 0 likes	Last Post by seqadmin Today, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Yesterday, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin Yesterday, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

variant effects for non-model organisms

Comment

Comment

Latest Articles

ad_right_rmr

News