Seqanswers Leaderboard Ad

**Lluc** · 08-22-2012, 01:56 AM

I recently succeeded annotating the variants of a non-model species using annovar. It is not straight forward, and I am still checking that I got it right, but I'm satisfied for the moment.

In addition to the list of variants, I prepared 3 files. First, I converted the gff3 annotation to a table like knownGenes from UCSC (name it ending in "_knownGene.txt"). This may be the most difficult step. I wrote a script of my own, that I could share... Second, I generated a dummy file similar to the kgXref from UCSC in which I only filled up the two first columns (known gene ID, and mRNA ID). Name it with the same prefix as before, and ending in "_kgXref.txt". And third, I used the script retrieve_seq_from_fasta.pl, provided with annovar, and the reference genome of my organism in fasta format to generate the fasta file of transcript sequences. Name it with the same prefix and ending in "_knownGeneMrna.fa".

Once you have those three files in the same directory (e.g., /home/database), you run annovar like this:

annotate_variation.pl --geneanno --outfile <outfile> --dbtype knownGene --buildver <prefix_of_your_files> <list_of_variants> /home/database/

Don't forget the final slash in the last argument, indicating the directory where your "database" is.

Good luck.

**chrishah** · 08-23-2012, 02:28 PM

Hi Lluc,

THank you very much for your answer and for sharing your approach!!! I appreciate it! I ll have to farmiliarize me a little bit with annovar and will then try your method (might come back to you with another question..).

For now, I have found another approach that might be interesting for you also, if you want to test an alternative to Annovar: snpEFF in fact also enables you to use your own database, which can be created from your draft assembly. The few steps are explained on this website: http://snpeff.sourceforge.net/supportNewGenome.html. After that you just run snpEFF like here: http://snpeff.sourceforge.net/examples.html#ex3. For me it worked right away. I am just trying to assess the results now.

Maybe you wanna try it..

Thanks again for your help! Good luck! much obliged!

Topics	Statistics	Last Post
TIGR Systems Offer a Compact Alternative to CRISPR for Gene Editing by seqadmin Started by seqadmin, 03-03-2025, 01:15 PM	0 responses 149 views 0 likes	Last Post by seqadmin 03-03-2025, 01:15 PM
Highlights from AGBT 2025 – Part II by seqadmin Started by seqadmin, 02-28-2025, 12:58 PM	0 responses 223 views 0 likes	Last Post by seqadmin 02-28-2025, 12:58 PM
Highlights from AGBT 2025 – Part I by seqadmin Started by seqadmin, 02-24-2025, 02:48 PM	0 responses 590 views 0 likes	Last Post by seqadmin 02-24-2025, 02:48 PM
Selecting the Right AI Model for Bioinformatics Research by seqadmin Started by seqadmin, 02-21-2025, 02:46 PM	0 responses 259 views 0 likes	Last Post by seqadmin 02-21-2025, 02:46 PM

Seqanswers Leaderboard Ad

Announcement

variant effects for non-model organisms

Comment

Comment

Latest Articles

ad_right_rmr

News