
03-25-2011, 12:39 AM   #1
Junior Member
Location: Sweden

Join Date: Mar 2011
Posts: 1
How to map 454 reads/contigs to a mitochondrial genome?

This is my first post, so I would like to start with a hello to all SEQanswers users.

What I would like to do is map the coverage/abundance over the whole mt genome. I would be very thankful for any tips and advice on the best way of doing this (tools, scripts, programs).

I want to make a graph with the mt genome on the x-axis and the coverage/abundance of reads on the y-axis. Yes, the mitochondrial genome only has one starting point for transcription, so I guess I could expect a homogeneous distribution over the whole length. But when I tried to assemble the mt genome from the 454 reads, the assembly was incomplete, and some regions got a lot of hits from reads starting at the same position (a technical artefact?).
First-strand synthesis was done from the poly-A end towards the 5' end, and sequencing was directed from the 5' end (a directionally sequenced EST library).

What I have is:
1) a non-normalized EST library from about one 454 run (5 SFF files), containing a lot of sequences of mitochondrial origin
2) a complete mitochondrial genome (circular, 16,150 nucleotides, Sanger-sequenced with a proofreading Taq).

First I need to decide whether to use the reads or the contigs. When I BLAST the mt genome against all contigs (46,375), I get 5,594 hits with an E-value cut-off of 1.0e-03. The bit scores range from 1,300 down to 50, so sometimes nearly the whole length of a contig is matched and sometimes just a tiny fraction of it (which I find strange). I am afraid that this might mess up the graph! I may be able to write a Perl script (I have very basic skills, but with enough time...) that parses out just the aligned parts into a new FASTA file, and then find a program that can map/plot those against the mt genome. Would it be possible to use that FASTA file with MIRA to make a new assembly with the mt genome file as a scaffold? The reason I ask is that MIRA produces an .ace file that can be read by Tablet (to get a stacked graph that visualises the abundance of sequences over the mt genome).
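As a rough sketch of the parsing step described above (in Python rather than Perl), the aligned intervals can be turned straight into per-position coverage without writing an intermediate FASTA file. This assumes the BLAST results were saved in the standard 12-column tabular format with the mt genome as the query, so that columns 7 and 8 (qstart, qend) are genome coordinates; the function name and the toy hit lines are made up for illustration.

```python
def coverage_from_blast_tab(lines, genome_length, max_evalue=1e-3):
    """Count, for each genome position, how many BLAST HSPs cover it.

    Assumes 12-column tabular BLAST output with the mt genome as the query:
    qseqid sseqid pident length mismatch gapopen qstart qend sstart send
    evalue bitscore. Coordinates are 1-based and inclusive.
    """
    # Difference array: +1 where an HSP starts, -1 just after it ends.
    diff = [0] * (genome_length + 1)
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        qstart, qend = int(fields[6]), int(fields[7])
        evalue = float(fields[10])
        if evalue > max_evalue:
            continue  # apply the E-value cut-off
        start, end = min(qstart, qend), max(qstart, qend)
        diff[start - 1] += 1
        diff[end] -= 1
    # A running sum of the difference array gives the depth at each position.
    coverage = []
    depth = 0
    for delta in diff[:genome_length]:
        depth += delta
        coverage.append(depth)
    return coverage


# Toy example on a 10 bp "genome" (hypothetical hits, not real data).
example_hits = [
    "mtgenome\tcontig00001\t99.0\t5\t0\t0\t1\t5\t10\t14\t1e-50\t200",
    "mtgenome\tcontig00002\t98.0\t4\t0\t0\t3\t6\t8\t5\t1e-20\t100",
    "mtgenome\tcontig00003\t90.0\t3\t0\t0\t8\t10\t1\t3\t0.5\t20",  # fails cut-off
]
print(coverage_from_blast_tab(example_hits, genome_length=10))
```

The resulting list has one value per genome position, so it can be plotted directly with the genome on the x-axis. Note that this counts contigs, not reads, so a contig built from many reads contributes the same depth as one built from a few.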

Do you think this is a good approach or do you have any other suggestions?

All guidance is much appreciated.

fruktimport
03-28-2011, 10:12 AM   #2
Location: Oslo, Norway

Join Date: Nov 2008
Posts: 415

If you could get your hands on Newbler from 454, you could map the reads using gsMapper/runMapping and check the 454AlignmentInfo.tsv file:

"It lists, for all contigs, for each base, the read depth, quality and signal intensity. Using graph software (for example R, I use Origin) you can plot the depth for one or a large number of contigs." (quoting myself from an earlier post)
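For readers who prefer a script to graph software, a per-base depth table of this kind could be read along the following lines. This is only a sketch: it assumes a tab-separated layout in which a line starting with ">" opens each contig and each data line begins with the position, and the column headings in the toy input as well as the default `depth_column` are assumptions that should be checked against the actual 454AlignmentInfo.tsv file.

```python
def read_depth_table(lines, depth_column=3):
    """Collect (position, depth) pairs per contig from a per-base table.

    Assumed layout (verify against the real file): tab-separated text,
    a line starting with '>' introduces a contig, and every data line
    starts with an integer position. depth_column is the 0-based index
    of the depth column and is a hypothetical default.
    """
    depths = {}
    current = None
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if fields[0].startswith(">"):
            current = fields[0][1:]  # contig name without the '>'
            depths[current] = []
        elif current is not None and fields[0].isdigit():
            depths[current].append((int(fields[0]), int(fields[depth_column])))
        # anything else (header line, blank line) is ignored
    return depths


# Toy input; the column headings here are assumed, not taken from the thread.
example_table = [
    "Position\tConsensus\tQuality Score\tUnique Depth\tAlign Depth\tSignal\tStdDeviation",
    ">contig00001\t1",
    "1\tA\t64\t3\t3\t1.00\t0.00",
    "2\tC\t64\t5\t6\t1.00\t0.00",
]
print(read_depth_table(example_table))
```

Each contig's list of (position, depth) pairs can then be handed to whatever plotting tool is at hand.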
flxlex
03-28-2011, 10:35 AM   #3
Peter (Biopython etc)
Location: Dundee, Scotland, UK

Join Date: Jul 2009
Posts: 1,543

I'm not aware of any mappers which support circular reference sequences directly. One hack is to extend the end of the mitochondrial genome with wrapped sequence from the start (use at least as many bp as your longest read, or just map to a doubled mitochondrial genome). You then have to massage the mapping positions afterwards, which is not too hard if all you want is a coverage plot.
maubp
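A minimal sketch of that hack, assuming the reference is available as a plain string and the mapper's output has already been reduced to a per-position depth list over the extended reference. Both function names are hypothetical, and the "massaging" shown here is simply taking positions modulo the original genome length.

```python
def extend_circular_reference(sequence, overlap):
    """Append the first `overlap` bases to the end of a circular sequence.

    `overlap` should be at least the length of the longest read; passing
    len(sequence) gives the doubled reference.
    """
    return sequence + sequence[:overlap]


def fold_coverage(extended_coverage, genome_length):
    """Fold per-position depth on the extended reference back onto the circle.

    Position i on the extended reference corresponds to position
    i % genome_length on the original genome, so depths are summed there.
    """
    folded = [0] * genome_length
    for index, depth in enumerate(extended_coverage):
        folded[index % genome_length] += depth
    return folded


# Toy example: a 5 bp circular genome extended by 2 bp.
print(extend_circular_reference("ACGTT", 2))
print(fold_coverage([1, 1, 0, 0, 2, 2, 2], genome_length=5))
```

Summing works as long as each read is placed only once by the mapper; reads that fall entirely inside the duplicated region may be reported at either copy, or flagged as multi-mapping, depending on the mapper's settings.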
