SEQanswers

Old 10-13-2010, 08:46 PM   #1
polyhedron
Member
 
Location: China

Join Date: Aug 2009
Posts: 12
Request for gapped (indel) alignment software

I currently have single-end Solexa data and will later be working with paired-end (PE) data as well.

I tried soap2 with the -g 5 option for gapped alignment, but nothing was properly mapped at known indels. I have seen someone on SEQanswers report successful gapped alignment, though possibly with soap rather than soap2. The developer says soap2 can do gapped PET alignment, but does "gapped" there mean the interval between the two ends rather than indels?

Bowtie says it doesn't support gapped alignment yet.

Can anyone tell me which free software can do gapped alignment (for indels) on both SE and PE data? Many thanks!

Last edited by polyhedron; 10-14-2010 at 09:09 AM.
Old 10-14-2010, 12:24 AM   #2
francois.sabot
Member
 
Location: France

Join Date: Dec 2009
Posts: 41

I would say that BWA handles indels...
__________________
Francois Sabot, PhD

Be realistic. Demand the Impossible.
www.wikiposon.org
Old 10-14-2010, 01:14 AM   #3
Michael.James.Clark
Senior Member
 
Location: Palo Alto

Join Date: Apr 2009
Posts: 213

Novoalign, BWA, BFAST

Oh, Novoalign is not exactly free. You can get a free trial from them, though, and it is not very expensive.
__________________
Mendelian Disorder: A blogshare of random useful information for general public consumption. [Blog]
Breakway: A Program to Identify Structural Variations in Genomic Data [Website] [Forum Post]
Projects: U87MG whole genome sequence [Website] [Paper]

Last edited by Michael.James.Clark; 10-14-2010 at 01:18 AM.
Old 10-20-2010, 06:36 AM   #4
polyhedron
Member
 
Location: China

Join Date: Aug 2009
Posts: 12

Thanks very much, François and Michael!

I've been testing bwa over the past few days and find that it has considerably better mappability than soap2 and handles indels well. The trade-off is that bwa is 3 to 4 times slower than soap2, and I haven't found output options as flexible as those in soap2, such as

Quote:
-r INT How to report repeat hits, 0=none; 1=random one; 2=all, [1]
Therefore, when I have a really large data set, I think I would rather run soap2 first to get a basic idea of my reads and some initial results, and then let bwa run for days to get more precise results.
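
One way to put a number on "good with indels" is to count how many mapped reads were actually aligned with a gap. The sketch below assumes the aligner writes SAM (as bwa samse/sampe do) and simply looks for I (insertion) or D (deletion) operations in the CIGAR column. The function names and the example records are made up for illustration; this is not part of bwa or soap2.

```python
import re

# One CIGAR operation: a length followed by an operation code from the SAM spec.
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")


def indel_ops(cigar):
    """Return (length, op) pairs for the I and D operations in a CIGAR string."""
    if cigar == "*":  # no alignment available
        return []
    return [(int(n), op) for n, op in CIGAR_OP.findall(cigar) if op in "ID"]


def count_gapped_reads(sam_lines):
    """Return (mapped_reads, mapped_reads_with_at_least_one_indel)."""
    mapped = gapped = 0
    for line in sam_lines:
        if line.startswith("@"):  # SAM header line
            continue
        fields = line.rstrip("\n").split("\t")
        flag, cigar = int(fields[1]), fields[5]
        if flag & 0x4:  # 0x4 = read is unmapped
            continue
        mapped += 1
        if indel_ops(cigar):
            gapped += 1
    return mapped, gapped


# Hypothetical SAM records, invented for this example only.
SEQ, QUAL = "A" * 36, "I" * 36
example = [
    "@SQ\tSN:chr1\tLN:1000",
    f"r1\t0\tchr1\t100\t37\t36M\t*\t0\t0\t{SEQ}\t{QUAL}",
    f"r2\t0\tchr1\t200\t37\t20M2D16M\t*\t0\t0\t{SEQ}\t{QUAL}",
    f"r3\t4\t*\t0\t0\t*\t*\t0\t0\t{SEQ}\t{QUAL}",
    f"r4\t16\tchr1\t300\t37\t10M1I25M\t*\t0\t0\t{SEQ}\t{QUAL}",
]

if __name__ == "__main__":
    print(count_gapped_reads(example))
```

Running the same count on reads that overlap known indel positions would show whether an aligner is placing gaps there or only reporting ungapped hits.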

Last edited by polyhedron; 10-20-2010 at 06:44 AM.
Old 11-03-2010, 01:59 AM   #5
sci_guy
Member
 
Location: Sydney, Australia

Join Date: Jan 2008
Posts: 83

SHRiMP is very robust to INDELs, but is very slow (as it performs Smith-Waterman alignments).
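
To illustrate why Smith-Waterman copes well with indels but costs time, here is a minimal sketch of the textbook algorithm with a linear gap penalty. It is not SHRiMP's implementation; the scoring values and the example sequences are arbitrary assumptions. The dynamic-programming table has one cell per (read base, reference base) pair, which is where the slowness comes from.

```python
def smith_waterman(read, ref, match=2, mismatch=-1, gap=-2):
    """Best local alignment of read against ref (linear gap penalty).

    Returns (score, aligned_read, aligned_ref); '-' marks a gap.
    """
    rows, cols = len(read) + 1, len(ref) + 1
    score = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)

    def sub(i, j):
        return match if read[i - 1] == ref[j - 1] else mismatch

    # Fill the table: every cell considers a match/mismatch or a gap on either side.
    for i in range(1, rows):
        for j in range(1, cols):
            score[i][j] = max(
                0,
                score[i - 1][j - 1] + sub(i, j),  # align read base to ref base
                score[i - 1][j] + gap,            # read base against a gap (insertion)
                score[i][j - 1] + gap,            # ref base against a gap (deletion)
            )
            if score[i][j] > best:
                best, best_pos = score[i][j], (i, j)

    # Trace back from the best cell until the local alignment starts (score 0).
    i, j = best_pos
    out_read, out_ref = [], []
    while i > 0 and j > 0 and score[i][j] > 0:
        if score[i][j] == score[i - 1][j - 1] + sub(i, j):
            out_read.append(read[i - 1]); out_ref.append(ref[j - 1])
            i, j = i - 1, j - 1
        elif score[i][j] == score[i - 1][j] + gap:
            out_read.append(read[i - 1]); out_ref.append("-")
            i -= 1
        else:
            out_read.append("-"); out_ref.append(ref[j - 1])
            j -= 1
    return best, "".join(reversed(out_read)), "".join(reversed(out_ref))


if __name__ == "__main__":
    # The read is the reference with one base deleted (made-up sequences).
    s, a, b = smith_waterman("GATCGTCCTGA", "GATCGTACCTGA")
    print(s)
    print(a)
    print(b)
```

Real aligners use affine gap penalties and seed or filter candidate regions first, so this only shows the core idea.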
Old 11-03-2010, 03:18 AM   #6
lry198010
Member
 
Location: Wuhan China

Join Date: Aug 2008
Posts: 13

Quote:
-r INT How to report repeat hits, 0=none; 1=random one; 2=all, [1]
You can try the -n and -N options of the sampe or samse subcommands!
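
For reference, a rough sketch of how those options fit into a bwa aln run. The helpers below only build the command lines; the file names, the hit limits and the thread count are placeholder assumptions, and the reference is assumed to have been indexed already with bwa index. As far as I understand the bwa manual, -n limits how many alternative hits are reported per read and -N (sampe only) does the same for discordant pairs, so check bwa's own usage output for your version.

```python
import shlex


def bwa_single_end(ref, reads, max_hits=10, threads=8):
    """(argv, output_file) steps for bwa aln + samse on single-end reads."""
    sai = reads + ".sai"
    return [
        (["bwa", "aln", "-t", str(threads), ref, reads], sai),
        (["bwa", "samse", "-n", str(max_hits), ref, sai, reads], reads + ".sam"),
    ]


def bwa_paired_end(ref, reads1, reads2, max_hits=10, max_discordant_hits=20, threads=8):
    """(argv, output_file) steps for bwa aln + sampe on paired-end reads."""
    sai1, sai2 = reads1 + ".sai", reads2 + ".sai"
    return [
        (["bwa", "aln", "-t", str(threads), ref, reads1], sai1),
        (["bwa", "aln", "-t", str(threads), ref, reads2], sai2),
        (["bwa", "sampe", "-n", str(max_hits), "-N", str(max_discordant_hits),
          ref, sai1, sai2, reads1, reads2], reads1 + ".pe.sam"),
    ]


def as_shell(steps):
    """Render each step as a shell line with stdout redirected to its output file."""
    return [shlex.join(argv) + " > " + shlex.quote(out) for argv, out in steps]


if __name__ == "__main__":
    # Hypothetical file names.
    for line in as_shell(bwa_single_end("ref.fa", "reads.fq")):
        print(line)
    for line in as_shell(bwa_paired_end("ref.fa", "reads_1.fq", "reads_2.fq")):
        print(line)
```

To actually run a step from Python, something like subprocess.run(argv, stdout=open(out, "w"), check=True) would do, provided bwa is on the PATH.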
Old 11-03-2010, 04:36 AM   #7
drio
Senior Member
 
Location: 41°17'49"N / 2°4'42"E

Join Date: Oct 2008
Posts: 323

Quote:
Originally Posted by polyhedron
Therefore, when I have a really large data set, I think I would rather run soap2 first to get a basic idea of my reads and some initial results, and then let bwa run for days to get more precise results.
Days? BWA is very fast and accurate. How much data are you trying to align, and in what environment? A lane of HiSeq (75 bp, single-end) should take less than one day on a typical 8-core machine. You can further reduce the running time by splitting the data and processing the pieces on multiple machines.
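
A minimal sketch of the splitting step, assuming plain 4-line FASTQ records (no wrapped sequence lines). The function names, chunk size and output naming are made up for illustration. Each chunk file can then be aligned independently on a different machine and the results merged afterwards.

```python
from itertools import islice


def fastq_chunks(lines, reads_per_chunk):
    """Yield lists of FASTQ lines holding at most reads_per_chunk records each."""
    it = iter(lines)
    while True:
        chunk = list(islice(it, 4 * reads_per_chunk))  # 4 lines per record
        if not chunk:
            return
        if len(chunk) % 4:
            raise ValueError("truncated FASTQ record at end of input")
        yield chunk


def split_fastq(path, reads_per_chunk, prefix):
    """Write prefix.0000.fq, prefix.0001.fq, ... and return the file names."""
    names = []
    with open(path) as handle:
        for n, chunk in enumerate(fastq_chunks(handle, reads_per_chunk)):
            name = f"{prefix}.{n:04d}.fq"
            with open(name, "w") as out:
                out.writelines(chunk)
            names.append(name)
    return names


if __name__ == "__main__":
    # Five made-up reads split into chunks of two reads.
    demo = []
    for i in range(5):
        demo += [f"@read{i}\n", "ACGT\n", "+\n", "IIII\n"]
    print([len(c) // 4 for c in fastq_chunks(demo, 2)])
```

For paired-end data, both mate files would need to be split with the same chunk size so that the mates stay in matching chunks.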
__________________
-drd
Old 11-03-2010, 09:49 AM   #8
Michael.James.Clark
Senior Member
 
Location: Palo Alto

Join Date: Apr 2009
Posts: 213

Yeah, I'm with drio on this one. Something is wrong with your approach using BWA if it's taking days, even for a very large data set. You may not be running it in an optimal fashion for your computational resources if that's the case.

All three gapped aligners that I mentioned--Novoalign, BFAST, BWA--are quite fast with Illumina reads, so if it seems slow, there may be an incorrect setting or it may be getting run in a suboptimal way.
Old 11-09-2010, 07:33 AM   #9
olesk
Junior Member
 
Location: Copenhagen, DK

Join Date: Nov 2010
Posts: 6

Depending on the size of your reference sequence and the amount of sequence data, you could test whether my program R2R works for you.
Find it at: http://milne.ruc.dk/R2R/
Old 11-12-2010, 01:14 AM   #10
lexa
Member
 
Location: MPI

Join Date: Jun 2010
Posts: 17

There is a new mapper available called STAMPY (http://www.well.ox.ac.uk/project-stampy). The paper seems promising, but I haven't tested it myself.
Old 03-23-2011, 02:19 AM   #11
colindaven
Senior Member
 
Location: Germany

Join Date: Oct 2008
Posts: 415

The best tools I've used for aligning small indels are Stampy and SHRiMP2. We've validated quite a number of these.
Unfortunately, they are slow as well.

As a follow-up, I've been using Dindel on Stampy-aligned data for further testing. No wet-lab data on this yet, though.
Old 03-23-2011, 03:01 AM   #12
ttnguyen
Member
 
Location: Ireland

Join Date: Mar 2010
Posts: 41

Novoalign and BWA work quite well for this problem. BWA is faster than the free version of Novoalign (which does not support MPI), but Novoalign may be more accurate.
It might be worth taking a look at these surveys, though they are not the most up to date:

http://lh3lh3.users.sourceforge.net/NGSalign.shtml
http://bib.oxfordjournals.org/content/11/5/473.full