Seqanswers Leaderboard Ad

**lh3** · 04-18-2010, 05:32 AM

Use an aligner that is capable of gapped alignment. This is ESSENTIAL. No variant caller can work well with an ungapped aligner.

**eyalbd** · 04-18-2010, 06:52 AM

Thanks a lot for the reply!

Which aligners are capable of gapped alignment? I understand MAQ is, but I couldn't get it to run as I do not have access to a cluster, so I need a software that can run on my core i7 with 12gb (so 10gb max for alignment).

Many thanks,

Eyal

**Thomas Doktor** · 04-18-2010, 10:15 AM

Since Li Heng is too polite to suggest BWA I will recommend it, it's comparable to Bowtie in terms of speed and supports gapped alignments: http://bio-bwa.sourceforge.net/index.shtml

**eyalbd** · 04-18-2010, 11:27 PM

Thanks. I'll try it.

**eyalbd** · 04-19-2010, 02:58 AM

I tried using BWA, I used the supplied solid2fastq.pl file to create a gzip of my reads in fastq. Used the default settings for BWA, and later pileup, got me very bad alignment, with no connection at all between the reference genome (I'm checking only the mitochondria) and the consensus call.
What coud I be doing wrong?

**Thomas Doktor** · 04-19-2010, 03:21 AM

I'm not really sure since I don't work with SOLiD reads, but I think that BWA actually just uses the fastq format to store the colorspace reads in and uses ACGT as color representations. If you then try to align these fastq files normally, you will get many errors because of the nature of the colorspace encoding.

What you should do is generate a colorspace reference of your genome of interest and then align against that. The command looks like this for a human sized genome:

bwa index -a bwtsw -c genome.fa

You then align your reads in colorspace:

bwa aln -c genome.fa reads.fastq > alignment.sai

In any case, you should definetely always align colorspace reads in colorspace.

**eyalbd** · 04-19-2010, 03:38 AM

These steps are exactly the ones I followed. When I look at the SAM file now, I see many N's in the reads, in similar places in the sequence, for instance:

NGGNGNNNTAGGGNANNNANGCCNGNTNGNGNTNGNNNGATNGNCNNNN
NTCNTNNNAGTGCNANNNGNGTGGGNGNGNTNANCGNNGCGCGNANNNN

etc...

**Thomas Doktor** · 04-19-2010, 08:52 AM

Looks odd, how do the "raw" fastq files look?

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 59 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 57 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 53 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 56 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

very short deletion messes up SAMtools SNP calling

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News