![]() |
BWA - input files
Hi,
I am trying to align paired end reads (Illumina) to a reference genome using BWA. I have 10 reads files, 5 for each direction. In this old post 'totalnew' says to align the files separately: Code:
bwa aln database.fasta 4_1.fq > 1_1.fq.sai Code:
bwa sampe database.fa 1_1.fq.sai 2_1.fq.sai 3_1... 1_2.fq.sai 2_2.fq.sai 3_2... 1_1.fq 2_1.fq 3_1... 1_2.fq 2_2.fq 3_2... > alignment.sam What if I concatenate all reads files into s_1_sequence.txt and s_2_sequences.txt and then run bwa aln twice and bwa samse once? cheers! ps (offtopic) @lh3: I tried downloading bwa 0.5.8a but it seems as though there are files missing. Here what I get: Code:
wget http://sourceforge.net/projects/bio-bwa/files/bwa-0.5.8a.tar.bz2/download |
I think you'd better to run sampe for each PE files separately:
bwa sampe [options] <prefix> <in1.sai> <in2.sai> <in1.fq> <in2.fq> Although you concatenate all read files into two PE seq is okay, I recommend you split them (for example: 100M reads per files) and run in different CPU cores or computer nodes (MPI mode), so that it would be more faster. |
Hi,
Thanks for your reply. So you suggest running sampe at least five times and then merge the resulting sam files? I will try this. Cheers! *** edit Thanks, works fine. |
All times are GMT -8. The time now is 08:25 PM. |
Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2021, vBulletin Solutions, Inc.