Dear NGS community,
although I this is my first post here I would like to acknowledge you all for the helping in all the tough NGS analysis.
I have SOLiD single-end reads 50 bp long with good quality that I want to map into a reference genome of approximately 12.6 Mb. The reads are all in fasqcssanger following the Galaxy pipeline. I tried several settings in BWA and all of them gave different results and I'm really struggling to find the most accurate/specific of them (if this is possible??) in order to proceed to SNP calling. That's the reason of this post.
Setting the default parameters (-n 0.04; -l -1; -k2) only ~51% of the reads were mapped.
By tuning the parameters I was able to improve the fraction of mapped reads:
allowing for more mismatches and setting -k to 3, more reads were mapped (-n 10; -l -1; -k 3): ~71% of mapped reads.
tuning the seed to a small number, also increased this number (-n 10; -l 25; -k3): ~77% of reads mapped.
I'm now waiting for the results of disabling the seed (-n 10; -l 1000; -k3): ???
However, I'm not really sure with what option to follow with. Can please someone advise me about this issue. What parameters are more appropriate for downstream SNP calling?
Thanks to all. This forum has been a real help for my NGS adventure.
PAL
although I this is my first post here I would like to acknowledge you all for the helping in all the tough NGS analysis.
I have SOLiD single-end reads 50 bp long with good quality that I want to map into a reference genome of approximately 12.6 Mb. The reads are all in fasqcssanger following the Galaxy pipeline. I tried several settings in BWA and all of them gave different results and I'm really struggling to find the most accurate/specific of them (if this is possible??) in order to proceed to SNP calling. That's the reason of this post.
Setting the default parameters (-n 0.04; -l -1; -k2) only ~51% of the reads were mapped.
By tuning the parameters I was able to improve the fraction of mapped reads:
allowing for more mismatches and setting -k to 3, more reads were mapped (-n 10; -l -1; -k 3): ~71% of mapped reads.
tuning the seed to a small number, also increased this number (-n 10; -l 25; -k3): ~77% of reads mapped.
I'm now waiting for the results of disabling the seed (-n 10; -l 1000; -k3): ???
However, I'm not really sure with what option to follow with. Can please someone advise me about this issue. What parameters are more appropriate for downstream SNP calling?
Thanks to all. This forum has been a real help for my NGS adventure.
PAL
Comment