I am using BWA MEM to map 250 bp MiSeq PE reads. I am doing targeted genomic sequencing, enriching the library for specific transgenic sequences. I make a reference sequence, map all of the reads to that reference (using the default parameters). What I get are many improperly mapped reads. Some of the reads that map have 20 - 50 mismatches between the read and the reference, but it maps them none-the-less. There are similar sequences contaminating the sample apparently, but I don't want these spurious mappings. How can I increase the stringency of the mapping, so I don't get all of these imporperly mapped reads? I have more than enough coverage to throw out any bad mappings and still have tons of coverage.
Thanks!
CHObot
P.S. I am looking for chimeric reads at the ends of the mappings to determine transgenic insertion sites. Is BWA MEM the best algorithm for this? It seems to work, albeit with a lot of manual examination of chimeric reads.
Thanks!
CHObot
P.S. I am looking for chimeric reads at the ends of the mappings to determine transgenic insertion sites. Is BWA MEM the best algorithm for this? It seems to work, albeit with a lot of manual examination of chimeric reads.
Comment