View Single Post
Old 01-03-2012, 03:42 PM   #4
gene coder
Location: Ireland

Join Date: Jul 2011
Posts: 18

Thanks. I found that SamToFastq in Picard did the job on a chromosome of NA12878 from the 1000 Genomes Project.

1. I separated SE reads and PE reads by library into separate BAM files using SamTools.
2. For PE reads, I also had to get rid of read-pairs that were unmapped (bit 3) or whose mate was unmapped (bit 4) or that were not properly aligned (bit 2).
3. I further separated the BAM files by read length.
4. Each BAM file now contained SE or PE reads only of the same read length and the same library.
5. Now I could run SamToFastq to convert a SAM file to a DAT file for use with MAQ simutrain.

Those were the steps to do to get what I wanted.
gene coder is offline   Reply With Quote