Hi,
I am attempting to align paired end Illumina reads to the human genome with Bfast.
The problem is that most aligners expect paired end reads in 2 fastq files while Bfast expects 1 file. Having searched this and other forums, the most commonly suggested/accepted solution appears to be conversion of the 2 files to 1 file using the ill2fastq script supplied with Bfast.
I have now converted the files and run the alignment however there appear to be problems with the results. I have aligned the same sequences with Stampy and BWA (separately) and I get completely different results. The output of samtools flagstat is shown below.
Does anyone have any suggestions on how best to utilise Bfast under such circumstances?
BWA
200000 + 0 in total (QC-passed reads + QC-failed reads)
0 + 0 duplicates
191942 + 0 mapped (95.97%:nan%)
200000 + 0 paired in sequencing
100000 + 0 read1
100000 + 0 read2
190714 + 0 properly paired (95.36%:nan%)
191364 + 0 with itself and mate mapped
578 + 0 singletons (0.29%:nan%)
0 + 0 with mate mapped to a different chr
0 + 0 with mate mapped to a different chr (mapQ>=5)
Stampy
200000 + 0 in total (QC-passed reads + QC-failed reads)
0 + 0 duplicates
199376 + 0 mapped (99.69%:nan%)
200000 + 0 paired in sequencing
100000 + 0 read1
100000 + 0 read2
197774 + 0 properly paired (98.89%:nan%)
198768 + 0 with itself and mate mapped
608 + 0 singletons (0.30%:nan%)
0 + 0 with mate mapped to a different chr
0 + 0 with mate mapped to a different chr (mapQ>=5)
BFAST
125396 + 0 in total (QC-passed reads + QC-failed reads)
0 + 0 duplicates
123238 + 0 mapped (98.28%:nan%)
50792 + 0 paired in sequencing
25396 + 0 read1
25396 + 0 read2
0 + 0 properly paired (0.00%:nan%)
49516 + 0 with itself and mate mapped
326 + 0 singletons (0.64%:nan%)
0 + 0 with mate mapped to a different chr
0 + 0 with mate mapped to a different chr (mapQ>=5)
I am attempting to align paired end Illumina reads to the human genome with Bfast.
The problem is that most aligners expect paired end reads in 2 fastq files while Bfast expects 1 file. Having searched this and other forums, the most commonly suggested/accepted solution appears to be conversion of the 2 files to 1 file using the ill2fastq script supplied with Bfast.
I have now converted the files and run the alignment however there appear to be problems with the results. I have aligned the same sequences with Stampy and BWA (separately) and I get completely different results. The output of samtools flagstat is shown below.
Does anyone have any suggestions on how best to utilise Bfast under such circumstances?
BWA
200000 + 0 in total (QC-passed reads + QC-failed reads)
0 + 0 duplicates
191942 + 0 mapped (95.97%:nan%)
200000 + 0 paired in sequencing
100000 + 0 read1
100000 + 0 read2
190714 + 0 properly paired (95.36%:nan%)
191364 + 0 with itself and mate mapped
578 + 0 singletons (0.29%:nan%)
0 + 0 with mate mapped to a different chr
0 + 0 with mate mapped to a different chr (mapQ>=5)
Stampy
200000 + 0 in total (QC-passed reads + QC-failed reads)
0 + 0 duplicates
199376 + 0 mapped (99.69%:nan%)
200000 + 0 paired in sequencing
100000 + 0 read1
100000 + 0 read2
197774 + 0 properly paired (98.89%:nan%)
198768 + 0 with itself and mate mapped
608 + 0 singletons (0.30%:nan%)
0 + 0 with mate mapped to a different chr
0 + 0 with mate mapped to a different chr (mapQ>=5)
BFAST
125396 + 0 in total (QC-passed reads + QC-failed reads)
0 + 0 duplicates
123238 + 0 mapped (98.28%:nan%)
50792 + 0 paired in sequencing
25396 + 0 read1
25396 + 0 read2
0 + 0 properly paired (0.00%:nan%)
49516 + 0 with itself and mate mapped
326 + 0 singletons (0.64%:nan%)
0 + 0 with mate mapped to a different chr
0 + 0 with mate mapped to a different chr (mapQ>=5)
Comment