![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
count reads or count base-pairs | yuelics | Introductions | 3 | 07-29-2011 06:41 AM |
Quantification: count reads or count base pairs? | yuelics | Bioinformatics | 0 | 07-27-2011 05:48 AM |
BWA: specifying SAM/BAM file header fields before read alignment? | nora | Bioinformatics | 3 | 12-04-2010 10:11 PM |
Picard MarkDuplicates - How to identify duplicates in generated BAM file | makarovv | Bioinformatics | 6 | 11-10-2010 09:02 AM |
BWA Uniquely Mapped Reads | NF_seq | Bioinformatics | 0 | 09-06-2010 04:32 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Senior Member
Location: USA Join Date: Aug 2010
Posts: 103
|
![]()
Hi all,
If this topic has been solved in a previous post, I apologize for that. And please direct me to the solution. I used BWA to align exome sequencing data. And I want to count the number of uniquely mapped reads and the number of duplicates reads (not multi-hits reads) in the obtained BAM files. Samtools flagstat is supposed to work: uniquely = mapped - duplicate. Unfortunately samtools flagstat always get "0+0 duplicate" for BWA aligned BAM files. I wonder if anybody has an easy solution for this. Thank you all. |
![]() |
![]() |
![]() |
#2 | |
Senior Member
Location: San Diego Join Date: May 2008
Posts: 912
|
![]() Quote:
Software like Picard's MarkDuplicates will flag reads as duplicates where appropriate. samtools rmdup will just get rid of duplicates. |
|
![]() |
![]() |
![]() |
#3 |
Senior Member
Location: USA Join Date: Aug 2010
Posts: 103
|
![]()
Hi swbarnes2,
Thanks for your reply. I seldom use Picard's MarkDuplicates but it seems to me that if we apply it on a BAM file and then the duplicates will be labeled and then we are able to count the number of duplicates in the new BAM. I will try that. Please let me know if you have a better idea. Thank you. |
![]() |
![]() |
![]() |
#4 | |
Senior Member
Location: San Diego Join Date: May 2008
Posts: 912
|
![]() Quote:
Those tools should yield almost exactly the same thing (one known exception, rmdup won't touch reads where each read falls in a different chromosome, MarkDuplicates should deal with those properly.) |
|
![]() |
![]() |
![]() |
Thread Tools | |
|
|