If I use samtools' "idxstats" commands, it returns information with:
1. reference sequence name (chromosome name in my case)
2. length of reference sequence
3. number of mapped reads assigned to that reference sequence
4. number of unmapped reads assigned to that reference sequence
Number 4 makes no sense to me. If a read is unmapped then by definition you can't assign it to a genomic location. If I remove everything with map quality less than 1, most of these go away but not all of them, i.e. I still end up with unmapped reads assigned to chromosomes. Does anyone understand this?
Thank you.
Eric
1. reference sequence name (chromosome name in my case)
2. length of reference sequence
3. number of mapped reads assigned to that reference sequence
4. number of unmapped reads assigned to that reference sequence
Number 4 makes no sense to me. If a read is unmapped then by definition you can't assign it to a genomic location. If I remove everything with map quality less than 1, most of these go away but not all of them, i.e. I still end up with unmapped reads assigned to chromosomes. Does anyone understand this?
Thank you.
Eric
Comment