MD tags with different ref seq

radood


09-23-2021, 08:39 AM

Hi, I came across a puzzling finding. I have two different reads in a BAM file with exactly the same sequence. For one of them, the aligned pairs derived from the MD tag show that the last position (position 42) matches the reference sequence (the T is written in upper case).

TTCTGAATTAGCTGTATCGTCAAGGCACTCTTGCCTACGCCAT
(42, 25398283, 'T')

For the other read, however, the MD-derived aligned pairs show the last position (position 42) as a mismatch, with a C in the reference at that position (the c is written in lower case).

TTCTGAATTAGCTGTATCGTCAAGGCACTCTTGCCTACGCCAT
(42, 25398283, 'c')

The reference base at that position (chr12:25398283 in hg19) really is a C, but I don't understand why the MD aligned pairs differ between these two reads when their sequences are exactly the same. Any ideas?
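One thing that may be worth checking: as far as I understand, when pysam reports a reference base in the aligned pairs it rebuilds it from each read's own MD tag, not from the reference FASTA, so two reads with identical sequences can still give different output if their MD tags differ. Below is a minimal pure-Python sketch of that idea (it does not call pysam). It assumes an ungapped alignment (a single match block in the CIGAR), and the two MD strings `"43"` and `"42C0"` are hypothetical values chosen for illustration, not the tags from the actual reads; `aligned_pairs_from_md` is likewise a made-up helper name.

```python
import re


def aligned_pairs_from_md(query_seq, ref_start, md_tag):
    """Rebuild (query_pos, ref_pos, ref_base) tuples from an MD tag.

    Assumption: the alignment is ungapped (CIGAR is one match block),
    so query and reference positions advance together.
    Matches are reported in upper case, mismatches in lower case.
    """
    pairs = []
    qpos = 0
    # MD tokens: a run of matches (digits), a deletion (^ plus bases),
    # or a single mismatched reference base (one letter).
    for token in re.findall(r"\d+|\^[A-Za-z]+|[A-Za-z]", md_tag):
        if token.isdigit():
            # Matching bases: the reference base equals the read base.
            for _ in range(int(token)):
                pairs.append((qpos, ref_start + qpos, query_seq[qpos].upper()))
                qpos += 1
        elif token.startswith("^"):
            raise ValueError("deletions are outside the scope of this sketch")
        else:
            # Mismatch: the MD tag itself carries the reference base.
            pairs.append((qpos, ref_start + qpos, token.lower()))
            qpos += 1
    if qpos != len(query_seq):
        raise ValueError("MD tag does not cover the whole read")
    return pairs


READ = "TTCTGAATTAGCTGTATCGTCAAGGCACTCTTGCCTACGCCAT"
LAST_REF_POS = 25398283                      # 0-based, as in the post
REF_START = LAST_REF_POS - (len(READ) - 1)   # assumes no indels or clipping

# Hypothetical MD tags: all 43 bases match vs. a mismatch at the last base.
match_pairs = aligned_pairs_from_md(READ, REF_START, "43")
mismatch_pairs = aligned_pairs_from_md(READ, REF_START, "42C0")

print(match_pairs[-1])     # (42, 25398283, 'T')
print(mismatch_pairs[-1])  # (42, 25398283, 'c')
```

If this is what is happening, printing the MD tag (and CIGAR) of both reads side by side should show the difference directly.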

Thanks so much!

Note: these positions use pysam's 0-based indexing, so the same position in a 1-based genome browser is off by one (25398284 instead of 25398283).
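For anyone who wants to look the position up, the conversion is a simple shift by one. A small sketch (the helper names `to_one_based` and `browser_region` are made up for illustration, and the `chrom:start-end` string is just the usual browser-style region notation):

```python
def to_one_based(zero_based_pos):
    """Convert a 0-based coordinate (as pysam reports) to a 1-based one."""
    return zero_based_pos + 1


def browser_region(chrom, zero_based_pos):
    """Format a single-base region string using 1-based coordinates."""
    one_based = to_one_based(zero_based_pos)
    return f"{chrom}:{one_based}-{one_based}"


region = browser_region("chr12", 25398283)
print(region)  # chr12:25398284-25398284
```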

Last edited by radood; 09-23-2021, 08:42 AM.