Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • BWA mapping quality score is unstable

    Hi Everyone,

    Has anyone else noticed that the BWA mapping quality score is not deterministic so that if you map the exact same read/pair multiple times you sometimes get different mapping quality scores? This is not for pure repeats but for reads that have reasonable mapping quality scores around 30 and the read maps to the same position each time.

    I ran an experiment where I simulated 100 error-free paired-end reads from each position in dm3 chrM (read length=100, outer distance=300), and mapped them back to all of dm3 to compute the mapping quality score of each position. For about 1/3 of the starting positions, I would get 2 different mapping quality scores such that about half the reads at a position would have a mapping quality score of X and half the time it would get a score of X+7. For example the reads simulated starting at position 500 either get a mqs of 29 or 36.

    Does anyone have a good explanation for this? My understanding is the mapping quality score computation should be completely deterministic (except maybe if the pair distance is reestimated), but the results look like there is a random component - it is not always exactly 50-50 split between 2 values, but a tight distribution around 50-50.

    Thank you,

    Mike

Latest Articles

Collapse

  • seqadmin
    Essential Discoveries and Tools in Epitranscriptomics
    by seqadmin




    The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
    04-22-2024, 07:01 AM
  • seqadmin
    Current Approaches to Protein Sequencing
    by seqadmin


    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
    04-04-2024, 04:25 PM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, Yesterday, 08:47 AM
0 responses
16 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-11-2024, 12:08 PM
0 responses
60 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 10:19 PM
0 responses
60 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 09:21 AM
0 responses
54 views
0 likes
Last Post seqadmin  
Working...
X