Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Mapping quality and clipping in BWA

    Hi @all,

    I have a problem in understanding some alignments produced by BWA mem (Version: 0.7.8-r455):

    Code:
    IQ4WJ2H02II3G5	0	chromosome_1	256920	60	512S33M	*	0	0
    IQ4WJ2H02GYKYP	0	chromosome_1	380024	12	312S33M142S	*	0	0
    IQ4WJ2H01BLH7R	16	chromosome_1	794344	7	79M178S	*	0	0
    1) Besides the obvious (all alignments are crap)... is MapQ influenced by clipping at all? Why not? What about other aligners?

    2) Next question would be if it is possible to prevent things like this from happening? I tried to adjust the "-L" parameter but it seems to have almost no effect - I tried standard (5), 50, 100, 10000 and these alignments occur in all 4 runs.

    3) Has anyone of you ever seen something like this before? How do you handle/filter these reads then? The only thing that occurs to me is selecting by S/H with values greater than 50(?),100(?),10% read length(?)

    Any suggestion helps!

  • #2
    1) BBMap's map scores are not directly affected by the bases in the clipped portion, though they are affected by the length of the aligned portion, so 100= would score slightly higher than 50=50S. Also, shorter alignments are more likely to be coincidentally ambiguous so they will tend to have a slightly lower score. That said, clipping is disabled by default as BBMap is a global aligner. For BWA, I don't know how the score is calculated.

    3) I've seen alignments like that when mapping to the wrong reference (e.g. contaminant reads). You might want to gather some of those mostly-clipped reads and blast them to nt; that may give you insight into how to filter them. Also, if a read starts out normal then becomes junk (for example, short-insert read that hits adapter sequence and then random letters, or a PacBio read where the enzyme breaks down partway through) you get those local alignments with most of the read clipped.

    Comment


    • #3
      1. Not directly, see a discussion here.
      2. Maybe decrease -B, though I worry that this will just decrease the reliability of alignments. It'd be better to just blast things.
      3. Nope, as Brian suggested, I'd blast the soft-clipped portions of a few reads.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Recent Advances in Sequencing Analysis Tools
        by seqadmin


        The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
        Yesterday, 07:48 AM
      • seqadmin
        Essential Discoveries and Tools in Epitranscriptomics
        by seqadmin




        The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
        04-22-2024, 07:01 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, Today, 06:57 AM
      0 responses
      7 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, Yesterday, 07:17 AM
      0 responses
      13 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 05-02-2024, 08:06 AM
      0 responses
      19 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-30-2024, 12:17 PM
      0 responses
      21 views
      0 likes
      Last Post seqadmin  
      Working...
      X