Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How does BWA deal with multi mapping reads?

    Bowtie manual clearly says the default mode is to report one alignment per read, and it will be random choice when tied.

    BWA doesn't seem to be quite clear at this point in its manual. Can anyone explain to me how BWA reports alignments if a read can be mapped to multiple locations? If I want it to report only one, which parameter should I use for that?

    I'm particularly asking for BWA aln because I'm mapping 50bp SE reads. But it would be even better if you could also explain for BWA mem if it's different.

    Thanks a lot!
    Last edited by metheuse; 03-16-2015, 01:42 PM.

  • #2
    Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc

    Comment


    • #3
      Thanks, but I read that post before posting my questions.

      There are controversial replies in that post. One reply says BWA makes random choice when there is a tie, but the last reply says: "if there are two equal-scoring locations, bwa gives both a mapping score of 3 (equivalent to 50% probability), and so forth."

      If the latter reply is true, it's still not clear to me if I can set some parameter to restrict BWA to report only one location (and how). Bowtie chooses one randomly but BWA's manual doesn't say anything about it.
      Last edited by metheuse; 03-17-2015, 09:48 AM.

      Comment


      • #4
        For bwa-aln, apparently, the assignment is done after alignment in the sam-file generation phase:

        samse bwa samse [-n maxOcc] <in.db.fasta> <in.sai> <in.fq> > <out.sam>

        Generate alignments in the SAM format given single-end reads. Repetitive hits will be randomly chosen.
        I'm not sure about bwa-mem.

        Comment


        • #5
          Originally posted by Brian Bushnell View Post
          For bwa-aln, apparently, the assignment is done after alignment in the sam-file generation phase:



          I'm not sure about bwa-mem.
          Yes I read that and was wondering what "Repetitive hits will be randomly chosen." exactly means...
          If it means choosing a random alignment when multiple locations are found to be equally best, then it's indeed what I need.
          I think sampe is equivalent to samse, but the former is for bwa mem.
          Thanks!

          Comment


          • #6
            Originally posted by metheuse View Post
            Yes I read that and was wondering what "Repetitive hits will be randomly chosen." exactly means...
            If it means choosing a random alignment when multiple locations are found to be equally best, then it's indeed what I need.
            I think sampe is equivalent to samse, but the former is for bwa mem.
            Thanks!
            Sampe and samse have the same description, and yes, the description means reads with multiple equally-scoring alignments.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Essential Discoveries and Tools in Epitranscriptomics
              by seqadmin


              The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
              Yesterday, 07:01 AM
            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            39 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            41 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 09:21 AM
            0 responses
            35 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-04-2024, 09:00 AM
            0 responses
            55 views
            0 likes
            Last Post seqadmin  
            Working...
            X