Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • pparg
    Member
    • Aug 2008
    • 19

    BWA paired end mapping quality

    I used BWA to map my PE sequencing data to reference genome. I try to use paired mapping quality to filter bad read pairs out for downstream analysis.
    How BWA calculate paired mapping quality? I understand it calculates single-end mapping quality like MAQ does. But I am not sure how it proceeds after having the single mapping quality for both ends? Simply add up or something more complicated? I’ve checked the source code, but the program does not make much sense without a good understanding of the variable names/notations. FYI, the relevant source code is located in the ‘static int pairing’ function of the bwape.c file.
    I would really appreciate your input.
    pparg
  • pparg
    Member
    • Aug 2008
    • 19

    #2
    Hello, does anybody have any ideas on this? Thank you!

    Comment

    • Seq84
      Member
      • Feb 2011
      • 19

      #3
      Hi all, I'm interested too! Could someone post a link or a brief description of BWA quality mapping scoring ?

      Thanks in advance.

      Comment

      • Azazel
        Member
        • Oct 2010
        • 52

        #4
        +1, I have the exact same question, too

        +1, I have the exact same question, too

        I'd also like to know how the mapping quality for paired end reads is computed, is it just the sum of the quality of the two separate reads?

        Comment

        • nilshomer
          Nils Homer
          • Nov 2008
          • 1283

          #5
          Unfortunately, the best documentation is from the original paper (single end) as well as the code (paired end). Try modifying the code to print out the relevant variables to understand the calculation etc.

          Comment

          • trickytank
            Member
            • Dec 2010
            • 19

            #6
            Hey I'm interested in this too. In particular, what if one read maps to one location on the reference, but the the other read maps to somewhere differently (such that it does not have the correct orientation and/or distance)? What I really want to know if such pairs are down weighted by low mapping quality in some way?

            Comment

            • dp05yk
              Member
              • Dec 2010
              • 66

              #7
              It says in the paper that BWA will find all single-end alignments for each mate and sort them in ascending order of chromosomal coordinates. Then it uses an estimated insert size to determine which of the chromosomal coordinates are best for both mates.

              The insert size is determined in the function infer_isize, and I believe the pairing is determined in the function pairing :-) both are contained in bwape.c.

              Comment

              • angerusso
                Member
                • Oct 2011
                • 47

                #8
                Hello All,

                I have a WholeExome paired end sample and I reached the step where I am performing the alignment to human genome (hg19.fa) on a 10 node cluster.

                I am running the command:
                bwa aln hg19.fa sample1_1.fastq > sample1_1.sai
                bwa aln hg19.fa sample1_2.fastq > sample1_2.sai

                But it's taking forever. I understand this could due to couple of reasons, main reason being that I am not doing any pre-filtering. I saw that packages like GenomeQuest do lot of pre-filtering which can make the alignment faster.

                I am total new-bie and i am wondering if I can get help here regarding how and what kind of pre-filtering can I run with this sample before using bwa for alignment. I am kind of in a hurry to get some results so any result will be extremely appreciated.

                Thanks,
                angel

                Comment

                • swbarnes2
                  Senior Member
                  • May 2008
                  • 910

                  #9
                  I've run bwa on exome capture DNA with no filtering at all. And takes a while, but it doesn't take forever, and every minute or so it updates the screen telling me how many more reads its finished processing.

                  Using multiple processors with the -t option would certainly speed things along, if your computer has that capacity.

                  Comment

                  • angerusso
                    Member
                    • Oct 2011
                    • 47

                    #10
                    Thanks swbarnes2 very much for your reply.

                    I hope my files will finish by tomorrow. The size of one paired-end fastq file in my case is 63GB.

                    I will try the multi-threading mode you mentioned tomorrow.

                    Angel

                    Comment

                    Latest Articles

                    Collapse

                    • SEQadmin2
                      Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                      by SEQadmin2


                      I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                      Here are nine questions we think about, in roughly the order they matter, before...
                      06-18-2026, 07:11 AM
                    • SEQadmin2
                      From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                      by SEQadmin2


                      Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                      The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                      ...
                      06-02-2026, 10:05 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by SEQadmin2, 06-26-2026, 11:10 AM
                    0 responses
                    11 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-17-2026, 06:09 AM
                    0 responses
                    46 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-09-2026, 11:58 AM
                    0 responses
                    105 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-05-2026, 10:09 AM
                    0 responses
                    125 views
                    0 reactions
                    Last Post SEQadmin2  
                    Working...