  • BWA automatic parameter estimation

    Hi,

    I am using Illumina paired-end data (read length 250 bp) to produce a BAM file with BWA and SAMtools. Since BWA does not support reads longer than 200 bp, I had to use BWA-MEM, with some test parameters (gap opening penalty 30, gap extension penalty 5, etc.). However, one of my colleagues used the BWA on the MiSeq machine and got a completely different alignment and mapping.

    As I am trying to replicate her workflow, I planned to use the same parameters she used. However, according to the MiSeq manual, MiSeq-BWA automatically adjusts parameters based on read lengths and error rates and then estimates the insert size distribution (MiSeq manual, page 22: BWA). In the BAM file from MiSeq-BWA we found that the genome has some deletions, and these were confirmed by Sanger sequencing. So I am fairly sure that MiSeq-BWA is doing the right thing here. However, the BAM file created by my pipeline using BWA-MEM does not show the same result.

    Now my questions are:

    1. Are MiSeq-BWA and the publicly available BWA tools implemented differently? If not, which one should I be using, BWA-SW or BWA-MEM?

    2. Is there any way to adjust parameters based on read lengths and error rates and then estimate the insert size distribution? If I do not specify any parameters, I assume BWA uses its defaults.

    I know the questions are quite broad, but any advice would be a great help.
    Last edited by opulcy97; 03-11-2013, 04:36 AM. Reason: Typo

  • #2
    BWA-MEM was introduced in BWA 0.7.0, and I'm pretty sure the version bundled with the MiSeq software is a 0.6 derivative. I'm not sure that helps, but you're not doing a like-for-like comparison.
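
On question 2, one rough way to check what insert size distribution a given alignment implies is to compute it from the TLEN column (column 9) of the SAM output. The following is a minimal Python sketch, not BWA's or the MiSeq software's actual code: the function name `estimate_insert_size`, the `max_pairs` cap, and the outlier rule (dropping values more than 2 interquartile ranges outside the quartiles) are assumptions chosen for illustration.

```python
import statistics


def estimate_insert_size(sam_lines, max_pairs=100000):
    """Estimate the insert size distribution from SAM text records.

    sam_lines: any iterable of SAM lines (header lines are skipped).
    Returns a dict of summary statistics, or None if too few pairs are found.
    """
    tlens = []
    for line in sam_lines:
        if line.startswith("@"):
            continue  # SAM header line
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:
            continue  # not a complete alignment record
        flag = int(fields[1])
        tlen = int(fields[8])
        # Keep properly paired reads (0x2), skip secondary (0x100) and
        # supplementary (0x800) records, and count each pair once by
        # taking only the mate that reports a positive TLEN.
        if not (flag & 0x2) or (flag & 0x900) or tlen <= 0:
            continue
        tlens.append(tlen)
        if len(tlens) >= max_pairs:
            break

    if len(tlens) < 4:
        return None

    # Quartiles of the observed template lengths.
    q1, median, q3 = statistics.quantiles(tlens, n=4)
    iqr = q3 - q1
    # Illustrative outlier rule (an assumption, see lead-in).
    low, high = q1 - 2 * iqr, q3 + 2 * iqr
    kept = [t for t in tlens if low <= t <= high]

    return {
        "n_pairs": len(tlens),
        "n_used": len(kept),
        "median": median,
        "mean": statistics.mean(kept),
        "std": statistics.pstdev(kept),
    }
```

The lines could come from `samtools view` output piped into a script that calls `estimate_insert_size(sys.stdin)`. Running it on both the MiSeq BAM and the BWA-MEM BAM would at least show whether the two runs disagree on how the read pairs are placed.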
