Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • BWA MEM question

    I am using BWA MEM to map 250 bp MiSeq PE reads. I am doing targeted genomic sequencing, enriching the library for specific transgenic sequences. I make a reference sequence, map all of the reads to that reference (using the default parameters). What I get are many improperly mapped reads. Some of the reads that map have 20 - 50 mismatches between the read and the reference, but it maps them none-the-less. There are similar sequences contaminating the sample apparently, but I don't want these spurious mappings. How can I increase the stringency of the mapping, so I don't get all of these imporperly mapped reads? I have more than enough coverage to throw out any bad mappings and still have tons of coverage.

    Thanks!
    CHObot

    P.S. I am looking for chimeric reads at the ends of the mappings to determine transgenic insertion sites. Is BWA MEM the best algorithm for this? It seems to work, albeit with a lot of manual examination of chimeric reads.

  • #2
    you should be able to threshold the alignments by the MAPQ value. I'm not sure of the specific cutoff but let's say 20.

    Code:
    samtools view -bq 20 alignments.bam > filtered.bam
    bwa has a very descriptive MAPQ field so this *should* work. Optionally maybe there's another tool out there that can specifically filter by mismatches if you want to keep the mismatches down to say 4 or 5% of the read length.
    /* Shawn Driscoll, Gene Expression Laboratory, Pfaff
    Salk Institute for Biological Studies, La Jolla, CA, USA */

    Comment


    • #3
      can some post general commands to align fq PE with hg19
      where should I start with ?

      Comment


      • #4
        the bwa manual webpage gives examples of the various bwa commands:

        Comment


        • #5
          I used bwa aln -t 4 -f testsample.sai /GATKbundle/ucsc.hg19.fasta /testsample.fastq
          It was finished within 2 minutes and generated testsample.sai. Is this the alignment file ?
          I used bowtie and it took 4-5 hrs but bwa is so fast ??? something is wrong ????

          Originally posted by mastal View Post
          the bwa manual webpage gives examples of the various bwa commands:

          http://bio-bwa.sourceforge.net/bwa.shtml

          Comment


          • #6
            bwa aln is part of a 2-step process.

            the first step generates the .sai file.

            the second step is bwa samse/sampe, depending on whether you have single end or paired-end reads, and should give you a .sam file.

            the manual will give you the details.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Essential Discoveries and Tools in Epitranscriptomics
              by seqadmin




              The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
              04-22-2024, 07:01 AM
            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, Yesterday, 11:49 AM
            0 responses
            13 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-24-2024, 08:47 AM
            0 responses
            16 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            61 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            60 views
            0 likes
            Last Post seqadmin  
            Working...
            X