
  • BWA MEM question

    I am using BWA MEM to map 250 bp MiSeq PE reads. I am doing targeted genomic sequencing, enriching the library for specific transgenic sequences. I make a reference sequence and map all of the reads to it using the default parameters. What I get are many improperly mapped reads. Some of the mapped reads have 20 - 50 mismatches against the reference, but BWA MEM maps them nonetheless. Apparently there are similar sequences contaminating the sample, but I don't want these spurious mappings. How can I increase the stringency of the mapping so I don't get all of these improperly mapped reads? I have more than enough coverage to throw out any bad mappings and still have plenty left.

    Thanks!
    CHObot

    P.S. I am looking for chimeric reads at the ends of the mappings to determine transgenic insertion sites. Is BWA MEM the best algorithm for this? It seems to work, albeit with a lot of manual examination of chimeric reads.

  • #2
    You should be able to threshold the alignments by the MAPQ value. I'm not sure of the specific cutoff, but let's say 20.

    Code:
    samtools view -bq 20 alignments.bam > filtered.bam
    bwa has a very informative MAPQ field, so this *should* work. Alternatively, there may be another tool out there that can filter specifically by mismatches, if you want to keep them down to, say, 4 or 5% of the read length.
    /* Shawn Driscoll, Gene Expression Laboratory, Pfaff
    Salk Institute for Biological Studies, La Jolla, CA, USA */
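
As a rough illustration of the mismatch-based filter suggested above, here is a minimal sketch that works on SAM text with awk. It assumes the aligner wrote the NM tag (edit distance, which counts mismatches plus inserted and deleted bases) and that the stored SEQ length is an acceptable stand-in for the read length. The function name `filter_by_mismatch_rate` and the 0.05 default are made up for this example; this is not an existing samtools or bwa feature.

```shell
# Keep SAM records whose edit distance (NM tag) is at most a given
# fraction of the stored read length. Header lines pass through unchanged.
# filter_by_mismatch_rate is a hypothetical helper, not a samtools command.
filter_by_mismatch_rate() {
    max_rate="${1:-0.05}"
    awk -F '\t' -v max="$max_rate" '
        /^@/ { print; next }               # SAM header lines
        {
            nm = -1
            for (i = 12; i <= NF; i++)     # optional tags start at column 12
                if ($i ~ /^NM:i:/) { nm = substr($i, 6) + 0; break }
            len = length($10)              # SEQ column
            # Drop records with no NM tag or no stored sequence ("*")
            if (nm >= 0 && $10 != "*" && len > 0 && nm / len <= max) print
        }'
}

# Possible usage together with the MAPQ cutoff (file names are placeholders):
#   samtools view -h -q 20 alignments.bam \
#       | filter_by_mismatch_rate 0.05 \
#       | samtools view -b -o filtered.bam -
```

Note that soft-clipped bases are part of SEQ but are not counted in NM, so a heavily clipped chimeric read can still pass this filter. That may be what you want when looking for insertion junctions, but it is worth checking on your own data.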


    • #3
      Can someone post general commands for aligning paired-end FASTQ reads to hg19?
      Where should I start?


      • #4
        The bwa manual webpage gives examples of the various bwa commands:

        http://bio-bwa.sourceforge.net/bwa.shtml


        • #5
          I used
          bwa aln -t 4 -f testsample.sai /GATKbundle/ucsc.hg19.fasta /testsample.fastq
          It finished within 2 minutes and generated testsample.sai. Is this the alignment file?
          When I used bowtie it took 4-5 hrs. Is bwa really that fast, or is something wrong?

          Originally posted by mastal
          the bwa manual webpage gives examples of the various bwa commands:

          http://bio-bwa.sourceforge.net/bwa.shtml


          • #6
            bwa aln is only the first part of a 2-step process, so the .sai file is an intermediate file, not the final alignment.

            The first step (bwa aln) generates the .sai file.

            The second step is bwa samse or bwa sampe, depending on whether you have single-end or paired-end reads, and should give you a .sam file.

            The manual will give you the details.
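
To make the two steps concrete, here is a small sketch for paired-end reads. It only prints the commands so they can be checked before anything is run. The helper name `bwa_aln_pe_cmds`, the output prefix, and all file names are placeholders invented for this example, and it assumes the reference has already been indexed with bwa index.

```shell
# Print the two-step bwa aln / bwa sampe commands for one paired-end sample.
# bwa_aln_pe_cmds is a hypothetical helper; nothing is executed here.
# Arguments: reference FASTA, read 1 FASTQ, read 2 FASTQ, output prefix.
bwa_aln_pe_cmds() {
    ref="$1"; fq1="$2"; fq2="$3"; prefix="$4"
    # Step 1: bwa aln writes one intermediate .sai file per FASTQ file
    printf 'bwa aln -t 4 -f %s_1.sai %s %s\n' "$prefix" "$ref" "$fq1"
    printf 'bwa aln -t 4 -f %s_2.sai %s %s\n' "$prefix" "$ref" "$fq2"
    # Step 2: bwa sampe combines both .sai files with the reads into SAM
    printf 'bwa sampe %s %s_1.sai %s_2.sai %s %s > %s.sam\n' \
        "$ref" "$prefix" "$prefix" "$fq1" "$fq2" "$prefix"
}

# Example (placeholder file names): review the output, then paste it into a shell
#   bwa_aln_pe_cmds ucsc.hg19.fasta sample_1.fastq sample_2.fastq sample
```

For single-end reads there is only one bwa aln call, and the second step becomes bwa samse with the reference, the single .sai file, and the FASTQ file. The printed commands are not quoted, so this sketch is only suitable for file names without spaces.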
