Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Alignment program that allows the greatest number of mismatches

    I am trying to align reads allowing for as many mismatches as possible. I was using Novocraft before and that program allows me to align up to about 9 mismatches. Is there a program out there that can allow more?

    thanks

  • #2
    Stampy?

    As far as I know Stampy is not restricted to a specific number of mismatches...

    http://www.well.ox.ac.uk/project-stampy

    Stampy has the following features:

    - Maps single, paired-end, mate pair Illumina reads to a reference
    - Fast: about 10 (with BWA) or 15 hours (without) per Gbase
    - Low memory footprint: 2.7 Gb shared memory for a 3Gbase genome
    - High sensitivity for indels and divergent reads, up to 10-15%
    - Low mapping bias for reads with SNPs or indels
    - Well calibrated mapping quality scores
    - Input: Fastq and Fasta; gzipped or plain; SAM and BAM
    - Output: SAM, Maq's map file
    - Optionally calculates per-base alignment posteriors
    - Optionally processes part of the input
    - Handles reads up to 4500 bases

    To calculate correct mapping qualities, Stampy needs to know the
    expected divergence from the reference. This is set with the
    --substitutionrate= option. The default is 0.001 substitutions per
    site.

    Increasing the read length, and using paired-end reads, helps mapping
    divergent reads. The following table gives an indication of the
    divergence at which a reasonable proportion of reads can be correctly
    mapped. These numbers were obtained by simulation, using the human
    genome as reference, and should be taken as an indication only; they
    are dependent on error rates, the repetitiveness of the genome, the
    insert size distribution, and local variations in divergence; in
    addition no indel mutations were included.

    36bp 36bp 72bp 72bp
    divergence | single paired single paired
    -------------------------------------------------------
    0% | 82% 95% 87% 96%
    3% | 73% 91% 80% 94%
    6% | 60% 83% 72% 92%
    9% | 41% 56% 56% 88%
    12% | 28% 51% 48% 80%

    Comment


    • #3
      BFAST does a better job than most aligners (Figure 3 of the paper shows comparative analysis), although I don't believe Stampy was included.

      Comment


      • #4
        How long are the reads you want to align?
        ecSeq Bioinformatics is Europe’s leading provider of hands-on bioinformatics workshops and professional data analysis in the field of Next-Generation Sequencing (NGS).

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Investigating the Gut Microbiome Through Diet and Spatial Biology
          by seqadmin




          The human gut contains trillions of microorganisms that impact digestion, immune functions, and overall health1. Despite major breakthroughs, we’re only beginning to understand the full extent of the microbiome’s influence on health and disease. Advances in next-generation sequencing and spatial biology have opened new windows into this complex environment, yet many questions remain. This article highlights two recent studies exploring how diet influences microbial...
          02-24-2025, 06:31 AM
        • seqadmin
          Quality Control Essentials for Next-Generation Sequencing Workflows
          by seqadmin




          Like all molecular biology applications, next-generation sequencing (NGS) workflows require diligent quality control (QC) measures to ensure accurate and reproducible results. Proper QC begins at nucleic acid extraction and continues all the way through to data analysis. This article outlines the key QC steps in an NGS workflow, along with the commonly used tools and techniques.

          Nucleic Acid Quality Control
          Preparing for NGS starts with isolating the...
          02-10-2025, 01:58 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 03-03-2025, 01:15 PM
        0 responses
        154 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 02-28-2025, 12:58 PM
        0 responses
        237 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 02-24-2025, 02:48 PM
        0 responses
        607 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 02-21-2025, 02:46 PM
        0 responses
        263 views
        0 likes
        Last Post seqadmin  
        Working...
        X