Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Alignment program that allows the greatest number of mismatches

    I am trying to align reads allowing for as many mismatches as possible. I was using Novocraft before and that program allows me to align up to about 9 mismatches. Is there a program out there that can allow more?

    thanks

  • #2
    Stampy?

    As far as I know Stampy is not restricted to a specific number of mismatches...

    http://www.well.ox.ac.uk/project-stampy

    Stampy has the following features:

    - Maps single, paired-end, mate pair Illumina reads to a reference
    - Fast: about 10 (with BWA) or 15 hours (without) per Gbase
    - Low memory footprint: 2.7 Gb shared memory for a 3Gbase genome
    - High sensitivity for indels and divergent reads, up to 10-15%
    - Low mapping bias for reads with SNPs or indels
    - Well calibrated mapping quality scores
    - Input: Fastq and Fasta; gzipped or plain; SAM and BAM
    - Output: SAM, Maq's map file
    - Optionally calculates per-base alignment posteriors
    - Optionally processes part of the input
    - Handles reads up to 4500 bases

    To calculate correct mapping qualities, Stampy needs to know the
    expected divergence from the reference. This is set with the
    --substitutionrate= option. The default is 0.001 substitutions per
    site.

    Increasing the read length, and using paired-end reads, helps mapping
    divergent reads. The following table gives an indication of the
    divergence at which a reasonable proportion of reads can be correctly
    mapped. These numbers were obtained by simulation, using the human
    genome as reference, and should be taken as an indication only; they
    are dependent on error rates, the repetitiveness of the genome, the
    insert size distribution, and local variations in divergence; in
    addition no indel mutations were included.

    36bp 36bp 72bp 72bp
    divergence | single paired single paired
    -------------------------------------------------------
    0% | 82% 95% 87% 96%
    3% | 73% 91% 80% 94%
    6% | 60% 83% 72% 92%
    9% | 41% 56% 56% 88%
    12% | 28% 51% 48% 80%

    Comment


    • #3
      BFAST does a better job than most aligners (Figure 3 of the paper shows comparative analysis), although I don't believe Stampy was included.

      Comment


      • #4
        How long are the reads you want to align?
        ecSeq Bioinformatics is Europe’s leading provider of hands-on bioinformatics workshops and professional data analysis in the field of Next-Generation Sequencing (NGS).

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Advancing Precision Medicine for Rare Diseases in Children
          by seqadmin




          Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
          12-16-2024, 07:57 AM
        • seqadmin
          Recent Advances in Sequencing Technologies
          by seqadmin



          Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

          Long-Read Sequencing
          Long-read sequencing has seen remarkable advancements,...
          12-02-2024, 01:49 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 12-17-2024, 10:28 AM
        0 responses
        22 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 12-13-2024, 08:24 AM
        0 responses
        42 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 12-12-2024, 07:41 AM
        0 responses
        28 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 12-11-2024, 07:45 AM
        0 responses
        42 views
        0 likes
        Last Post seqadmin  
        Working...
        X