Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • What's the best short read aligner now?

    I've been out of the loop for a bit and I'm wondering what is considered the best for aligning reads around 50bp-300bp to human genome. By best, I am referring to fastest runtime with above average quality. FPGA and GPU aligners would be interesting too.

    Thanks.

  • #2
    Don't get people started

    Earlier this year in a Biostars thread: https://www.biostars.org/p/125020/

    You have to try BBMap out after reading that thread though.

    Comment


    • #3
      My vote is for Bowtie2, on the type of reads you described.
      I use nearly exclusively Bowtie2, without ever having experienced any issues.
      For Exome-Seq only, I use BWA since it is the aligner used in the Broad Institute Best Practices.
      I'm pretty sure Bowtie2 would work just as well though.
      I seem to remember having use Bowtie2 once by accident on Exome-Seq data, and found to my surprise, that the results were pretty much the same when I repeated the alignment with BWA, give or take a few reads.

      I've come to the general conclusion that the results for the main aligners are so similar for the type of alignments that you are describing, that it is irrelevant which aligner is used.
      I've tried BWA, Bowtie2, and Bowtie on the same datasets with near identical results.
      I didn't keep the results, so I couldn't show them to you right now.

      I'm quite skeptical of benchmarks, since they never seem to be available for the current versions of the programs. There have been more than seven releases of Bowtie2 since the benchmarks on the website given above were published.

      My only advice would be to pick any of the main aligners and become very familiar with its parameters and the format of the output.
      There are some variations in the default settings, the parameters that can be changed, and the format of the output.
      Last edited by blancha; 10-20-2015, 11:48 AM.

      Comment


      • #4
        Regarding your interest in GPU aligners, you might check out SOAP3:


        To tackle the exponentially increasing throughput of Next-Generation Sequencing (NGS), most of the existing short-read aligners can be configured to favor speed in trade of accuracy and sensitivity. SOAP3-dp, through leveraging the computational power of both CPU and GPU with optimized algorithms, delivers high speed and sensitivity simultaneously. Compared with widely adopted aligners including BWA, Bowtie2, SeqAlto, CUSHAW2, GEM and GPU-based aligners BarraCUDA and CUSHAW, SOAP3-dp was found to be two to tens of times faster, while maintaining the highest sensitivity and lowest false discovery rate (FDR) on Illumina reads with different lengths. Transcending its predecessor SOAP3, which does not allow gapped alignment, SOAP3-dp by default tolerates alignment similarity as low as 60%. Real data evaluation using human genome demonstrates SOAP3-dp's power to enable more authentic variants and longer Indels to be discovered. Fosmid sequencing shows a 9.1% FDR on newly discovered deletions. SOAP3-dp natively supports BAM file format and provides the same scoring scheme as BWA, which enables it to be integrated into existing analysis pipelines. SOAP3-dp has been deployed on Amazon-EC2, NIH-Biowulf and Tianhe-1A.


        I included the journal article for its table comparing performance of many of the aligners you'll read about.

        Good luck!

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin




          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
          04-22-2024, 07:01 AM
        • seqadmin
          Current Approaches to Protein Sequencing
          by seqadmin


          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
          04-04-2024, 04:25 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Yesterday, 11:49 AM
        0 responses
        15 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-24-2024, 08:47 AM
        0 responses
        16 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-11-2024, 12:08 PM
        0 responses
        61 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 10:19 PM
        0 responses
        60 views
        0 likes
        Last Post seqadmin  
        Working...
        X