Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Adapter searching

    Hi,
    We are having Illumina paired end sequenced data with adapters in both 3' and 5' tags.

    Is there any open source software to fish for adapters in both 3' (7 bps long) and 5'(6 bps long). Rght now we are using smith-waterman to do local alignment allowing 2 mismatches. Tags are from Illumina sequencing and hence 3' or 5' adapter cab be found in eirther read1 or read2 of Illumina paired end sequnced data.

    The whole process is very slow because of smith-waterman.

    I am looking for a tool which can look for both 3' and 5' adapter allowing 2 mismatches in the adapter.

    Thanks.

  • #2
    Here are some alternatives, some of these can probably do the job: http://biostar.stackexchange.com/que...ation-illumina

    Comment


    • #3
      Hi seq_GA,

      Some of the trimming algorithms use Hamming distance (only looking at mismatches and not indels). These would probably be faster than a Smith-Waterman since the search space is smaller.

      For methods that accommodate indels, you might try Btrim: it uses an optimized Smith-Waterman implementation using bit-level parallelism. I haven't tried the program myself (only available on Linux and no source code), but I think it is an interesting approach.

      Justin

      Comment


      • #4
        cutadapt is pretty good too : http://code.google.com/p/cutadapt/

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin




          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
          04-22-2024, 07:01 AM
        • seqadmin
          Current Approaches to Protein Sequencing
          by seqadmin


          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
          04-04-2024, 04:25 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Yesterday, 08:47 AM
        0 responses
        14 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-11-2024, 12:08 PM
        0 responses
        60 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 10:19 PM
        0 responses
        60 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 09:21 AM
        0 responses
        54 views
        0 likes
        Last Post seqadmin  
        Working...
        X