**Mathgon** · 09-15-2010, 06:07 AM

I have the same problem with errors in homopolymers. Did you ever find a tool that corrects them by comparison with a reference genome?

Best regards
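For readers wondering what "correcting against a reference" could look like at its simplest, here is a minimal sketch, not a published tool. It assumes the read and the reference segment cover exactly the same region and differ only in homopolymer run lengths; the function names and the `min_run` threshold are made up for illustration. Real reads would first need a proper alignment to the reference.

```python
from itertools import groupby


def run_length_encode(seq):
    """Collapse a sequence into (base, run_length) pairs."""
    return [(base, len(list(group))) for base, group in groupby(seq.upper())]


def homopolymer_discrepancies(read, reference, min_run=3):
    """List runs whose length differs between read and reference.

    Assumption: both sequences span the same region. If their collapsed
    (run-length-free) base sequences disagree, the difference is not a pure
    homopolymer-length error and None is returned.
    Each hit is (reference_position, base, reference_length, read_length).
    """
    read_runs = run_length_encode(read)
    ref_runs = run_length_encode(reference)
    if [b for b, _ in read_runs] != [b for b, _ in ref_runs]:
        return None
    diffs = []
    ref_pos = 0
    for (base, read_len), (_, ref_len) in zip(read_runs, ref_runs):
        # Only report runs long enough to count as homopolymers.
        if read_len != ref_len and max(read_len, ref_len) >= min_run:
            diffs.append((ref_pos, base, ref_len, read_len))
        ref_pos += ref_len
    return diffs


print(homopolymer_discrepancies("ACGTTTGCAAAAC", "ACGTTTTGCAAAC"))
# [(3, 'T', 4, 3), (9, 'A', 3, 4)]
```

Each reported tuple says where in the reference the run starts and how long it is on each side, which is enough to rewrite the read's run to the reference length if that is what you want.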

**SeaJane** · 03-18-2011, 06:26 PM

AmpliconNoise (http://code.google.com/p/ampliconnoise/downloads/list) is the best tool I've found for removing homopolymer errors.

**mbakker** · 07-05-2011, 01:44 PM

running AmpliconNoise

SeaJane, have you been able to get AmpliconNoise up and running?

**colindaven** · 07-07-2011, 04:58 AM

I can install AmpliconNoise, yet all the steps for running it do seem daunting, especially with the number of datasets I have.

Has anyone got any scripts etc. available for automation? After all, this is a fairly common task for 454 users.
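One generic way to handle many datasets is a small batch driver. The sketch below deliberately does not contain any AmpliconNoise commands; it assumes you already have a working per-dataset command or wrapper script (the `./run_one_dataset.sh` name in the usage comment is hypothetical) and only loops it over every `.sff` file in a directory.

```python
import shlex
import subprocess
from pathlib import Path


def build_commands(input_dir, command_template, pattern="*.sff"):
    """Fill a command template once per input file.

    The template may use {input} (full path) and {stem} (file name without
    extension). Both placeholders are shell-quoted before splitting.
    """
    commands = []
    for path in sorted(Path(input_dir).glob(pattern)):
        filled = command_template.format(
            input=shlex.quote(str(path)),
            stem=shlex.quote(path.stem),
        )
        commands.append(shlex.split(filled))
    return commands


def run_all(commands):
    """Run each command in turn and return the ones that failed."""
    failures = []
    for cmd in commands:
        result = subprocess.run(cmd)
        if result.returncode != 0:
            failures.append(cmd)
    return failures


# Usage (hypothetical wrapper script that runs your pipeline on one file):
# failed = run_all(build_commands("data/", "./run_one_dataset.sh {input} {stem}"))
```

Running the datasets sequentially keeps the log readable; collecting failures instead of stopping at the first one means a single bad file does not block the rest of the batch.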

**aartacho** · 03-12-2012, 04:46 AM

HMM-FRAME: 'state of the art' in pyrosequencing frameshift correction and protein domain classification (it includes a 454 error model in the Viterbi HMMER algorithm). Unlike AmpliconNoise, it only corrects coding reads.

HMM-FRAME: accurate protein domain classification for metagenomic sequences containing frameshift errors - BMC Bioinformatics

http://www.biomedcentral.com/1471-2105/12/198

Background: Protein domain classification is an important step in metagenomic annotation. The state-of-the-art method for protein domain classification is profile HMM-based alignment. However, the relatively high rates of insertions and deletions in homopolymer regions of pyrosequencing reads create frameshifts, causing conventional profile HMM alignment tools to generate alignments with marginal scores. This makes error-containing gene fragments unclassifiable with conventional tools. Thus, there is a need for an accurate domain classification tool that can detect and correct sequencing errors.

Results: We introduce HMM-FRAME, a protein domain classification tool based on an augmented Viterbi algorithm that can incorporate error models from different sequencing platforms. HMM-FRAME corrects sequencing errors and classifies putative gene fragments into domain families. It achieved high error detection sensitivity and specificity in a data set with annotated errors. We applied HMM-FRAME in Targeted Metagenomics and a published metagenomic data set. The results showed that our tool can correct frameshifts in error-containing sequences, generate much longer alignments with significantly smaller E-values, and classify more sequences into their native families.

Conclusions: HMM-FRAME provides a complementary protein domain classification tool to conventional profile HMM-based methods for data sets containing frameshifts. Its current implementation is best used for small-scale metagenomic data sets. The source code of HMM-FRAME can be downloaded at http://www.cse.msu.edu/~zhangy72/hmmframe/ and at https://sourceforge.net/projects/hmm-frame/ .
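To make the frameshift problem from the abstract concrete, here is a small illustration (not part of HMM-FRAME, and not its algorithm): a single extra base in a homopolymer run shifts the reading frame, so every downstream codon translates differently. The example sequences are invented; the codon table is the standard genetic code.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate frame 1 of a DNA string; trailing partial codons are dropped."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


true_read = "ATGGCCAAAAGTCTGGAT"    # contains an AAAA homopolymer
error_read = "ATGGCCAAAAAGTCTGGAT"  # same read with one extra A in the run

print(translate(true_read))   # MAKSLD
print(translate(error_read))  # MAKKSG
```

Everything after the homopolymer is translated in the wrong frame, which is why a protein-level aligner scores the downstream part of such a read poorly unless it models the indel.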


Identifying (and removing) 454 homopolymers frameshifts
