I wanted to give you all the heads up on a new paper published in The ISME Journal this month (4, 852-861, July 2010), on a program for sorting and assembling next gen sequencing data called PANGEA. The paper was published by Eric Triplett's lab at the University of Florida in Gainesville.
A top level review of the article and links to the paper and source codes are at http://bit.ly/bCEyEc
Brief overview of PANGEA:
PANGEA is used to compile and align the huge datasets generated after 454/Roche next generation amplicon sequencing. In this publication, PANGEA was used to demonstrate assembly and annotation of barcoded 16S rRNA libraries for the purpose of microbial identification in a metagenomic analysis. PANGEA takes the sequencing data directly from the 454 and performs all the necessary steps to generate files used for sequence identification by BLAST. The program includes statistical analysis to look at diversity of communities as well.
*****
The authors describe many advantages of PANGEA over other web based programs. Looks like a fast and easy method to handle large data sets.
A top level review of the article and links to the paper and source codes are at http://bit.ly/bCEyEc
Brief overview of PANGEA:
PANGEA is used to compile and align the huge datasets generated after 454/Roche next generation amplicon sequencing. In this publication, PANGEA was used to demonstrate assembly and annotation of barcoded 16S rRNA libraries for the purpose of microbial identification in a metagenomic analysis. PANGEA takes the sequencing data directly from the 454 and performs all the necessary steps to generate files used for sequence identification by BLAST. The program includes statistical analysis to look at diversity of communities as well.
*****
The authors describe many advantages of PANGEA over other web based programs. Looks like a fast and easy method to handle large data sets.
Comment