SEQanswers

01-14-2010, 01:31 AM   #1
flxlex
Moderator
 
Location: Oslo, Norway

Join Date: Nov 2008
Posts: 415
Identification of Genomic Repeats and Sample Contamination in Assemblies of 454 Pyros

Identification and Quantification of Genomic Repeats and Sample Contamination in Assemblies of 454 Pyrosequencing Reads

AJ Nederbragt, TB Rounge, K. Kausrud, and KS Jakobsen

Contigs assembled from 454 reads of bacterial genomes show a range of read depths, and a number of contigs have a depth far higher than expected. For datasets with a reference genome sequence, there is a high correlation between the contig-specific read depth and the number of copies present in the genome. We developed a sequence of applied statistical analyses which suggest that the number of copies present can be reliably estimated from the read depth distribution in de novo genome assemblies. Read depths of contigs from de novo cyanobacterial genome assemblies were determined, and several contigs with high read depth were identified. These contigs were shown to contain mainly genes that are known to be present in multiple copies in bacterial genomes. For these assemblies, a correlation between read depth and copy number was demonstrated experimentally using real-time PCR. Copy number estimates obtained with the statistical analysis developed in this work are presented. Per-contig read depth analysis of assemblies based on 454 reads therefore enables de novo detection of genomic repeats and estimation of their copy number.
Additionally, our analysis efficiently identified contigs stemming from sample contamination, allowing for their removal from the assembly.
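
To give a feel for the idea of reading copy number off per-contig read depth, here is a minimal Python sketch. It is not the statistical analysis from the paper (the abstract does not spell that out). It simply assumes that most contigs are single-copy, takes the median depth as the single-copy baseline, and rounds each contig's depth ratio to the nearest integer. The function name, the input dictionary, and the `low_depth_fraction` cutoff for flagging possible contaminant contigs are all hypothetical choices made for illustration, as is the assumption that contaminant contigs show up at unusually low depth.

```python
from statistics import median


def estimate_copy_numbers(contig_depths, low_depth_fraction=0.25):
    """Rough per-contig copy number estimate from read depth.

    contig_depths: dict mapping contig name -> mean read depth.
    low_depth_fraction: hypothetical cutoff; contigs whose depth is below
        this fraction of the baseline are flagged as possible contaminants.

    Assumption: most contigs are single-copy, so the median depth
    approximates the depth of a single-copy sequence.
    """
    if not contig_depths:
        raise ValueError("no contigs given")

    baseline = median(contig_depths.values())
    if baseline <= 0:
        raise ValueError("baseline read depth must be positive")

    results = {}
    for name, depth in contig_depths.items():
        ratio = depth / baseline
        if ratio < low_depth_fraction:
            # Far below the single-copy depth: flag for manual inspection.
            results[name] = ("possible_contaminant", 0)
        else:
            # Nearest whole number of copies, never less than one.
            results[name] = ("genomic", max(1, round(ratio)))
    return results


# Made-up depths, for illustration only.
example_depths = {
    "contig1": 20.0,
    "contig2": 21.0,
    "contig3": 19.0,
    "contig4": 61.0,  # roughly three times the typical depth
    "contig5": 2.0,   # far below the typical depth
}
example_result = estimate_copy_numbers(example_depths)
for contig, (label, copies) in example_result.items():
    print(contig, label, copies)
```

With these made-up numbers, contig4 would be called a three-copy repeat and contig5 would be flagged for a closer look. A real analysis would need to account for depth variation along contigs, contig length, and the shape of the depth distribution, which is what the statistical treatment in the paper addresses.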

http://www.hindawi.com/journals/seq/2010/782465.html

DISCLAIMER: I am the lead author on this paper.
All times are GMT -8.

