
**maven** · 08-15-2011, 07:46 AM

Take it to your support rep. Newbler's genome size estimate is calculated directly from the coverage depth histogram immediately preceding it in the 454NewblerMetrics.txt file, so if that histogram is skewed for some reason, the size estimate will be off. 2000 contigs for a 4.5 Mb genome is very bad, so something about your experiment is probably causing it to contig poorly.

The first thing that springs to mind is that your 1.3 million reads seem like way more than you really need. If they are 400 bp each and they all derive from your genome of interest, that's something like 115x coverage of a 4.5 Mb genome. Newbler likes 20-35x the best, so try assembling less data.

Second, if you have contaminants from other genomes, then you could really be looking at a combination of contigs from your genome of interest plus a whole lot of garbage from the contamination. The total contig length might give you a clue about this, as would the size distribution and coverage distribution of the contigs (check 454ContigGraph.txt).

Finally, if the genome is highly repetitive or extremely biased in AT content (like P. falciparum, which is something like 80% AT), then it might be really hard to get good contigging no matter what you do.

Do any of these scenarios seem likely to you?
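For reference, the coverage arithmetic above can be sketched in a few lines of Python. This is only a back-of-the-envelope calculation, assuming every read is 400 bp and derives from the 4.5 Mb genome; the function names are made up for illustration and are not part of Newbler.

```python
def estimated_coverage(num_reads, mean_read_len_bp, genome_size_bp):
    """Expected fold coverage if every read derives from the target genome."""
    return num_reads * mean_read_len_bp / genome_size_bp


def subsample_fraction(current_cov, target_cov):
    """Fraction of reads to keep to reach target_cov (never more than all of them)."""
    return min(1.0, target_cov / current_cov)


# Numbers from the post: 1.3 million reads, ~400 bp each, ~4.5 Mb genome.
NUM_READS = 1_300_000
READ_LEN_BP = 400
GENOME_SIZE_BP = 4_500_000

cov = estimated_coverage(NUM_READS, READ_LEN_BP, GENOME_SIZE_BP)
print(f"estimated coverage: {cov:.1f}x")

# Range the post says Newbler handles best.
for target in (20, 35):
    frac = subsample_fraction(cov, target)
    print(f"{target}x -> keep {frac:.1%} of reads (~{round(frac * NUM_READS):,} reads)")
```

So, under those assumptions, a random subset of roughly 225,000 to 394,000 reads would land in the 20-35x range. If contamination is present, the true coverage of the target genome will be lower than this estimate.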



De novo trouble
