SEQanswers


gaster 05-22-2009 12:29 PM

Summary Statistics in Newbler?
 
The Mapper output comes with a handy xls file with a lot of great statistics in it. The Newbler output doesn't seem to have anything like this. I am particularly looking for statistics about the coverage in the contigs (avg, min, max, etc.).

Does anyone know where this info is, or a good program to use to find it?

sklages 05-23-2009 12:49 AM

Hi,

you might want to have a look at 454NewblerMetrics.txt (or csv) for assembly statistics and 454ReadStatus.txt to see what happened to your reads ...

454AlignmentInfo.tsv gives you a detailed, per-contig description that is suitable for parsing.

hth,
Sven

flxlex 05-25-2009 11:34 PM

And/or have Newbler produce the 454ContigGraph.txt file, which gives you the average coverage per contig (but not min/max). This can be done by setting the -g flag (note: this also lowers the minimum contig length to report from 100 bp to 1 bp...).

gaster 05-28-2009 07:17 AM

Thanks for your reply, Flxlex. A couple of questions: How do you set the -g flag? And is there no way to get min/max coverage?

Thanks.

gaster 05-28-2009 07:21 AM

Thanks, Sklages. The metrics files don't seem to have info about coverage, right?

The 454AlignmentInfo.tsv file only gives me partial info on one contig. Is that right? It can't be. Any ideas?

Thanks so much.

sklages 05-28-2009 09:33 AM

The 454NewblerMetrics.txt reports only common stats about the assembly.

And the tsv file does cover all contigs:

Code:

$ fgrep -c \>contig 454AlignmentInfo.tsv
430
$ fgrep -c \> 454AllContigs.fna
430

cheers,
Sven

kmcarr 05-28-2009 12:14 PM

gaster,

Are you using the Roche GUI (gsAssembler) to view the 454AlignmentInfo.tsv? If so, this viewer truncates the display at 50,000 lines. The file is located in the assembly subdirectory of your main project directory. You can view the entire file with a standard text editor. (Careful: if your assembly is large, this file will get huge.) You will need to process the file with some sort of script to extract and calculate the coverage information you asked about (avg, min, max, etc.).
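
As a rough illustration of the kind of script meant here, below is a minimal Python sketch. It is not a Roche tool, and it rests on assumptions about the file layout that you should check against your own 454AlignmentInfo.tsv: each contig is assumed to start with a line beginning with ">" (which is what the fgrep count above relies on), followed by one tab-separated row per position, with a depth value in the column given by `depth_column` (0-based). The function name `coverage_stats` and the default column index are hypothetical; look at the header line of your file and adjust the index accordingly.

```python
def coverage_stats(lines, depth_column=4):
    """Return {contig: (min_depth, max_depth, mean_depth, n_positions)}.

    ASSUMED layout (verify against your own file):
      - a line starting with '>' opens a new contig, name in the first field
      - every following row is one position, tab-separated
      - column `depth_column` (0-based) holds an integer depth
    Rows that do not fit (e.g. the header line) are skipped.
    """
    totals = {}      # contig name -> [min, max, sum, count]
    current = None   # name of the contig we are currently inside

    for raw in lines:
        line = raw.rstrip("\r\n")
        if line.startswith(">"):
            current = line[1:].split("\t")[0]
            continue
        if current is None:
            continue  # anything before the first contig, such as a header
        fields = line.split("\t")
        try:
            depth = int(fields[depth_column])
        except (IndexError, ValueError):
            continue  # not a per-position row
        entry = totals.get(current)
        if entry is None:
            totals[current] = [depth, depth, depth, 1]
        else:
            entry[0] = min(entry[0], depth)
            entry[1] = max(entry[1], depth)
            entry[2] += depth
            entry[3] += 1

    return {
        name: (lo, hi, total / count, count)
        for name, (lo, hi, total, count) in totals.items()
    }
```

Because the function reads one line at a time, it can be fed an open file handle directly (for example `with open("454AlignmentInfo.tsv") as handle: stats = coverage_stats(handle)`), so even a very large file does not need to fit in memory.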

