Unconfigured Ad

Collapse
This topic is closed.
X
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • AdrianP
    Senior Member
    • Apr 2011
    • 130

    Quick Evaluator of Denovo Assemblies

    Hello,

    I was wonder if there is, or if it is possible to make a script that would quickly evaluate a given fasta file with contigs.

    Something that VelvetOptimiser spits out:

    Total number of contigs: 49644
    n50: 546
    length of longest contig: 21951
    Total bases in contigs: 25417931
    Number of contigs > 1k: 3773
    Total bases in contigs > 1k: 6821103

    I know you can get the n50 from a simple perl script:
    Code:
    perl -e 'my ($len,$total)=(0,0);my @x;while(<>){if(/^[\>\@]/){if($len>0){$total+=$len;push@x,$len;};$len=0;}else{s/\s//g;$len+=length($_);}}if ($len>0){$total+=$len;push @x,$len;}@x=sort{$b<=>$a}@x; my ($count,$half)=(0,0);for (my $j=0;$j<@x;$j++){$count+=$x[$j];if(($count>=$total/2)&&($half==0)){print "N50: $x[$j]\n";$half=$x[$j]}elsif($count>=$total*0.9){print "N90: $x[$j]\n";exit;}}' CONTIGS.fasta
    Anyone have any idea if such a thing exists? I am not a big scripter myself so I can't help myself now
  • flxlex
    Moderator
    • Nov 2008
    • 412

    #2
    Double posting in Bioinformatics thread - where you are getting relevant answers: http://seqanswers.com/forums/showthread.php?t=33684

    Comment

    Latest Articles

    Collapse

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by SEQadmin2, 06-09-2026, 11:58 AM
    0 responses
    25 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-05-2026, 10:09 AM
    0 responses
    30 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-04-2026, 08:59 AM
    0 responses
    39 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-02-2026, 12:03 PM
    0 responses
    62 views
    0 reactions
    Last Post SEQadmin2  
    Working...