Hello,
I was wonder if there is, or if it is possible to make a script that would quickly evaluate a given fasta file with contigs.
Something that VelvetOptimiser spits out:
Total number of contigs: 49644
n50: 546
length of longest contig: 21951
Total bases in contigs: 25417931
Number of contigs > 1k: 3773
Total bases in contigs > 1k: 6821103
I know you can get the n50 from a simple perl script:
Anyone have any idea if such a thing exists? I am not a big scripter myself so I can't help myself now
I was wonder if there is, or if it is possible to make a script that would quickly evaluate a given fasta file with contigs.
Something that VelvetOptimiser spits out:
Total number of contigs: 49644
n50: 546
length of longest contig: 21951
Total bases in contigs: 25417931
Number of contigs > 1k: 3773
Total bases in contigs > 1k: 6821103
I know you can get the n50 from a simple perl script:
Code:
perl -e 'my ($len,$total)=(0,0);my @x;while(<>){if(/^[\>\@]/){if($len>0){$total+=$len;push@x,$len;};$len=0;}else{s/\s//g;$len+=length($_);}}if ($len>0){$total+=$len;push @x,$len;}@x=sort{$b<=>$a}@x; my ($count,$half)=(0,0);for (my $j=0;$j<@x;$j++){$count+=$x[$j];if(($count>=$total/2)&&($half==0)){print "N50: $x[$j]\n";$half=$x[$j]}elsif($count>=$total*0.9){print "N90: $x[$j]\n";exit;}}' CONTIGS.fasta
Comment