![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
no cuffcompare .stats with more than 6 samples | sudders | Bioinformatics | 5 | 05-07-2014 03:29 AM |
Velvet assembly stats | NGS_New_User | Bioinformatics | 1 | 08-02-2013 03:58 PM |
plinkseq v stats | pepsimax | Bioinformatics | 0 | 11-28-2012 08:23 AM |
stats from sam/bam | chrishah | Bioinformatics | 7 | 10-23-2012 11:40 AM |
Simple stats question | jvanleuven | Bioinformatics | 4 | 05-14-2012 07:53 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Senior Member
Location: Ottawa Join Date: Apr 2011
Posts: 130
|
![]()
Hello,
I was wonder if there is, or if it is possible to make a script that would quickly evaluate a given fasta file with contigs. Something that VelvetOptimiser spits out: Total number of contigs: 49644 n50: 546 length of longest contig: 21951 Total bases in contigs: 25417931 Number of contigs > 1k: 3773 Total bases in contigs > 1k: 6821103 I know you can get the n50 from a simple perl script: Code:
perl -e 'my ($len,$total)=(0,0);my @x;while(<>){if(/^[\>\@]/){if($len>0){$total+=$len;push@x,$len;};$len=0;}else{s/\s//g;$len+=length($_);}}if ($len>0){$total+=$len;push @x,$len;}@x=sort{$b<=>$a}@x; my ($count,$half)=(0,0);for (my $j=0;$j<@x;$j++){$count+=$x[$j];if(($count>=$total/2)&&($half==0)){print "N50: $x[$j]\n";$half=$x[$j]}elsif($count>=$total*0.9){print "N90: $x[$j]\n";exit;}}' CONTIGS.fasta ![]() |
![]() |
![]() |
![]() |
#2 |
Senior Member
Location: US Join Date: Dec 2010
Posts: 452
|
![]()
The assemblathon_stats.pl script from here is what I am using:
https://github.com/ucdavis-bioinform...thon2-analysis It requires the FAlite.pm module from the same page. |
![]() |
![]() |
![]() |
#3 |
Senior Member
Location: Denmark Join Date: Apr 2009
Posts: 153
|
![]() |
![]() |
![]() |
![]() |
#4 |
Moderator
Location: Oslo, Norway Join Date: Nov 2008
Posts: 415
|
![]()
Or Quast (even without a reference): http://quast.bioinf.spbau.ru/
|
![]() |
![]() |
![]() |
#5 |
Member
Location: Cambridge, UK Join Date: Dec 2011
Posts: 48
|
![]()
leaff --stats is also nice, and very fast.
|
![]() |
![]() |
![]() |
#6 |
Senior Member
Location: Boston area Join Date: Nov 2007
Posts: 747
|
![]()
You show a pretty mongo Perl one-liner but "aren't good at scripting"? Perhaps you are selling yourself short.
|
![]() |
![]() |
![]() |
Thread Tools | |
|
|