#1 |
Member
Location: USA Join Date: Nov 2012
Posts: 51
Hi,
I have a multi-FASTA file whose headers look like ">WLPTB:00031:00193;size=517;". I want to extract just the "size=517" portion from every header. Any help?
Best,
Chayan
#2 |
Member
Location: USA Join Date: Nov 2012
Posts: 51
If possible, what I really want is to extract only the integer "517" that follows "size=".
Thanks
#3 |
Senior Member
Location: Cambridge, UK Join Date: May 2010
Posts: 311
#4 |
Devon Ryan
Location: Freiburg, Germany Join Date: Jul 2011
Posts: 3,480
Something like this should work:
Code:
grep ">" foo.fasta | cut -d "=" -f 2 | cut -d ";" -f 1
Last edited by dpryan; 01-05-2015 at 01:25 AM. Reason: Forgot "-f"!
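The grep/cut pipeline above relies on "size=" being the only "=" in the header. As a rough alternative sketch, a sed substitution can match the "size=" field by name, which covers both requests in this thread (the whole "size=517" field, or just the integer). The function names `extract_size_fields` and `extract_sizes` are hypothetical helpers made up for illustration, and the sketch assumes each header carries at most one "size=" field.

```shell
# Hypothetical helpers, not part of any existing tool.
# Both assume at most one "size=<digits>" field per FASTA header line.

# Print the whole "size=<digits>" field from each header, e.g. size=517
extract_size_fields() {
    # -n suppresses normal output; the trailing p prints only lines
    # where the substitution matched (headers that contain size=<digits>).
    sed -n 's/^>.*\(size=[0-9][0-9]*\).*$/\1/p' "$1"
}

# Print only the integer that follows "size=", e.g. 517
extract_sizes() {
    sed -n 's/^>.*size=\([0-9][0-9]*\).*$/\1/p' "$1"
}
```

With the file name from the post above, usage would be `extract_sizes foo.fasta`. Sequence lines and headers without a "size=" field produce no output.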
#5 |
Devon Ryan
Location: Freiburg, Germany Join Date: Jul 2011
Posts: 3,480
Behold, the usefulness and flexibility of standard Unix command-line tools!