
**chayan** · 01-05-2015, 12:10 AM

And, if possible, I basically want to extract only the integer "517" that follows the "size=" portion.

Thanks

**dariober** · 01-05-2015, 12:49 AM

Originally posted by chayan:

And, if possible, I basically want to extract only the integer "517" that follows the "size=" portion.

Thanks

This should do it (seq.fa is your FASTA file):

Code:

grep -P -o 'size=\d+' seq.fa | sed 's/size=//'
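If your grep does not support -P (the BSD grep shipped with macOS, for example, lacks it), a sed-only variant along these lines might do the same job. This is just a sketch: sample.fa, its header layout, and the extract_sizes function name are made up for illustration and are not from the original question.

```shell
# Hypothetical sample file; the ">name;size=N;" header layout is an assumption.
printf '>seq1;size=517;\nACGTACGT\n>seq2;size=42;\nGGCC\n' > sample.fa

# Print only the digits that follow "size=" on header lines.
# -n suppresses default output; the trailing p prints only lines where
# the substitution matched, so sequence lines are skipped.
extract_sizes() {
  sed -n 's/^>.*size=\([0-9][0-9]*\).*/\1/p' "$1"
}

extract_sizes sample.fa
```

Headers without a "size=" field are silently skipped, which may or may not be what you want.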

**dpryan** · 01-05-2015, 12:49 AM

Something like this should work, provided "size=" holds the first "=" in each header and the number is followed by ";":

Code:

grep ">" foo.fasta | cut -d "=" -f 2 | cut -d ";" -f 1

You could also just use Biopython or BioPerl, which would make it easier to keep these values associated with their sequences, if that's needed.
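If all you need is the size next to its sequence name, you can probably stay in the shell. Here is a rough awk sketch that prints the ID and the size as two tab-separated columns. It assumes headers look like ">name;size=N;" (the name runs up to the first ";"); sample.fa and the id_and_size function are hypothetical, so adjust to your actual headers.

```shell
# Hypothetical sample file; the ">name;size=N;" header layout is an assumption.
printf '>seq1;size=517;\nACGT\n>seq2;size=42;\nGGCC\n' > sample.fa

# For each header line, print "<id><TAB><size>".
id_and_size() {
  awk '/^>/ {
    id = substr($0, 2)        # drop the leading ">"
    sub(/;.*/, "", id)        # keep everything before the first ";"
    if (match($0, /size=[0-9]+/))
      # skip the 5 characters of "size=" and keep only the digits
      print id "\t" substr($0, RSTART + 5, RLENGTH - 5)
  }' "$1"
}

id_and_size sample.fa
```

For anything beyond this, such as filtering the sequences themselves by size, a proper parser like the ones mentioned above is likely the safer route.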

**dpryan** · 01-05-2015, 12:50 AM

Behold, the usefulness and flexibility of standard unix command line tools!

Extraction of a portion of a fasta header