
**westerman** · 03-19-2014, 09:06 AM

4MB is not a large file. Reading it line by line would be fine. However, your code was not included in your post, so it is hard to say what is wrong with it.

**akjones** · 03-19-2014, 10:59 AM

Sorry, it wasn't included because right now it doesn't do anything other than read the file, so I didn't think posting it would be helpful. I don't know how to go about solving the other problems I stated in my question, which is why I asked for input. But since you asked, here it is:

Code:

use Bio::SeqIO;

$seqio_obj = Bio::SeqIO->new(-file => "/Users/annaliesejones/Desktop/Assembly6contigs.fasta", -format => "fasta");

while ($seq_obj = $seqio_obj->next_seq) {
    print $seq_obj->seq;
}

Even that doesn't work well: it prints all the sequences together, without newlines or the fasta headers. But that's not really the point; I just wanted to see if I could read the file.

And now I'm stuck.

**westerman** · 03-19-2014, 11:14 AM

1) Use a newline after printing ... e.g., 'print $seq_obj->seq . "\n" ' ... many ways to do this.

2) Look at $seq_obj->display_id for your header information. From there you can use a regexp to pull out the information; e.g.,

Code:

(my $coverage) = $seq_obj->display_id =~ m/_cov_(.+)_ID/;
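
To show how that regexp could feed a sort, here is a minimal sketch that assumes the goal is to order contigs by the coverage value in the header, highest first. It works on plain header strings with core Perl only, so it runs without BioPerl. The header layout (NODE_..._cov_..._ID_...), the sample sequences, and the helper names coverage_of, parse_fasta and sort_by_coverage are all made up for illustration.

```perl
use strict;
use warnings;

# Pull the coverage value out of a contig header such as
# "NODE_1_length_8_cov_3.5_ID_1" (assumed, hypothetical layout).
sub coverage_of {
    my ($header) = @_;
    my ($coverage) = $header =~ m/_cov_(.+)_ID/;
    return $coverage;    # undef when the header does not match
}

# Parse FASTA text into a list of { id => ..., seq => ... } records.
# The id is the first whitespace-delimited word after ">".
sub parse_fasta {
    my ($text) = @_;
    my @records;
    for my $line (split /\n/, $text) {
        if ($line =~ /^>(\S+)/) {
            push @records, { id => $1, seq => '' };
        }
        elsif (@records) {
            $line =~ s/\s+//g;             # drop stray whitespace
            $records[-1]{seq} .= $line;    # join wrapped sequence lines
        }
    }
    return @records;
}

# Sort records by coverage, highest first.
# Records whose header has no coverage value sort last.
sub sort_by_coverage {
    my @records = @_;
    return sort {
        (coverage_of($b->{id}) // -1) <=> (coverage_of($a->{id}) // -1)
    } @records;
}

# Made-up example data.
my $fasta = <<'END';
>NODE_1_length_8_cov_3.5_ID_1
ACGTACGT
>NODE_2_length_4_cov_12.25_ID_3
GGCC
>NODE_3_length_6_cov_7_ID_5
TTAAGG
END

# Print each record as a header line followed by its sequence line.
for my $rec (sort_by_coverage(parse_fasta($fasta))) {
    print ">$rec->{id}\n$rec->{seq}\n";
}
```

With Bio::SeqIO the same idea should carry over: collect the sequence objects in an array, sort them on the coverage pulled from each header, then print the header and sequence for each one.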


sort contigs based on fasta header
