Seqanswers Leaderboard Ad

**westerman** · 02-25-2014, 06:40 AM

Your script is pulling in the sample_IDs with the '>' attached as well as the count. It then pulls in the sample_reads without the '>' attached. The program thus can not match up sample_IDs with sample_reads. So there are two problems here -- (1) you are not saving the counts and (2) you can not match up IDs.

The solution is to re-write the part where you have

$ids{$_} += 1;

Let us know you want more of a hint than that.

**bambus** · 02-25-2014, 06:54 AM

Does it mean that I have to create a hash of Ids or?

**westerman** · 02-25-2014, 06:59 AM

Yes, create the hash of IDs. You need to do two things:

1) Remove the '>'
2) Split out the counts from the read name and save the counts as the values in your hash.

**bambus** · 02-25-2014, 07:03 AM

Can you please help me how to proceed further to fulfill the steps you mentioned as I am not a very good programmer

**westerman** · 02-25-2014, 07:36 AM

The best way to become a better program is to experiment with your programs. :-)

That said, I would change the line:

$ids{$_} += 1;

To

my ($id, $count) = $_ =~ /^>*(\S+)\s+(\d+)/;
$ids{$id} = $count;

Note: I did not test the above. Basically you are taking the input line and looking for:
1) '>' (optional)
2) Characters (the id)
3) Whitespace
4) Digits (the count)
And then putting the id and count into your %ids hash

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 25 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 28 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 24 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 52 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

A program to extract the reads and modify the seq ID by adding weight

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News