**maasha** · 11-10-2011, 03:26 AM

You could run a couple of rounds of find_adaptor - which is pretty clever - from Biopieces (www.biopieces.org).

**Giorgio C** · 11-10-2011, 04:02 AM

Thank you, it's a very good tool. But the problem is that some adaptors occur in the middle of the sequences, because the reads come from a concatemer-based experimental design (they are miRNAs between NNNNNN...). So I am looking for a script or tool that reports how many reads have 1 adaptor, how many have 2, and so on (the maximum is 4), relative to the total number of reads. Do you know any tool or script that may help? Thanks

**maasha** · 11-10-2011, 04:14 AM

Hm, perhaps patscan_seq then:

Code:

read_fasta -i data.fna | patscan_seq -ip "p1=<adaptor sequence> 0...50 p1 0...50 p1 0...50 p1" | write_fasta -xo got4_adaptors.fna

Code:

read_fasta -i data.fna | patscan_seq -ip "p1=<adaptor sequence> 0...50 p1 0...50 p1" | write_fasta -xo got3_adaptors.fna

etc

Of course, you need to separate the sequences into piles. Clever use of grab should do that.

Martin
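For readers without Biopieces at hand, the "piles" idea can be sketched in plain awk. This is only an illustration, not the grab-based approach Martin has in mind: the function name `split_by_adaptor_count` and the `got<n>_adaptors.fna` naming are assumptions modelled on the file names above, matching is exact (no mismatches, and no 0...50 spacing constraint as in the patscan_seq patterns), and the adaptor string is interpreted as a regular expression.

```shell
# Hypothetical helper: split_by_adaptor_count ADAPTOR FASTA [PREFIX]
# Writes each record to PREFIX<n>_adaptors.fna, where n is the number of
# non-overlapping matches of ADAPTOR in the unwrapped sequence.
# Output files are appended to, so remove old ones before re-running.
split_by_adaptor_count() {
    awk -v adaptor="$1" -v prefix="${3:-got}" '
        function emit(   tmp, n, out) {
            if (header == "") return        # nothing read yet
            tmp = seq
            n = gsub(adaptor, "", tmp)      # gsub returns the number of matches
            out = prefix n "_adaptors.fna"
            print header >> out
            print seq >> out
            close(out)
        }
        /^>/ { emit(); header = $0; seq = ""; next }   # new record starts
             { seq = seq $0 }                          # join wrapped lines
        END  { emit() }
    ' "$2"
}

# Example call (file name and adaptor are placeholders):
#   split_by_adaptor_count "<adaptor sequence>" data.fna
```

Counting the headers in each output file (for example with `grep -c '^>'`) then gives the number of reads per pile.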

**Giorgio C** · 11-10-2011, 05:17 AM

Wonderful Thanks !!!

**maasha** · 11-10-2011, 06:32 AM

Of course, you could perhaps also use a regex of some sort: egrep, agrep or nrgrep spring to mind. You could also use grab --regex in Biopieces.
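A rough sketch of the grep route, assuming one sequence line per record and a grep that supports `-o` (GNU and BSD grep do; it is not required by POSIX). The function name `adaptor_histogram` is made up for illustration, and `grep -E` is used in place of the older `egrep` spelling. Reads with zero matches do not appear in the output.

```shell
# Hypothetical helper: adaptor_histogram PATTERN FASTA
# Prints "<number of reads> <matches per read>" for reads with >= 1 match.
adaptor_histogram() {
    grep -v '^>' "$2" |        # drop FASTA header lines
        grep -n -o -E "$1" |   # one output line per match, prefixed "lineno:"
        cut -d: -f1 |          # keep only the line number
        sort -n | uniq -c |    # matches per read
        awk '{ print $1 }' |   # keep the per-read match count
        sort -n | uniq -c      # reads per match count
}

# Example call (file name and pattern are placeholders):
#   adaptor_histogram 'N+' myFile.fasta
```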

**Giorgio C** · 11-10-2011, 06:48 AM

Thanks maasha,

Another quick way:

nawk -F'[N]+' '/^[^>]/{a[NF-1]++}END{for(i in a) print a[i] " have " i " ADAPTOR"}' myFile.fasta > result.txt

**Giorgio C** · 11-10-2011, 07:01 AM

Also in perl with the same result:

perl -ne '$count{s/N+//g || 0}++ if /^[^>]/;END{for $i (keys %count){print "$count{$i} have $i ADAPTOR\n";}}' myFile.fasta > result.txt
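Both one-liners count runs of N per line, so they assume each read sits on a single line. A possible variant for wrapped FASTA, which also reports each class as a share of all reads, might look like the sketch below. The function name `nrun_histogram` and the tab-separated output layout are assumptions, not something taken from the posts above.

```shell
# Hypothetical helper: nrun_histogram FASTA
# Joins wrapped sequence lines per record, counts runs of N, and prints
# "<N-runs>\t<reads>\t<percent of all reads>", sorted by number of runs.
nrun_histogram() {
    awk '
        function tally(   tmp) {
            if (!started) return                # no record seen yet
            tmp = seq
            count[gsub(/N+/, "", tmp)]++        # gsub returns the number of runs
            total++
        }
        /^>/ { tally(); started = 1; seq = ""; next }
             { seq = seq $0 }
        END  {
            tally()
            for (n in count)
                printf "%d\t%d\t%.1f\n", n, count[n], 100 * count[n] / total
        }
    ' "$1" | sort -n
}

# Example call (file name is a placeholder):
#   nrun_histogram myFile.fasta > result.txt
```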

Scripting help to identify adaptors in reads