I would suggest primer_match. It is simple to use and flexible in its output.
The primer in this case would be your 20 bp sequence. The program then lets you adjust the output to display position, entry name, or counts.
(I am in no way associated with the Edwards lab, just a frequent user.)
-
You can use BLAST with the parameter -W 20, which sets the word size to 20 so that BLAST reports only hits containing an exact match of at least 20 bp. One important thing: deactivate the low-complexity filter (-F F).
Thanks
André
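For readers on the newer BLAST+ command-line tools, the suggestion above could be sketched roughly as follows. The -W and -F options in the post belong to the legacy blastall program; in BLAST+ the corresponding blastn options are -word_size and -dust. The file names, the function name `build_blastn_command`, the default word size, and the E-value threshold are all assumptions for illustration, and the command is only built here, not run.

```python
from typing import List


def build_blastn_command(
    query_fasta: str,
    subject_fasta: str,
    word_size: int = 7,
    evalue: float = 10.0,
) -> List[str]:
    """Build (but do not run) a blastn argument list for a short query.

    Hypothetical helper: the paths and default values are placeholders.
    """
    return [
        "blastn",
        "-task", "blastn-short",         # task tuned for short queries
        "-query", query_fasta,           # FASTA file with the ~20 bp sequence
        "-subject", subject_fasta,       # FASTA file to search, no database build needed
        "-word_size", str(word_size),    # BLAST+ counterpart of legacy -W
        "-dust", "no",                   # BLAST+ counterpart of legacy -F F for nucleotides
        "-evalue", str(evalue),          # assumption: may need tuning for short queries
        "-outfmt", "6 sseqid sstart send mismatch gaps btop",  # tabular output
    ]


if __name__ == "__main__":
    # Placeholder file names; print the command so it can be inspected first.
    print(" ".join(build_blastn_command("query.fa", "16S.fa")))
```

The returned list can be passed to `subprocess.run` once BLAST+ is installed. A word size equal to the query length would only recover exact matches, so a smaller value is used as the default in this sketch.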
-
Have you tried BLAST or BLAT? Those tools are designed for searching a small number of query sequences against a large database of many different sequences.
-
Find all occurrences of a sequence in a fasta file
I have a fasta file with 16S sequences from many organisms. I want to find all occurrences of a certain ~20 bp sequence in this fasta file. I could do a simple text search but I would prefer to allow some flexibility in the matches.
For each match I would like the following information
1) fasta entry name
2) position in the sequence
3) CIGAR string or some other representation of the alignment
A SAM file would be fine. I tried using bowtie2 with "-a" but it never seemed to finish. Through trial and error I found that setting "-k" to 150 worked fine but setting "-k" to 200 did not, indicating to me that there is probably some upper limit to the number of matches per query that it can report.
I am certain that what I want to do is commonly done by many people here on the site. What is the easiest/best way to go about it?
Thanks so much.
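For a file of modest size, the request above (entry name, position, and an alignment string for every approximate match) can also be sketched in plain Python. This is a minimal sketch under stated assumptions, not a replacement for an aligner: it allows mismatches only (no insertions or deletions), searches the forward strand only, and scans every window by brute force. The function names and the `max_mismatches` parameter are made up for this example. The alignment is written with the SAM extended CIGAR operators `=` (match) and `X` (mismatch).

```python
from typing import Iterable, Iterator, List, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (entry name, sequence) pairs from FASTA-formatted lines."""
    name, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            # Entry name is the first whitespace-delimited token of the header.
            name = line[1:].split(maxsplit=1)[0] if len(line) > 1 else ""
            chunks = []
        else:
            chunks.append(line.upper())
    if name is not None:
        yield name, "".join(chunks)


def cigar_like(query: str, window: str) -> str:
    """Run-length encode per-base matches (=) and mismatches (X)."""
    ops = ["=" if q == w else "X" for q, w in zip(query, window)]
    parts = []
    run_op, run_len = ops[0], 0
    for op in ops:
        if op == run_op:
            run_len += 1
        else:
            parts.append(f"{run_len}{run_op}")
            run_op, run_len = op, 1
    parts.append(f"{run_len}{run_op}")
    return "".join(parts)


def find_matches(
    fasta_lines: Iterable[str], query: str, max_mismatches: int = 2
) -> List[Tuple[str, int, int, str]]:
    """Return (entry name, 1-based position, mismatches, CIGAR-like string)."""
    if not query:
        raise ValueError("query must not be empty")
    query = query.upper()
    hits = []
    for name, seq in read_fasta(fasta_lines):
        for start in range(len(seq) - len(query) + 1):
            window = seq[start:start + len(query)]
            mismatches = sum(q != w for q, w in zip(query, window))
            if mismatches <= max_mismatches:
                hits.append((name, start + 1, mismatches, cigar_like(query, window)))
    return hits


if __name__ == "__main__":
    example = [">seq1 example entry", "ACGTACGTAC", "GTTT", ">seq2", "acgaacgt"]
    for hit in find_matches(example, "ACGTACGT", max_mismatches=1):
        print(*hit, sep="\t")
```

To run it on a real file, pass an open file handle, for example `find_matches(open("16S.fa"), query)`, where `16S.fa` is a placeholder name. Reporting every window independently sidesteps any per-query cap on the number of reported hits, at the cost of speed on large inputs.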