Seqanswers Leaderboard Ad

**Nicolas** · 01-24-2012, 02:03 PM

You can use Bowtie (or BWA, or any short-read aligner). Create an index from the hairpin.fa file (bowtie-build) and map your fastq directly to the index. Specify the number of mismatches you want (up to 3).
Make sure that you remove the 3'end adapter sequences from your raw file, if necessary (fastx-toolkit does that pretty well, but there are other tools).

**Palgrave** · 01-25-2012, 03:09 AM

I made an bowtie_index from hairpin.fa and aligned mapped with bowtie useing following command $ bowtie <bowtie_hairpin> <my_input>
but i get the following message:

# reads processed: 23178672
# reads with at least one reported alignment: 86 (0.00%)
# reads that failed to align: 23178586 (100.00%)
Reported 86 alignments to 1 output stream(s)

The input-file is fastqsanger and has been processed by fastq-groomer in Galaxy.

**Nicolas** · 01-25-2012, 07:28 AM

fastq-groomer does not remove the 3'end adapter sequence present on the 3'end of your sequences (correct me if I am wrong).

What is the size of the sequences you tried to align?

If they are microRNA sequences, most of them should be in the range 20-24 nt. Most probably, your sequences are longer than that, and therefore, you should remove some nucleotides (a non-fixed number) on the 3'end.

On Galaxy, you should use the tool "clip" under "fastx-toolkit for fastq data". If you don't know the sequence of your adapter, you can either guess it by looking at your file, look around to find the most common ones, or use the tool "trim" to trim a fixed number of nucleotides at the end of your sequences (this is not recommended, since you'll lose useful nucleotides).

Keep me posted,

**Palgrave** · 01-25-2012, 11:28 AM

I have clipped adapters from 3'end and I have trimmed remaining reads so that they are between 18 and 24 nt long. Most of them should therefore be miRNAs.

Shouldnt it work to downloads one of the files from miRBase and do bowtie_build and then just run

$ bowtie <db_file> <input> <output>

**aggp11** · 01-25-2012, 02:25 PM

Palgrave,

I think the hairpin.fa has the sequences in A, U, G, C nucleotide format. Whereas, your sequence reads might have A,T,G,C. I don't know if this would matter with bowtie, but you might want to try converting the U to T in the ref and do index again and run an alignment.

P

**Palgrave** · 01-26-2012, 12:49 AM

Thanks, I think that will do it.
However the hairpin.fa file contains alot of artifacts like Y and R, so my tool is not able to convert this from RNA-DNA. How do I remove sequences which does not contain only AGCU?

**deepika123** · 04-14-2015, 05:48 AM

hello

I know this post was posted earlier.... but i want to know that how many percentage of reads were align on precursor. fa because i got same problem.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 31 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 32 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 28 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 53 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Map to miRBase

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News