**lbeltrame** · 02-01-2012, 02:44 AM

I forgot to add that I generated the reference with bowtie-build directly from the hairpin FASTA file downloaded from miRBase.

**fseifert** · 02-01-2012, 04:07 AM

You might have to replace U with T in the miRBase sequences, rebuild the index, and try again.
How was the sequence library prepared? Might it contain sRNAs, snoRNAs, or degradation products from rRNA/tRNA? Did you use the mature or the stem-loop sequences as the mapping target?

**lbeltrame** · 02-01-2012, 06:23 AM

I noticed the U/T issue. I will convert the sequences and try again. I also realized (yes, I really feel stupid about it) that I need to remove the non-human sequences, as the miRBase file covers all species.

I used the hairpin (i.e. immature) sequences, as I get even fewer hits on the mature ones (and the mature ones are also shorter).

EDIT: Converting U to T and removing the non-human sequences raised the alignment rate to 17%, although that's still low. As for the library, the transcripts were selected using a miRNA purification kit, followed by size fractionation and selection of RNAs between 19 and 29 bp.
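
The two clean-up steps described in this post (keeping only human entries and converting U to T) can be sketched in Python. This is only a minimal sketch, not the script actually used in the thread. It assumes that human records in the miRBase hairpin file have IDs starting with `hsa-`; the function names are made up for illustration.

```python
def iter_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def human_dna_records(lines, prefix="hsa-"):
    """Keep records whose ID starts with `prefix` (assumed to mark human
    entries) and rewrite the RNA alphabet as DNA (U -> T)."""
    for header, seq in iter_fasta(lines):
        if header.startswith(prefix):
            yield header, seq.upper().replace("U", "T")


def write_fasta(records, out):
    """Write (header, sequence) pairs to an open file, one line per sequence."""
    for header, seq in records:
        out.write(f">{header}\n{seq}\n")
```

To use it, open the downloaded hairpin FASTA for reading and a new file for writing, pass `human_dna_records(input_file)` to `write_fasta`, and then run bowtie-build on the new file.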

**Palgrave** · 02-01-2012, 11:19 AM

The reference might contain ambiguous bases, like T, Y, etc.
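
One way to check this suggestion is to tally every character in the reference that is not A, C, G, or T. The sketch below is a hypothetical helper, not part of bowtie or miRBase. It also flags any U that was left unconverted.

```python
from collections import Counter


def non_acgt_counts(lines):
    """Return {sequence_id: Counter} of characters other than A, C, G, T.

    Sequences made up only of A/C/G/T are left out of the result.
    """
    report = {}
    seq_id = None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            fields = line[1:].split()
            seq_id = fields[0] if fields else ""
        elif seq_id is not None:
            odd = Counter(ch for ch in line.upper() if ch not in "ACGT")
            if odd:
                report.setdefault(seq_id, Counter()).update(odd)
    return report
```

An empty result means the reference uses only the four unambiguous DNA letters.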

**epi** · 02-01-2012, 01:12 PM

What's your read length?
Remember that bowtie cannot align a read if the reference sequence is shorter than the query.
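
To see how much this point matters for a given reference, one can count how many reference sequences are shorter than a given read length. This is a minimal sketch with hypothetical function names. It only compares lengths and makes no claim about bowtie's internals.

```python
from bisect import bisect_left


def fasta_lengths(lines):
    """Return the length of every sequence in an iterable of FASTA lines."""
    lengths = []
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            lengths.append(0)
        elif line and lengths:
            lengths[-1] += len(line)
    return lengths


def share_of_refs_too_short(ref_lengths, read_length):
    """Fraction of references strictly shorter than `read_length`."""
    if not ref_lengths:
        return 0.0
    ordered = sorted(ref_lengths)
    # bisect_left gives the number of entries smaller than read_length
    return bisect_left(ordered, read_length) / len(ordered)
```

A high share for the typical read length would suggest that the reference set (for example, mature sequences only) is too short for many reads to fit.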

**Kennels** · 02-01-2012, 08:07 PM

From experience with small RNA datasets, a great many reads will have exactly the same sequence.
While your results may suggest that most of your reads did not align, remember that you collapsed your sequences to begin with.
Have a look at the read ID of each aligned sequence, which should tell you how many copies of that sequence there are.
It might be that your 600-odd aligned sequences actually represent millions of reads (or at least a lot more than your unaligned sequences).
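
The check suggested here can be sketched as a small script that weights each SAM record by the copy number stored in its read ID. Two assumptions are made and are not confirmed by the thread: the collapsed IDs look like `rank-count` (for example `1-12345`, the style written by fastx_collapser), and unaligned reads appear in the SAM file with flag bit 0x4 set. Function names are hypothetical.

```python
def collapsed_count(read_id):
    """Copy number from an ID such as '17-2045' (assumed rank-count format)."""
    return int(read_id.rsplit("-", 1)[1])


def weighted_alignment_summary(sam_lines):
    """Return (aligned_reads, unaligned_reads, aligned_fraction),
    counting every collapsed sequence as many times as its ID says."""
    aligned = unaligned = 0
    seen = set()
    for line in sam_lines:
        if not line.strip() or line.startswith("@"):
            continue  # skip blank lines and SAM header lines
        fields = line.rstrip("\n").split("\t")
        name, flag = fields[0], int(fields[1])
        if name in seen:
            continue  # count each collapsed sequence only once
        seen.add(name)
        copies = collapsed_count(name)
        if flag & 4:  # 0x4 = read unmapped
            unaligned += copies
        else:
            aligned += copies
    total = aligned + unaligned
    rate = aligned / total if total else 0.0
    return aligned, unaligned, rate
```

If the weighted rate is much higher than the rate bowtie prints for the collapsed file, the unaligned sequences are mostly low-copy ones.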

**lbeltrame** · 02-07-2012, 08:35 AM

Thanks for the suggestions. I'll give it a go this week and see what comes out.

**mnkyboy** · 02-07-2012, 09:13 AM

You probably have a lot of non-miRNA sequences in your data. Have you tried mapping just to the genome? We see a ton of non-miRNA small RNA in our small RNA sequencing.

**lbeltrame** · 02-08-2012, 07:42 AM

Indeed, mapping to the genome gets a larger yield (47% for perfect matches, and 64% allowing one mismatch, with minimum length being 15):

Code:

bowtie hg19 -f -n 1 -l 15 -p 4 VB09121_Pool2/BarcodeCTTA.collapsed.fa -S Pool2_CTTA_human_aligned.sam
# reads processed: 56581
# reads with at least one reported alignment: 36637 (64.75%)
# reads that failed to align: 19944 (35.25%)
Reported 36637 alignments to 1 output stream(s)

Also, to answer other questions in the thread, I've mapped the length distribution of my reads and the vast majority are between 22 and 24bp in length.

EDIT: Using the uncollapsed sequences gives me a ~96% alignment rate with the same parameters (against the genome). I'd like to thank everyone who gave suggestions.
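
For anyone wanting to repeat the read-length check mentioned in this post, a length distribution can be tallied from a FASTA file with a few lines of Python. This is a minimal sketch with a hypothetical function name, not the method actually used above.

```python
from collections import Counter


def length_distribution(lines):
    """Return a Counter mapping sequence length -> number of FASTA records."""
    counts = Counter()
    length = None
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if length is not None:
                counts[length] += 1
            length = 0
        elif length is not None:
            length += len(line)
    if length is not None:
        counts[length] += 1  # last record in the file
    return counts
```

Note that on a collapsed file every unique sequence is counted once, so the result describes distinct sequences and not total reads.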

**Priyank** · 05-03-2013, 08:30 AM

Hello epi and all,

What would be an alternative alignment strategy or tool when the reference sequences are shorter than the query?

miRNA seq analysis - large numbers of non-aligning reads