
**nanos** · 01-25-2017, 02:04 AM

Dear Eric, we analyze sRNA data very often, so I can give you some insight.
1) Adaptor trimming is really a must. With a minimum read length of 50 nt, the reads are longer than the sRNAs themselves, so you will always have adaptor remnants in the sequence.

2) Removing duplicated reads would be a problem. In most cases you will have sequenced the full length of the sRNA. Therefore, in contrast to RNA-seq, the reads from one sRNA do not start at randomly shifted positions; they are all identical (I hope this is understandable). Removing duplicates will most likely leave you with a very low and very similar count for all miRNAs, no matter how highly or how differently they were expressed. You can use adaptors containing random nucleotides and then use these 8 Ns in combination with the sRNA sequence to assess the duplication rate.

3) we use good old bowtie and it works perfectly fine for us (if there are different opinions on that one, any input is appreciated)

4) I guess the answer here largely depends on your question.
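To illustrate point 1, here is a minimal sketch of what 3' adaptor trimming does. It is not how cutadapt or any other trimmer is implemented: it only handles exact matches, and the adaptor sequence, the example insert, and the `min_overlap` parameter are assumptions for illustration. Use the adaptor sequence from your own library prep kit.

```python
# Assumed 3' adaptor sequence for illustration; replace with the one from your kit.
ADAPTER = "TGGAATTCTCGGGTGCCAAGG"


def trim_3prime_adapter(read: str, adapter: str = ADAPTER, min_overlap: int = 3) -> str:
    """Remove a 3' adaptor (exact match only) from a read.

    Handles two cases: the full adaptor somewhere in the read, and a
    partial adaptor (an adaptor prefix) at the very end of the read.
    """
    # Case 1: full adaptor present; keep only the sequence before it.
    pos = read.find(adapter)
    if pos != -1:
        return read[:pos]

    # Case 2: the read ends with the first `overlap` bases of the adaptor.
    # Try the longest possible overlap first.
    longest = min(len(adapter) - 1, len(read))
    for overlap in range(longest, min_overlap - 1, -1):
        if read.endswith(adapter[:overlap]):
            return read[:-overlap]

    # No adaptor found; return the read unchanged.
    return read


if __name__ == "__main__":
    insert = "TAGCTTATCAGACTGATGTTGA"  # toy 22-nt insert
    print(trim_3prime_adapter(insert + ADAPTER + "AAAA"))
```

A short `min_overlap` removes more adaptor remnants but will also clip a few genuine bases by chance. Real trimmers additionally tolerate mismatches, which this sketch does not.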

hope that helps as a start.
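The idea in point 2 of combining the random nucleotides with the sRNA sequence can be sketched roughly as follows. This assumes the 8 random bases have already been extracted from each read and paired with the trimmed sRNA sequence; the function names and the input format are hypothetical, and sequencing errors in the random bases are ignored.

```python
from collections import defaultdict


def umi_counts(reads):
    """Count raw reads and distinct random tags per sRNA sequence.

    reads: iterable of (random_tag, srna_sequence) tuples.
    Returns {srna_sequence: (raw_read_count, distinct_tag_count)}.
    """
    raw = defaultdict(int)
    tags = defaultdict(set)
    for tag, seq in reads:
        raw[seq] += 1
        tags[seq].add(tag)
    return {seq: (raw[seq], len(tags[seq])) for seq in raw}


def duplication_rate(counts):
    """Fraction of reads that repeat an already seen (tag, sequence) pair."""
    total = sum(n_raw for n_raw, _ in counts.values())
    unique = sum(n_tags for _, n_tags in counts.values())
    return 1 - unique / total if total else 0.0


if __name__ == "__main__":
    toy_reads = [
        ("AAAAAAAA", "seqA"),
        ("AAAAAAAA", "seqA"),  # same tag and same sequence: a likely PCR duplicate
        ("CCCCCCCC", "seqA"),  # same sequence, different tag: a separate molecule
        ("AAAAAAAA", "seqB"),
    ]
    counts = umi_counts(toy_reads)
    print(counts, duplication_rate(counts))
```

Identical sRNA sequences with different random tags are kept as separate molecules, which is exactly what plain duplicate removal would get wrong.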

**lre1234** · 01-25-2017, 05:07 AM

Hi,
I do a lot of short RNA-seq and here are some thoughts (but there are other ways of doing things that work well):

1. Agreed that adapter trimming is a must, or most of your reads will not map. We use cutadapt, which works really well.
2. Duplicate read removal is not needed, nor should it be done. You'll lose lots of things.

3. bowtie works well. I have also used BWA, which also seemed to work well, but I usually default to bowtie. As far as I understand, STAR wouldn't work for short RNAs, as it was designed for long RNA and specifically paired-end data (but don't quote me here, I may be wrong). STAR is our go-to aligner for long RNA.

4. As far as aligning goes: in my opinion, you should always align to the whole genome (GRCh37 or GRCh38, whichever you choose), and afterwards intersect with miRBase or some other database of interest. Keep in mind that the vast majority of miRNAs are 'unique' sequences in the genome and should align uniquely. But there are cases in which a miRNA sequence occurs more than once in the genome (e.g. miR-92a-3p, or miR-1302, whose sequence is found in 11 places in the genome). Mapping to the whole genome also lets you do additional things like novel miR discovery. Some people align to the miRBase sequences instead of the whole genome, but I personally think that is a bad idea and will give a false sense of what you are looking at. Essentially, you would be 'forcing' many reads to align to those regions when in fact they would align better to other places in the genome, especially when you allow a mismatch.
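As a rough sketch of the "align to the genome, then intersect with an annotation" idea in point 4, the snippet below assigns genome alignments to annotated features by coordinate overlap. Everything in it is an assumption for illustration: the tuple layouts, the feature names `mirA` and `mirB`, the coordinates, and in particular the choice to split a multi-mapping read equally among the distinct feature names it hits, which is only one of several possible conventions.

```python
from collections import defaultdict


def overlaps(a_start, a_end, b_start, b_end):
    """Half-open intervals [start, end) overlap if each starts before the other ends."""
    return a_start < b_end and b_start < a_end


def count_by_annotation(alignments, features):
    """Count reads per annotated feature name.

    alignments: iterable of (read_id, chrom, start, end, strand);
                a multi-mapping read appears once per alignment.
    features:   list of (chrom, start, end, strand, name).
    """
    hits = defaultdict(set)  # read_id -> names of overlapped features
    seen = set()             # every read that aligned somewhere
    for read_id, chrom, start, end, strand in alignments:
        seen.add(read_id)
        for f_chrom, f_start, f_end, f_strand, name in features:
            if (chrom == f_chrom and strand == f_strand
                    and overlaps(start, end, f_start, f_end)):
                hits[read_id].add(name)

    counts = defaultdict(float)
    for read_id in seen:
        names = hits.get(read_id)
        if not names:
            # Aligned to the genome but outside the annotation.
            counts["unannotated"] += 1.0
        else:
            # Split the read equally among distinct feature names.
            for name in names:
                counts[name] += 1.0 / len(names)
    return dict(counts)


if __name__ == "__main__":
    # Hypothetical annotation: mirA has two identical copies in the genome.
    features = [
        ("chr1", 100, 122, "+", "mirA"),
        ("chr2", 500, 522, "+", "mirA"),
        ("chr3", 10, 32, "-", "mirB"),
    ]
    alignments = [
        ("r1", "chr1", 100, 122, "+"),
        ("r1", "chr2", 500, 522, "+"),  # r1 maps to both copies of mirA
        ("r2", "chr3", 10, 32, "-"),
        ("r3", "chr5", 0, 22, "+"),     # aligns outside the annotation
    ]
    print(count_by_annotation(alignments, features))
```

Reads that end up as `unannotated` are the ones that would have been lost, or forced onto a miRNA, when aligning to miRBase sequences only. The nested loop is fine for a toy example; for real data an interval tree or a tool built for interval intersection would be the usual choice.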

Have fun with it. miRNAs do lots of interesting things and have many useful roles!

**manwar** · 11-21-2017, 03:44 AM

Which GTFs to use for annotation of sRNA?

Hello everyone,

Following on from ErikFas's query about using the normal human reference genome for sRNA-seq analysis, I wanted to ask whether a regular GTF/GFF from Ensembl or UCSC can be used to annotate sRNA, or whether there are sRNA-specific GTFs?

Thanks a lot!

Alignment of small RNA data
