Seqanswers Leaderboard Ad

**dpryan** · 08-23-2015, 11:23 PM

No one can tell you why you only got a 75% alignment rate without looking at your data. Perhaps you have some notable adapter contamination or quality issues and didn't trim (or used end to end alignment). Perhaps you have bacterial or other contamination, have a look at some of the unaligned reads.

Generally HISAT or STAR would be used for RNAseq data, since tophat2 is just too slow. I should note that STAR is nice in that it will give you a summary of why it couldn't align reads (e.g., they're too short, too many possible hits, etc.).

**skly** · 08-26-2015, 05:59 AM

Originally posted by dpryan View Post

No one can tell you why you only got a 75% alignment rate without looking at your data. Perhaps you have some notable adapter contamination or quality issues and didn't trim (or used end to end alignment). Perhaps you have bacterial or other contamination, have a look at some of the unaligned reads.

Generally HISAT or STAR would be used for RNAseq data, since tophat2 is just too slow. I should note that STAR is nice in that it will give you a summary of why it couldn't align reads (e.g., they're too short, too many possible hits, etc.).

Thank you, dpryan！
Actually, these RNA sequencing data have be removed the adapter and be filtered low quality reads.
I suspected that HISAT caused the low mapping rate, firstly. So, for the same reference genome and the same input RNA sequencing data, I run tophat2 (v2.1.0). But the mapping rate of tophat2 is about 70%. So, I think that the low mapping rate maybe is not the result of alignment software, but reference genome I chose.
It is worth mentioning that the reference genome sequence downloaded from Ensembl is “dna_rm” type (masked genomic DNA). Maybe it is the cause. Next, I will test the “dna” type (unmasked) reference genome.

**dpryan** · 08-26-2015, 09:28 AM

Yup, using a hard masked reference would cause that. Just use either the soft masked or unmasked (the results will be the same).

**skly** · 08-27-2015, 10:35 PM

Originally posted by dpryan View Post

Yup, using a hard masked reference would cause that. Just use either the soft masked or unmasked (the results will be the same).

Thank you, dpryan. I replaced the masked reference with unmasked genome reference. The mapping rate was about 90%, but I found one another question. The discordant alignment rate was so high. The following are links. Could you help me to see the results? Thank you so much.

HISAT Discordant Alignment Rate of RNAseq data was so high - SEQanswers

http://seqanswers.com/forums/showthread.php?t=62304

Application of sequencing to RNA analysis (RNA-Seq, whole transcriptome, SAGE, expression analysis, novel organism mining, splice variants)

Ps: The HISAT results of soft masked reference were as same as the unmasked. And the parameters were defaults.

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Yesterday, 11:49 AM	0 responses 13 views 0 likes	Last Post by seqadmin Yesterday, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

Rna

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News