Go Back   SEQanswers > Bioinformatics > Bioinformatics

Similar Threads
Thread Thread Starter Forum Replies Last Post
Consistent low MiSeq cluster density and PhiX alignment with high quality reads (PF%) mamor Illumina/Solexa 14 09-23-2019 11:19 AM
Transcriptome assembly: Low GC, short contigs, low read alignment jmah Bioinformatics 0 03-15-2017 05:47 PM
Low GC, fragments, low read alignment jmah Bioinformatics 0 03-15-2017 04:18 PM
Not primary vs Supplementary alignment dnusol Bioinformatics 3 01-24-2014 04:49 AM
Alignment/transcriptome assembly/differential expression analysis with 40bp reads? heytreeful Illumina/Solexa 4 03-11-2013 09:54 AM

Thread Tools
Old 08-29-2019, 11:13 PM   #1
Junior Member
Location: Asia

Join Date: Feb 2019
Posts: 2
Default Very low % of reads showing primary alignment to transcriptome

Dear users,

I have recently started analyzing RNA-Seq data for gene expression analysis, hence am quite new to the field.

I have used STAR for aligning RNA-Seq reads (hg38, Ensemble, release 94) using --quantMode TranscriptomeBAM for STAR run.

When I analyzed the quality of BAM files (2 files - genomic BAM and Aligned.toTranscriptome.bam) using BamQC, I get widely different results in terms of basic statistics like primary alignments

In whole in genomic BAM, total 96.95 reads fall in primary alignment, transcriptome BAM has only 29.4% primary aligned reads. Does this low % means the data quality is bad for doing analysis like differential isoform and allele expression?

Thanks for your inputs.
Asmita is offline   Reply With Quote
Old 08-30-2019, 12:05 AM   #2
Senior Member
Location: East Coast USA

Join Date: Feb 2008
Posts: 7,015

Have you tried to see where (96-95-29.4) reads are aligning (since they are not aligning to transcripts)? Does your data have rRNA present? Inspecting the resulting BAM using IGV would be a great place to start.
GenoMax is offline   Reply With Quote
Old 08-30-2019, 03:00 AM   #3
Junior Member
Location: Asia

Join Date: Feb 2019
Posts: 2

Thanks for the response!

I did try to visualize the two BAM files in IGV.

When visualizing genomic BAM (Aligned.out.bam), I can see that many reads are falling into the exonic region of genes, with corresponding higher coverage, however, the coverage is missing in transcriptomics BAM file for the same region.

I would expect the coverage from transcriptomics BAM file to exist at least in genes whose annotation is present in the GTF files used for mapping. (Here the visualization is over part of MYH9, myosin 9 gene, ENSG00000100345)

As an additional note, when I analyzed my BAM (genomic) file using PICARD tools, I observed that 45% of total input bases were classified as intronic bases, while 50% of total bases were categorized as mRNA bases.

Is that the reason why there is low %reads in transcriptome BAM?
Asmita is offline   Reply With Quote

bioinformatics, rna-seq analysis, star aligner

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

All times are GMT -8. The time now is 01:22 AM.

Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2020, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO