  • Very low % of reads showing primary alignment to transcriptome

    Dear users,

    I have recently started analyzing RNA-Seq data for gene expression analysis, so I am quite new to the field.

    I used STAR to align the RNA-Seq reads (hg38, Ensembl release 94), running it with --quantMode TranscriptomeSAM.

    When I analyzed the quality of the BAM files (2 files: the genomic BAM and Aligned.toTranscriptome.bam) using BamQC, I got very different basic statistics, such as the percentage of primary alignments.

    In the genomic BAM, 96.95% of reads are primary alignments, whereas the transcriptome BAM has only 29.4% primary alignments. Does this low percentage mean the data quality is too poor for analyses like differential isoform and allele expression?
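For readers unsure what a "percent primary alignments" figure measures, here is a minimal sketch of how such a number can be derived from SAM FLAG values. It assumes a record counts as primary when it is mapped and neither the secondary (0x100) nor the supplementary (0x800) bit is set; whether BamQC computes its statistic exactly this way is an assumption. The function names and the toy flags are hypothetical and are not taken from the poster's data.

```python
# SAM FLAG bits, as defined in the SAM specification.
UNMAPPED = 0x4
SECONDARY = 0x100
SUPPLEMENTARY = 0x800


def is_primary(flag: int) -> bool:
    """True for a mapped record that is neither secondary nor supplementary."""
    return not (flag & (UNMAPPED | SECONDARY | SUPPLEMENTARY))


def percent_primary(flags) -> float:
    """Percentage of records whose FLAG marks them as primary alignments."""
    flags = list(flags)
    if not flags:
        return 0.0
    return 100.0 * sum(is_primary(f) for f in flags) / len(flags)


# Hypothetical toy data: one read reported three times (one primary record,
# two secondary records) plus one read reported once on the reverse strand.
toy_flags = [0, 256, 256, 16]
print(f"{percent_primary(toy_flags):.1f}% primary")  # prints "50.0% primary"
```

The denominator here is the number of records, not the number of reads, so a file in which the same read is reported several times will show a lower percentage even when every read has a primary alignment.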

    Thanks for your inputs.

  • #2
    Have you checked where the remaining reads (96.95% - 29.4%) are aligning, since they are not aligning to transcripts? Does your data contain rRNA? Inspecting the resulting BAM in IGV would be a great place to start.
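Besides looking in IGV, a quick tally of primary alignments per reference sequence can show where reads land. The sketch below works on SAM-format text lines (for a real BAM these would first have to be decoded, for example with `samtools view -h`); the function name and the toy records are hypothetical.

```python
from collections import Counter


def count_by_reference(sam_lines):
    """Count mapped primary alignments per reference name (SAM column 3)."""
    counts = Counter()
    for line in sam_lines:
        if line.startswith("@"):  # skip header lines
            continue
        fields = line.rstrip("\n").split("\t")
        flag, rname = int(fields[1]), fields[2]
        # Skip unmapped (0x4), secondary (0x100) and supplementary (0x800).
        if flag & (0x4 | 0x100 | 0x800):
            continue
        counts[rname] += 1
    return counts


# Hypothetical toy records with the 11 mandatory SAM columns.
toy_sam = [
    "@HD\tVN:1.6",
    "r1\t0\tchr22\t100\t255\t50M\t*\t0\t0\t*\t*",
    "r2\t16\tchr22\t300\t255\t50M\t*\t0\t0\t*\t*",
    "r2\t256\tchrM\t40\t0\t50M\t*\t0\t0\t*\t*",
    "r3\t0\tchrM\t10\t255\t50M\t*\t0\t0\t*\t*",
    "r4\t4\t*\t0\t0\t*\t*\t0\t0\t*\t*",
]
for name, n in count_by_reference(toy_sam).most_common():
    print(name, n)
```

Note that in a BAM written in transcript coordinates, column 3 holds transcript identifiers rather than chromosome names, so the same tally would be per transcript.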

    • #3
      Thanks for the response!

      I did try to visualize the two BAM files in IGV.

      When visualizing the genomic BAM (Aligned.out.bam), I can see that many reads fall within the exonic regions of genes, with correspondingly higher coverage. However, that coverage is missing from the transcriptome BAM file for the same region.

      I would expect coverage from the transcriptome BAM file to exist at least in genes whose annotation is present in the GTF file used for mapping. (The region visualized here is part of MYH9, the myosin 9 gene, ENSG00000100345.)

      As an additional note, when I analyzed my genomic BAM file using Picard tools, I observed that 45% of total input bases were classified as intronic bases, while 50% of total bases were categorized as mRNA bases.

      Is that the reason for the low percentage of reads in the transcriptome BAM?
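To make the intronic versus mRNA distinction concrete, here is a minimal sketch that classifies one aligned block against the exons of a single transcript. It is not Picard's implementation; the coordinates are hypothetical, intervals are half-open, and the exons are assumed not to overlap each other.

```python
def classify_block(start, end, exons):
    """Classify the half-open interval [start, end) against transcript exons.

    `exons` is a list of non-overlapping (start, end) half-open intervals.
    """
    exons = sorted(exons)
    # Number of bases of the block that fall inside any exon.
    overlap = sum(max(0, min(end, e) - max(start, s)) for s, e in exons)
    if overlap == end - start:
        return "exonic"
    if overlap > 0:
        return "partially exonic"
    # No exonic bases: inside the transcript span means intronic.
    if exons[0][0] <= start and end <= exons[-1][1]:
        return "intronic"
    return "intergenic"


# Hypothetical transcript with two exons separated by one intron.
toy_exons = [(100, 200), (300, 400)]
for block in [(120, 170), (220, 270), (180, 230), (500, 550)]:
    print(block, classify_block(*block, toy_exons))
```

If the transcriptome BAM only contains alignments that can be projected onto annotated transcripts (an assumption about the aligner's behaviour that is worth checking against the STAR manual), blocks in the "intronic" class would have no transcript coordinates and would be present only in the genomic BAM.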
