Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Underdispersed mapped RNA-Seq reads

    Hello all,

    I am working on analyzing RNA-Seq samples for differential expression and am perplexed by the distribution of my data, which appears to be under- rather than overdispersed relative to a Poisson distribution.

    For background:
    1. I created a preliminary de Novo assembly of my RNA-Seq transcripts using CLC workbench (reads were first trimmed, etc, reads were high quality and looked good using diagnostics in FastQC).
    2. I mapped my reads to the de Novo assembly using the RNA-Seq mapping function in CLC workbench.
    3. I imported my mapped read counts into R, to analyze the differential expression using edgeR

    In edgeR, my common dispersion parameter is ~2.5. I checked the MA Plots and noticed how 'odd' my data distributions appeared. The attached plot is an example of a comparison between lanes using an MA plot (all between lane comparisons look similar regardless of experiment group).

    Histograms similarly confirm that aside from 'zeros', my most about count data (by lane) is in the thousands of reads per de Novo contig.

    My questions are as follows:
    1. Have others encountered similar data distributions?
    2. If so, are there any programs to analyze underdispersed (or whatever this is) data? DEGseq seems like it may be 'closer' as it assumes a Poisson distribution, but this still isn't the correct distribution. I've also been reading about using GLM for generalized Poisson distributions.
    3. Could this all be caused by a suboptimal de Novo assembly? Or by RNA degradation prior to sequencing?

    Any thoughts would be greatly appreciated! I am currently working to generate a 'best' assembly using multiple kmer lengths in Oases-Velvet, and to annotate some of the very abundant contigs for biological relevance. Any other suggestions regarding issues related to contamination, assembly,etc?

    Thanks so much from a beginner!!
    Celli
    Attached Files

  • #2
    It's quite sure that this underdispersion is not genuine but rather a signal that something went severely wrong. Please give a few more details on your design (what are you comparing? how many replicates?) and your analysis steps.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Essential Discoveries and Tools in Epitranscriptomics
      by seqadmin




      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
      04-22-2024, 07:01 AM
    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, Today, 11:49 AM
    0 responses
    12 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, Yesterday, 08:47 AM
    0 responses
    16 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    61 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    60 views
    0 likes
    Last Post seqadmin  
    Working...
    X