Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How to find DE genes using RPKM values?

    Hello,
    I have essentially 2 datasets, each showing the RPKM values for each gene. I want to compare the gene expressions from one set to another, and see which ones are up-regulated or down-regulated significantly comparing to the other.
    I have tried some primitive ways, such as dividing the two by each other and see if that ratio is greater than a fold change threshold..but this yields to me like 10000 genes, which is unlikely.

    Are there any suggestions on how to find differentially expressed genes based on RPKM values?

    btw I don't have access to the mapped reads data, so programs like DEGseq won't work for me. I only have access to the RPKM values

    Thanks!

  • #2
    Well, if you get to many genes, you have to make your fold-change threshold stricter. As this is all you have, you cannot do better.

    What would be a good threshold? Obviously one which reflects how much the RPKM value for a gene typically change between two samples from the same experimental condition. Only if you fold change is much stronger, you can assume that the change is due to the change in condition.

    As you don't have replicates, you can only guess what a good threshold is. In other words: No, you cannot get any reasonable results from just two samples.

    Simon

    Comment


    • #3
      Maybe you could use regression and outlier detection to find outliers in the plot of one data set vs. another (the points furthest from the regression line are the genes showing the biggest changes).

      You should probably look at some histograms and set a lower cutoff on the RPKMs you're willing to consider if you use fold change, because using fold change with really small RPKM values will be very noisy.

      Once you have a ranked list of genes by RPKM fold-change, you could test using qPCR to verify your results.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Essential Discoveries and Tools in Epitranscriptomics
        by seqadmin




        The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
        Yesterday, 07:01 AM
      • seqadmin
        Current Approaches to Protein Sequencing
        by seqadmin


        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
        04-04-2024, 04:25 PM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 04-11-2024, 12:08 PM
      0 responses
      58 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 10:19 PM
      0 responses
      54 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 09:21 AM
      0 responses
      46 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-04-2024, 09:00 AM
      0 responses
      55 views
      0 likes
      Last Post seqadmin  
      Working...
      X