Differential gene expression analysis with bioreplicates using EdgeR/DESeq

iceage

Junior Member

Join Date: Apr 2012

Posts: 1
- Share
- Tweet
#1

Differential gene expression analysis with bioreplicates using EdgeR/DESeq

07-04-2012, 05:08 AM

Hi everyone,

I have read and search a lot about this topic but can not find any solution to my problem. May you will be able to help me.

I am doing an intern-ship in bioinformatics for my master and I have to deal with RNA-seq data. I have 2 sets of experiments (A and B), both having 2 illumina runs of two stages (1 and 2) of a plant. A and B has not been done at the same time and the technology is a bit different, coming up with:
runs about 30M reads for A,
runs about 80M reads for B.

For a given stage the log(RPKM) of the replicates are very well correlated.

When I use EdgeR to obtain a common dispersion from the counts of each runs searching for differential expressed genes between each stage I obtain 0.86. Which seems far too big regarding the correlation of the RPKM. Moreover the number of differentially expressed genes is not consistent with our affymetrix knowledge (about 250 genes when we expected about 1000 genes).

I first think about filtering the list of genes from the one having a count per million below 1 in all conditions. I then obtain a dispersion of 0.76 : still to high...

I also think about getting variance stabilized data (with DESeq) to use with limma but it does not make sense if the samples are not paired, does it?

I am wondering if I am doing something wrong here and if there are any filtration/computation that I should have done to obtain a more consistent common dispersion.

Any idea would be really appreciate,

François
Tags: None
Gordon Smyth

Member

Join Date: Apr 2011

Posts: 91
- Share
- Tweet
#2

07-07-2012, 04:58 PM

A few points:

edgeR is a Bioconductor package, so more detailed help is available on the Bioconductor mailing list than on SEQanswers.

If you want to get your RNA-seq data into limma, the way to do this is use the voom() function of the limma package. See the limma User's Guide.

There are any number of things that might be causing problems with your analysis, but there's no to way know from the information that you give. Your dispersion values are very high indeed. Have you used an MDS plot to look at your data?
Comment

Previous template Next

Essential Discoveries and Tools in Epitranscriptomics

by seqadmin

The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
- Channel: Articles
04-22-2024, 07:01 AM
Current Approaches to Protein Sequencing

by seqadmin

Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
- Channel: Articles
04-04-2024, 04:25 PM

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Yesterday, 11:49 AM	0 responses 13 views 0 likes	Last Post by seqadmin Yesterday, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

Differential gene expression analysis with bioreplicates using EdgeR/DESeq

Comment

Latest Articles

ad_right_rmr

News