I am interested in comparing samples of type A to samples of type B. There are several RNA-seq datasets in GEO that have either type A or type B, but not both. Is it possible to take samples from two different datasets and compare them? I am guessing most of the observed differences will be between the two labs and not between the two conditions. Is that a reasonable concern? Is there a proper way to deal with that?
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
I'd be very hesitant to do such a comparison due to the uncontrolled batch effect. I suppose you could try to estimate what sort of batch effect exists by looking at the effect between only the B or A samples, but I'm not sure how well that would work in practice.
-
Since with sequencing, there is much more confidence in the observations, which have a built in noise check during the mapping, and it is necessary to account for the overall differences in reads anyway comparing samples from different labs should be straight forward, unlike micro arrays, where batch effects were known to dominate in some cases requiring a high number of technical replicates.
After taking overall read depth into account, you would have to look for subtle effects like bias in GC annealing temperatures or PCR duplicates. Verifying the results in the lab may be difficult though, as with any meta-analysis.
Comment
-
There are still batch effects in RNAseq, though they're certainly less of an issue than in the microarray days. I happen to be looking at all of the publicly available mouse hippocampus RNAseq datasets at the moment and decided to create a little heatmap of the variance stabilised data, which you can find below. The datasets are color coded the same on the rows and columns to make life easier (there are 160 samples in the heatmap, so the labels are illegible). While there are obvious experimental differences in some of these datasets, there's still a lab batch-effect. Having said that, if you're interested in different organs or something like that then the difference due to that will be vastly greater than the batch-effect, so rskr's advise should hold-up quite well.
BTW, some of the red-colored samples are technical replicates that I never bothered merging, which is why they cluster the way they do.
Comment
Latest Articles
Collapse
-
by seqadmin
The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...-
Channel: Articles
04-22-2024, 07:01 AM -
-
by seqadmin
Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...-
Channel: Articles
04-04-2024, 04:25 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Today, 11:49 AM
|
0 responses
12 views
0 likes
|
Last Post
by seqadmin
Today, 11:49 AM
|
||
Started by seqadmin, Yesterday, 08:47 AM
|
0 responses
16 views
0 likes
|
Last Post
by seqadmin
Yesterday, 08:47 AM
|
||
Started by seqadmin, 04-11-2024, 12:08 PM
|
0 responses
61 views
0 likes
|
Last Post
by seqadmin
04-11-2024, 12:08 PM
|
||
Started by seqadmin, 04-10-2024, 10:19 PM
|
0 responses
60 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 10:19 PM
|
Comment