cross-species data - questions about normalization

trelek2

Junior Member

Join Date: May 2013

Posts: 2
#1

05-23-2013, 06:14 AM

Hi,

I have some data from various samples (cell types) in different species.
I want to compare and analyze gene expression variability across the different species.

I've plotted the average expression (tags per million, 'tpm') for each sample. The average is more or less the same for all samples from a given species, but it varies greatly across species: for example, all human samples are at about 20 tpm and all mouse samples at about 25 tpm, but all dog samples are at about 100 tpm.
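A small arithmetic sketch that may be relevant to the observation above. It assumes the average was taken over all features (tags or genes) of each species and that tpm values sum to one million per sample; neither is confirmed in the post. Under those assumptions, the per-sample mean is just 1,000,000 divided by the number of features, so it would reflect the size of the feature set rather than expression levels. All counts below are made up.

```python
def to_tpm(counts):
    """Scale raw tag counts so that they sum to one million."""
    total = sum(counts)
    return [c * 1_000_000 / total for c in counts]


def mean(values):
    return sum(values) / len(values)


# Made-up raw counts with the same pattern but a different number of features.
sample_many_features = [10, 30, 60, 100] * 12_500  # 50,000 features
sample_few_features = [10, 30, 60, 100] * 2_500    # 10,000 features

mean_many = mean(to_tpm(sample_many_features))
mean_few = mean(to_tpm(sample_few_features))
print(mean_many, mean_few)
```

If this applies, restricting every species to the same set of orthologous genes before computing tpm and averaging would be one way to check it.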

I'm guessing this is because the data is not normalized and is therefore not comparable across species until a normalization has been performed.

I tried several recently developed normalization methods (RLE and TMM as implemented in edgeR, as well as DESeq), normalizing the data from the different species (matched by Entrez ID) separately for each cell type. However, this still does not make the average expression much more similar across species, and the averages of samples from the same species have now become much more dissimilar.
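For reference, the idea behind RLE (median-of-ratios) scaling can be sketched in a few lines. This is a simplified pure-Python illustration, not the actual edgeR or DESeq code; the function name `rle_size_factors` and the toy counts are hypothetical. Methods of this kind assume that most genes are not differentially expressed between the samples being compared, which may not hold across species.

```python
import math
from statistics import median


def rle_size_factors(counts):
    """Median-of-ratios size factors.

    counts: list of samples, each a list of raw counts in the same gene order.
    Returns one scaling factor per sample.
    """
    n_genes = len(counts[0])
    # Keep only genes with a positive count in every sample so logs are defined.
    usable = [g for g in range(n_genes) if all(s[g] > 0 for s in counts)]
    # Pseudo-reference: log geometric mean of each usable gene across samples.
    log_ref = {g: sum(math.log(s[g]) for s in counts) / len(counts) for g in usable}
    factors = []
    for s in counts:
        log_ratios = [math.log(s[g]) - log_ref[g] for g in usable]
        factors.append(math.exp(median(log_ratios)))
    return factors


# Toy data: sample_b is sequenced twice as deeply as sample_a.
sample_a = [100, 200, 300, 0, 50]
sample_b = [200, 400, 600, 10, 100]
factors = rle_size_factors([sample_a, sample_b])
normalized = [[c / f for c in s] for s, f in zip([sample_a, sample_b], factors)]
print(factors)
```

After dividing by the factors, genes that differ only by sequencing depth end up at the same value in both samples.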

The only samples that reached a similar average expression level across the different species were the Universal RNA samples, which are a mix of different tissues rather than a specific cell type.

I'm really confused. Does the above mean the normalization is fine and I shouldn't worry that average expression values differ between species, or does it imply that something is wrong?

Maybe I should instead normalize by ignoring the gene IDs and looking only at the whole expression profile (negative binomial)? That is, sort the genes in each sample by expression value and apply a different normalization factor at each expression rank, whichever gene happens to sit there. I would end up with two negative binomial curves that look the same, but with the genes on the x axis ordered differently depending on the species.
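The rank-based idea described above resembles what is usually called quantile normalization. A minimal pure-Python sketch follows, assuming every sample has the same number of features (for example, after restricting to a shared ortholog set). The function name and values are hypothetical, and ties are broken by position for simplicity.

```python
def quantile_normalize(samples):
    """Force all samples to share one distribution of values.

    samples: list of equal-length lists of expression values.
    Returns new lists; each gene keeps its rank within its own sample.
    """
    n = len(samples[0])
    sorted_samples = [sorted(s) for s in samples]
    # Reference distribution: mean of the k-th smallest value across samples.
    reference = [sum(s[k] for s in sorted_samples) / len(samples) for k in range(n)]
    result = []
    for s in samples:
        order = sorted(range(n), key=lambda i: s[i])  # gene indices by rank
        out = [0.0] * n
        for rank, i in enumerate(order):
            out[i] = reference[rank]
        result.append(out)
    return result


# Made-up values for two samples on very different scales.
human = [5.0, 2.0, 3.0, 4.0]
dog = [40.0, 10.0, 30.0, 20.0]
qn_human, qn_dog = quantile_normalize([human, dog])
print(qn_human, qn_dog)
```

Note that this makes the averages identical by construction, so it cannot tell you whether a remaining difference between species is technical or biological.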

Thanks.
Tags: edger, normalization, rle, species, tmm
mbblack

Senior Member

Join Date: Aug 2009

Posts: 245
#2

05-23-2013, 07:29 AM

Does each species have its own controls? In other words, can each species be analyzed as an independent experiment?

If so, you could normalize each independently, analyze differential expression in each independently, and then simply compare homologous genes among the significantly differentially expressed genes in each species. I would suggest defining significance in each of the three independent analyses by simultaneously applying a statistical cutoff and a fold-change cutoff, to get the most robust differential gene list for each.

That's how I've dealt with comparisons between, for example, rat and human hepatocytes exposed to dioxin: I set up the experiments separately, analyzed each species as an independent differential gene expression experiment, and then compared the analyzed gene lists for significant homologous genes in each.
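The per-species workflow described above could be sketched roughly as follows. This is only an illustration under stated assumptions: the differential expression results are taken as already computed per species, and the gene names, values, cutoffs, homolog map, and the `significant_genes` helper are all hypothetical, not taken from any real analysis.

```python
def significant_genes(results, max_padj=0.05, min_abs_log2fc=1.0):
    """Apply a statistical and a fold-change cutoff at the same time.

    results: dict mapping gene -> (log2 fold change, adjusted p-value).
    """
    return {
        gene
        for gene, (log2fc, padj) in results.items()
        if padj <= max_padj and abs(log2fc) >= min_abs_log2fc
    }


# Made-up results from two independent, species-specific analyses.
rat_results = {
    "rat_gene_a": (5.2, 1e-10),
    "rat_gene_b": (3.1, 1e-6),
    "rat_gene_c": (0.2, 0.8),
    "rat_gene_d": (1.4, 0.2),
}
human_results = {
    "HUMAN_GENE_A": (4.8, 1e-9),
    "HUMAN_GENE_B": (0.6, 0.01),   # significant p-value, but small fold change
    "HUMAN_GENE_C": (-0.1, 0.9),
    "HUMAN_GENE_D": (1.6, 0.001),
}
# Hypothetical rat-to-human homolog map.
rat_to_human = {
    "rat_gene_a": "HUMAN_GENE_A",
    "rat_gene_b": "HUMAN_GENE_B",
    "rat_gene_c": "HUMAN_GENE_C",
    "rat_gene_d": "HUMAN_GENE_D",
}

rat_sig = significant_genes(rat_results)
human_sig = significant_genes(human_results)
# Translate rat hits to human identifiers, then intersect the two lists.
shared = {rat_to_human[g] for g in rat_sig if g in rat_to_human} & human_sig
print(sorted(shared))
```

Because each species is only ever compared against its own controls, the absolute expression scale of each species never has to be matched to the others.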

Otherwise, you have a very complex normalization situation where you may have to perform independent normalizations for each species followed by some form of scaling correction/normalization for the meta-analysis of all three.

Last edited by mbblack; 05-23-2013, 07:32 AM.

Michael Black, Ph.D.
ScitoVation LLC. RTP, N.C.