![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
DSN normalization | henry.wood | Sample Prep / Library Generation | 3 | 11-19-2014 01:36 AM |
Amplicon normalization | RCJK | Sample Prep / Library Generation | 3 | 06-20-2013 06:55 AM |
microRNA normalization | kenosaki | Bioinformatics | 2 | 08-29-2011 09:15 PM |
PCR product normalization | Palecomic | Sample Prep / Library Generation | 0 | 01-20-2011 02:50 AM |
cDNA normalization | Triticum | SOLiD | 0 | 09-15-2009 01:56 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Senior Member
Location: Rockville, MD Join Date: Jan 2009
Posts: 126
|
![]()
I'm hoping for some input from my statistically gifted brethren on this one:
I have sixteen RNA-Seq libraries which were aligned with TopHat. Counts of reads mapping to RefSeq genes were generated with htseq-count. My statistician collaborators need to normalize these counts for differences in sequencing depth. Here are my choices for the denominator:
Any help will be much appreciated, Thanks, Shurjo |
![]() |
![]() |
![]() |
#2 |
Senior Member
Location: Heidelberg, Germany Join Date: Feb 2010
Posts: 994
|
![]()
All of these suffer from the issue that a few strongly and differentially expressed genes can skew them. See the discussion in our paper and especially in Oshlack and Robinson's paper.
Our DESeq package offers (via its function 'estimateSizeFactors') a simple way to get a robust number for the denominator, which is explained, e.g., here. Simon |
![]() |
![]() |
![]() |
#3 | |
Senior Member
Location: Rockville, MD Join Date: Jan 2009
Posts: 126
|
![]() Quote:
Many thanks for your reply. I read both your and the Oshlack papers and agree with all of the points you make therein. However, in the context of my data, the following points would suggest to me that a simpler normalization strategy may be adequate:
Once again, thanks for your help and congratulations on your paper. Shurjo |
|
![]() |
![]() |
![]() |
#4 |
Senior Member
Location: Mexico Join Date: Mar 2011
Posts: 137
|
![]()
Hi Shurjo,
I have been having a tough time thinking this one out as well. I would appreciate any insight you may have gained by solving this problem. I too am torn between using the htseq count total, the unique mapped reads from tophat or all the alignments generated by tophat. Thanks for your help, Carmen |
![]() |
![]() |
![]() |
Thread Tools | |
|
|