
07-16-2014, 01:54 AM  #1
Member
Location: Istanbul, Turkey Join Date: Oct 2012
Posts: 28

Variance - basic statistics
Hello all,
I'm sorry for this very naive and basic question, but I am trying to understand a couple of graphs (dispersion, M vs. A, etc.), and I am a little confused about the term "variance". According to the formula, variance is the average of the squared differences from the mean. So can I say that genes with high FPKM values tend to have "high variance" and are also more dispersed than lowly expressed genes? (I guess high variance at high FPKM is not a problem when you fit a negative binomial distribution to calculate the significance of differential expression.)

But this also sounds odd, because without thinking about the math, I am tempted to say that lowly expressed genes are generally not significant in differential expression analyses because of the "variability" between their FPKM values. I guess there is a misconception here on my part, as I am thinking of variability as a "percentage". Moreover, this variability defines the shape of the negative binomial distribution used for statistical testing (whether it is more squeezed or more spread out), right? :/

Sorry for asking about basic statistics. I would appreciate it if someone could explain briefly. Thanks!
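To make the two notions of "variability" in the question concrete, here is a minimal Python sketch. It is not from the thread: it assumes the usual negative binomial parametrization for read counts, variance = mean + dispersion * mean^2, and the dispersion value 0.1 and the three mean counts are made-up numbers for illustration. It works on counts rather than FPKM, since FPKM values are already normalized.

```python
import math


def sample_variance(values):
    """Average squared difference from the mean (n - 1 denominator)."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / (len(values) - 1)


def nb_variance(mean, dispersion):
    """Negative binomial variance under the assumed parametrization."""
    return mean + dispersion * mean ** 2


def coefficient_of_variation(mean, dispersion):
    """Standard deviation relative to the mean ("variability in percentage")."""
    return math.sqrt(nb_variance(mean, dispersion)) / mean


dispersion = 0.1  # hypothetical value, chosen only for illustration
for mean in (5, 50, 5000):  # hypothetical low, medium and high mean counts
    variance = nb_variance(mean, dispersion)
    cv = coefficient_of_variation(mean, dispersion)
    print(f"mean={mean:>5}  variance={variance:>12.1f}  CV={cv:.3f}")
```

Under these assumptions the absolute variance grows with the mean, while the relative variability (CV) is largest for the low-count gene and levels off near sqrt(dispersion) for highly expressed genes, which may be the "percentage" intuition described above.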
07-17-2014, 09:36 AM  #2
Member
Location: Nebraska Join Date: Oct 2011
Posts: 25

Hello everyone, I have a statistics question:
I have a data set:
1 1 1 0 1 1 0 1 0 0 0 0 0 0 1 0 0
I want to measure how well the 1's accrue at the top of the list, that is, how well the 1's and 0's separate. The above list should have a high value compared to a random one:
1 0 1 0 1 0 1 0 ...
What kind of test do I need for this? Thanks!
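One possible way to put a number on "how well the 1's accrue at the top" is a rank-based score: the fraction of (1, 0) pairs in which the 1 appears before the 0. This is the Mann-Whitney U statistic divided by the number of pairs (equivalently, an AUC). It is only a sketch of one option, not something proposed in this thread, and the function name `separation_score` is made up for illustration.

```python
def separation_score(labels):
    """Fraction of (1, 0) pairs where the 1 is ranked above the 0.

    1.0 means all 1's come first, 0.0 means all 0's come first,
    and a randomly ordered list is expected to score about 0.5.
    """
    ones = sum(labels)
    zeros = len(labels) - ones
    if ones == 0 or zeros == 0:
        raise ValueError("need at least one 1 and one 0")

    zeros_remaining = zeros  # zeros not yet seen while walking down the list
    pairs_in_order = 0
    for label in labels:
        if label == 1:
            # every zero still below this 1 forms a correctly ordered pair
            pairs_in_order += zeros_remaining
        else:
            zeros_remaining -= 1
    return pairs_in_order / (ones * zeros)


# The list from the post above
observed = [1, 1, 1, 0, 1, 1, 0, 1, 0, 0, 0, 0, 0, 0, 1, 0, 0]
print(separation_score(observed))
```

For the list in the post, 58 of the 70 possible pairs are in order, so the score is about 0.83. To turn the score into a p-value, a Mann-Whitney U test on the ranks of the 1's versus the 0's, or a permutation test that shuffles the list many times, would be natural follow-ups.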
07-25-2014, 06:45 AM  #3
Senior Member
Location: Budapest Join Date: Mar 2010
Posts: 329

Hi sazz!
Variance in gene expression does not depend on FPKM. It is just a statistical measure of the spread across replicates.
csmatyi: I think you need a chi-square test.

