Hi everyone!
I'm currently analyzing small-RNA-seq illumina data coming from 4 different samples. I have data coming from 2 biological replicates (so in total, 8 libraries).
We have found novel and conserved miRNA hairpins and defined the best miRNA/miRNA* duplexes for each of them.
Now I'm trying using DESeq and EdgeR for some differential expression analysis.
I have 2 different tissues, under stress and control condition. My comparison are: Tissue1stress vs Tissue1 control; Tissue2stress vs Tissue2 control; but also Tissue1stress vs Tissue2 stress; Tissue1 control vs Tissue2 stress.
I've big troubles in determining:
1. which tag to load in DESeq as raw data: should I consider all diffferent tags coming from the whole library (too much I think) or only tag mapping on putative hairpins? or uniquely tags mapping on the predicted duplexes?
2. Using tags coming from many putative hairpins, I tried running DESeq, but I got very bad SCV Plots, and what is most, many tags are not significant because of FDR value! Even if fold change is very high! Some times also ResVarA or resVarB are high too...
I can not understand if the problem is in my replicates (being real biological replicates they are not highly similar) or in the statistical model that doesn't fit miRNA data...
Some times the tag that is most expressed in a pre-miRNA and seems to have a high fold change, is not significant, while another tag on the same hairpin show a very low FDR and p-value...
Can anybody help me?
Thank you very much
Erica
I'm currently analyzing small-RNA-seq illumina data coming from 4 different samples. I have data coming from 2 biological replicates (so in total, 8 libraries).
We have found novel and conserved miRNA hairpins and defined the best miRNA/miRNA* duplexes for each of them.
Now I'm trying using DESeq and EdgeR for some differential expression analysis.
I have 2 different tissues, under stress and control condition. My comparison are: Tissue1stress vs Tissue1 control; Tissue2stress vs Tissue2 control; but also Tissue1stress vs Tissue2 stress; Tissue1 control vs Tissue2 stress.
I've big troubles in determining:
1. which tag to load in DESeq as raw data: should I consider all diffferent tags coming from the whole library (too much I think) or only tag mapping on putative hairpins? or uniquely tags mapping on the predicted duplexes?
2. Using tags coming from many putative hairpins, I tried running DESeq, but I got very bad SCV Plots, and what is most, many tags are not significant because of FDR value! Even if fold change is very high! Some times also ResVarA or resVarB are high too...
I can not understand if the problem is in my replicates (being real biological replicates they are not highly similar) or in the statistical model that doesn't fit miRNA data...
Some times the tag that is most expressed in a pre-miRNA and seems to have a high fold change, is not significant, while another tag on the same hairpin show a very low FDR and p-value...
Can anybody help me?
Thank you very much
Erica
Comment