![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Expression quantification/differential expression gene analysis by RNA-Seq | chenjy | Bioinformatics | 12 | 08-02-2013 03:06 AM |
Differential gene expression analysis | colaneri | Bioinformatics | 15 | 06-14-2013 05:37 AM |
differential gene expression and variance issues | amcloon | RNA Sequencing | 5 | 05-07-2013 02:07 AM |
Differential gene expression on metatranscriptome data | Tka | Metagenomics | 3 | 04-16-2013 05:43 AM |
Differential gene expression of gene clusters | anjana.vr | RNA Sequencing | 1 | 10-28-2010 10:33 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Junior Member
Location: Brisbane Join Date: Aug 2013
Posts: 5
|
![]()
Hi all,
my apologies if this question has already been asked (I couldn't find it anywhere in either of the forums) and also that this one is a question that probably could also fit into the RNAseq forum. Last time I did differential gene expression work I used custom microarrays, but I've now been asked to switch to RNAseq for it and as I'm still fairly new to it, I'm a bit unsure how to tackle it. What I want to do is: find differentially expressed genes between different treatments. The species I'm using is a novel one (molluscan), but I already have a reference transcript (I made it with strand-specific, normalised data of multiple individuals that were control and treated, so it should have a wide range of genes in it). The questions I have are (and I think it links to some degree into the bioinformatics bit, so that's why I posted it here instead of the RNAseq forum): - should I use single reads or paired end reads - should I use strand-specific data - would 3Gb of data be enough or too much per sample to find meaningful results (i.e. how many reads should I have per sample for differential gene expression) - is there a good paper that one could recommend Sorry for all the questions. I've tried to find info online, but there is so much information and it's starting to become really confusing and also difficult to weed out papers that might not be worth following. Thank you so much for your help. Nicole |
![]() |
![]() |
![]() |
#2 |
David Eccles (gringer)
Location: Wellington, New Zealand Join Date: May 2011
Posts: 843
|
![]()
I recommend doing a pilot study with as much information as possible, which will give you data that you can remove information from to simulate your other conditions:
* paired-end reads -- can simulate single-end reads by ignoring the linking and/or one read end * strand-specific data -- can simulate unstranded sequencing by ignoring alignment direction * rRNA depletion / sample enrichment [not mentioned in your post] -- would need to do at least two tests (one with, one without) to compare bias / rRNA contamination, because this tends to be organism/kit specific * spike-in RNA controls [not mentioned in your post] -- can simulate data without the controls by ignoring spike-in read counts * use only a few samples -- can simulate high-numbers of samples / multiplexing by randomly removing reads [to check e.g. if 3Gb is enough per sample] edit: Most of the time you have a fixed cost, and need to choose between spending more money per sample to get more detail and analysing more samples to get more variance and statistical robustness Last edited by gringer; 10-29-2013 at 08:11 PM. |
![]() |
![]() |
![]() |
Thread Tools | |
|
|