Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • differential gene expression - Illumina

    Hi all,

    my apologies if this question has already been asked (I couldn't find it anywhere in either of the forums) and also that this one is a question that probably could also fit into the RNAseq forum.

    Last time I did differential gene expression work I used custom microarrays, but I've now been asked to switch to RNAseq for it and as I'm still fairly new to it, I'm a bit unsure how to tackle it.

    What I want to do is: find differentially expressed genes between different treatments. The species I'm using is a novel one (molluscan), but I already have a reference transcript (I made it with strand-specific, normalised data of multiple individuals that were control and treated, so it should have a wide range of genes in it).

    The questions I have are (and I think it links to some degree into the bioinformatics bit, so that's why I posted it here instead of the RNAseq forum):
    - should I use single reads or paired end reads
    - should I use strand-specific data
    - would 3Gb of data be enough or too much per sample to find meaningful results (i.e. how many reads should I have per sample for differential gene expression)
    - is there a good paper that one could recommend

    Sorry for all the questions. I've tried to find info online, but there is so much information and it's starting to become really confusing and also difficult to weed out papers that might not be worth following.

    Thank you so much for your help.

    Nicole

  • #2
    I recommend doing a pilot study with as much information as possible, which will give you data that you can remove information from to simulate your other conditions:

    * paired-end reads -- can simulate single-end reads by ignoring the linking and/or one read end
    * strand-specific data -- can simulate unstranded sequencing by ignoring alignment direction
    * rRNA depletion / sample enrichment [not mentioned in your post] -- would need to do at least two tests (one with, one without) to compare bias / rRNA contamination, because this tends to be organism/kit specific
    * spike-in RNA controls [not mentioned in your post] -- can simulate data without the controls by ignoring spike-in read counts
    * use only a few samples -- can simulate high-numbers of samples / multiplexing by randomly removing reads [to check e.g. if 3Gb is enough per sample]

    edit: Most of the time you have a fixed cost, and need to choose between spending more money per sample to get more detail and analysing more samples to get more variance and statistical robustness
    Last edited by gringer; 10-29-2013, 08:11 PM.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Essential Discoveries and Tools in Epitranscriptomics
      by seqadmin




      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
      04-22-2024, 07:01 AM
    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, Yesterday, 11:49 AM
    0 responses
    13 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-24-2024, 08:47 AM
    0 responses
    16 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    61 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    60 views
    0 likes
    Last Post seqadmin  
    Working...
    X