  • differential gene expression - Illumina

    Hi all,

    My apologies if this question has already been asked (I couldn't find it in either of the forums). It could probably also fit into the RNAseq forum.

    The last time I did differential gene expression work I used custom microarrays. I've now been asked to switch to RNAseq, and as I'm still fairly new to it, I'm a bit unsure how to tackle it.

    What I want to do is find differentially expressed genes between different treatments. The species I'm using is a novel one (molluscan), but I already have a reference transcriptome. I built it from strand-specific, normalised data from multiple control and treated individuals, so it should cover a wide range of genes.

    My questions are below (I think they link to some degree into the bioinformatics side, which is why I posted here instead of in the RNAseq forum):
    - Should I use single reads or paired-end reads?
    - Should I use strand-specific data?
    - Would 3Gb of data per sample be enough, or too much, to find meaningful results (i.e. how many reads should I have per sample for differential gene expression)?
    - Is there a good paper that you could recommend?

    Sorry for all the questions. I've tried to find info online, but there is so much of it that it's becoming really confusing, and it's difficult to weed out papers that might not be worth following.

    Thank you so much for your help.

    Nicole

  • #2
    I recommend doing a pilot study that captures as much information as possible. You can then discard parts of that data to simulate the other, cheaper conditions:

    * paired-end reads -- can simulate single-end reads by ignoring the linking and/or one read end
    * strand-specific data -- can simulate unstranded sequencing by ignoring alignment direction
    * rRNA depletion / sample enrichment [not mentioned in your post] -- would need at least two tests (one with, one without) to compare bias / rRNA contamination, because this tends to be organism/kit-specific
    * spike-in RNA controls [not mentioned in your post] -- can simulate data without the controls by ignoring spike-in read counts
    * use only a few samples -- can simulate higher numbers of samples / multiplexing (i.e. less data per sample) by randomly removing reads [to check e.g. if 3Gb is enough per sample]
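
The "remove information to simulate" idea above can be sketched in a few lines of Python. This is only a toy illustration, not a recommended pipeline: the function names (`subsample_pairs`, `to_single_end`, `to_unstranded`) and the in-memory data layout are made up for this example, and in practice you would work on FASTQ/BAM files with established tools.

```python
import random
from typing import Dict, List, Tuple

# Hypothetical in-memory representation: (read 1 sequence, read 2 sequence).
ReadPair = Tuple[str, str]


def subsample_pairs(pairs: List[ReadPair], fraction: float, seed: int = 0) -> List[ReadPair]:
    """Keep each read pair with probability `fraction` (simulates lower depth).

    Both mates of a pair are kept or dropped together, so pairing is preserved.
    """
    rng = random.Random(seed)  # fixed seed so the subsample is reproducible
    return [pair for pair in pairs if rng.random() < fraction]


def to_single_end(pairs: List[ReadPair]) -> List[str]:
    """Simulate single-end sequencing by ignoring the second read of each pair."""
    return [read1 for read1, _read2 in pairs]


def to_unstranded(stranded_counts: Dict[str, Tuple[int, int]]) -> Dict[str, int]:
    """Simulate unstranded data by summing sense and antisense counts per gene."""
    return {gene: sense + antisense
            for gene, (sense, antisense) in stranded_counts.items()}


if __name__ == "__main__":
    # Toy data only: 1000 fake read pairs and two fake genes.
    toy_pairs = [(f"READ{i}_1", f"READ{i}_2") for i in range(1000)]
    half_depth = subsample_pairs(toy_pairs, fraction=0.5, seed=42)
    print(len(half_depth), "of", len(toy_pairs), "pairs kept")
    print(to_single_end(half_depth)[:3])
    print(to_unstranded({"geneA": (90, 10), "geneB": (5, 45)}))
```

Rerunning the differential expression analysis on each reduced data set shows which of the extras (pairing, strandedness, depth) actually change the results for your organism.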

    edit: Most of the time you have a fixed budget, and need to choose between spending more money per sample to get more detail, and analysing more samples to better capture biological variance and gain statistical robustness.
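
To make that trade-off concrete, here is a small back-of-the-envelope calculation. All costs are invented placeholders (not real prices), and `samples_affordable` is a hypothetical helper; plug in quotes from your own sequencing provider.

```python
def samples_affordable(budget: int, prep_cost_per_sample: int,
                       cost_per_gb: int, gb_per_sample: int) -> int:
    """Number of samples that fit in a fixed budget at a given depth.

    Assumes cost per sample = library prep + (sequencing cost per Gb * Gb per sample).
    """
    cost_per_sample = prep_cost_per_sample + cost_per_gb * gb_per_sample
    return budget // cost_per_sample


if __name__ == "__main__":
    # Placeholder numbers for illustration only.
    budget, prep, per_gb = 10_000, 100, 50
    for depth_gb in (1, 3, 6):
        n = samples_affordable(budget, prep, per_gb, depth_gb)
        print(f"{depth_gb} Gb per sample -> {n} samples")
```

Halving the depth does not double the number of samples, because the per-sample library prep cost stays the same.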
    Last edited by gringer; 10-29-2013, 08:11 PM.
