Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • fagire
    Junior Member
    • Jul 2011
    • 2

    Reference for differential gene expression analysis

    Hi all,

    I plan to use the DESeq2 package for differential expression analysis between two conditions and I'm wondering which transcriptome/s (consensus or singles) should I use as reference. I don't have a genome for my specie.

    The samples (regardless of the condition) presents different numbers of genes and isoforms (as the annotation of the contigs assembled suggest). Some people suggest generate a single assembly based on combining all reads across all samples as inputs and then align the reads separately back to the single ("consensus") assembly for downstream analysis of differential expression. But I really don't know if it is the best way to proceed. For example, for those samples that have only one isoform - for a determined gene - the read count would be overestimated if the consensus transcriptome included other isoforms (with shared exons) of the same gene from other samples. I don't want to discard the multimapping reads.

    The other option simply consists on aligning the reads of each sample with its corresponding assembly. I do not know to what extent the heterogeneity of the single assemblies (distinct number of genes and isoforms, differences on transcripts lengths, etc) can affect the differential gene expression analysis.

    The third option I have in mind is to use the exons obtained from all the samples as reference (I can obtain them from my transcripts and using exons of a related specie). I think that this could be the best option.

    Which option do you think would be the best for the differential expression analysis with DESeq (or DEXeq)?

    Thanks in advance,

    Facundo
  • dpryan
    Devon Ryan
    • Jul 2011
    • 3478

    #2
    Without a reference of some sort, the best tactic is to (1) assemble all of the reads into a transcriptome, (2) align each sample against that and (3) use something like FluxCapacitator/RSEM/etc. to get expected counts. The results can be used with limma after processing with voom(). You'll not use DESeq2 or DEXSeq with your dataset unless you can get a decently high percentage of uniquely aligned reads (unlikely, but possible).

    If you can't know with much certainty which transcripts belong to which gene (they could be paralogs!), then you can't look at differential exon usage.

    Comment

    • fagire
      Junior Member
      • Jul 2011
      • 2

      #3
      Many thanks dpryan.

      I will try with limma and voom. I guess that the biases produced for use a non-real reference aren't a big problem (or less pronounced if I use DESeq with multimapping reads).

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Pathogen Surveillance with Advanced Genomic Tools
        by seqadmin




        The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
        03-24-2025, 11:48 AM
      • seqadmin
        New Genomics Tools and Methods Shared at AGBT 2025
        by seqadmin


        This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

        The Headliner
        The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
        03-03-2025, 01:39 PM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 03-20-2025, 05:03 AM
      0 responses
      49 views
      0 reactions
      Last Post seqadmin  
      Started by seqadmin, 03-19-2025, 07:27 AM
      0 responses
      57 views
      0 reactions
      Last Post seqadmin  
      Started by seqadmin, 03-18-2025, 12:50 PM
      0 responses
      50 views
      0 reactions
      Last Post seqadmin  
      Started by seqadmin, 03-03-2025, 01:15 PM
      0 responses
      200 views
      0 reactions
      Last Post seqadmin  
      Working...