Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Analysing RNA-Seq from Scratch?

    Hello, I am trying to find DE genes under two different conditions (I have three replicates for each). So far I have followed the TopHat->Cufflinks->Cuffdiff workflow, but the found some of the genes that CuffDiff said were 'significantly differentially expressed' (using mySigGeneIds<-getSig(cuff_data,alpha=0.05,level='genes') were higher in replicate 1 WT than replicate 1 KO, but lower in replicate 1 WT than replicate 2 KO (as judged by FPKM). Unless I'm missing some very basic statistical trick, this says to me that the difference between WT and KO conditions is not reproducible, despite CuffDiff deeming it significant.

    After this I tried to find out how CuffDiff calculates variance, p-values, basic things etc, but couldn't find any equations. So I stopped trusting CuffDiff, and was thinking about trying to use R to calculate the variance p-value for each gene (using the FPKM values that CuffDiff provides for each replicate), but was wondering how much programming this would actually entail? I don't want to do anything fancy, just see whether there are any significantly and reproducibly (between all possible WT and KO replicate pairs) DEGs and be able to see the maths!

    Would appreciate any advice or alternative programs / CuffDiff tips if there is a way to make it more transparent.

    Thanks!

    Alex

  • #2
    Have a look at the Bioconductor packages DESeq and edgeR,
    and read the vignettes that come with each package, for examples of
    code to do your analyses.

    The Bioconductor project aims to develop and share open source software for precise and repeatable analysis of biological data. We foster an inclusive and collaborative community of developers and data scientists.

    Comment


    • #3
      Thanks, I'll give it a go!

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Current Approaches to Protein Sequencing
        by seqadmin


        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
        04-04-2024, 04:25 PM
      • seqadmin
        Strategies for Sequencing Challenging Samples
        by seqadmin


        Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
        03-22-2024, 06:39 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 04-11-2024, 12:08 PM
      0 responses
      25 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 10:19 PM
      0 responses
      28 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 09:21 AM
      0 responses
      24 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-04-2024, 09:00 AM
      0 responses
      52 views
      0 likes
      Last Post seqadmin  
      Working...
      X