Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • DESeq2: struggle to add multiple variables to DESeqDataSet

    Hi,

    I want to add the two variables condition and treatment to my DESeqDataSet, and I thought I had succeeded:

    > ddsHTSeq
    class: DESeqDataSet
    dim: 21680 11
    exptData(0):
    assays(1): counts
    rownames(21680): scign000001 scign000002 ... scign021679 scign021680
    rowData metadata column names(0):
    colnames(11): R1D0B1 R1D2B1 ... R3DxB1 R3DzB1
    colData names(2): condition treatment

    I added four levels to condition and two levels to treatment and I am able to pull out DESeq results from the different levels of condition, but I can't pull out the results from treatment:

    > resTreatment = results(ddsHTSeq, contrast=c("treatment", "cut", "uncut"))

    Any ideas what I am doing wrong?

  • #2
    hi Jon,

    Can you include the code you used to create ddsHTSeq?

    Comment


    • #3
      Here's the code I used:

      Code:
      library("DESeq2")
      
      #Set the directory where the HTSeq count files are located
      directory = "/Users/jonbra/Desktop/Gene_count_files/"
      
      #Catch the files with the pattern "GFF"
      sampleFiles = grep("GFF", list.files(directory), value=TRUE)
      sampleFiles
      
      #Setup sample conditions in the same order of the files in sampleFiles
      sampleCondition = c("D0", "D2", "Dx", "Dz", "D0", "D2", "Dx", "Dz", "D0", "Dx", "Dz")
      sampleName = c("R1D0B1", "R1D2B1", "R1DxB1", "R1DzB1", "R2D0B1", "R2D2B1", "R2DxB1", "R2DzB1", "R3D0B1", "R3DxB1", "R3DzB1")
      sampleTreatment = c("uncut", "cut", "cut", "cut", "uncut", "cut", "cut", "cut", "uncut", "cut", "cut")
      sampleTable = data.frame(sampleName=sampleName, fileName=sampleFiles, condition=sampleCondition, treatment=sampleTreatment)
      sampleTable
      
      #Creates the DESeqDataSet which stores the count data for DESeq2
      ddsHTSeq = DESeqDataSetFromHTSeqCount(sampleTable=sampleTable, directory=directory, design= ~ condition)
      ddsHTSeq
      
      colData(ddsHTSeq)$condition=factor(colData(ddsHTSeq)$condition, levels=c("D0", "D2", "Dx", "Dz"))
      colData(ddsHTSeq)$treatment=factor(colData(ddsHTSeq)$treatment, levels=c("uncut", "cut"))
      I tried to use this design formula instead:

      Code:
      ddsHTSeq = DESeqDataSetFromHTSeqCount(sampleTable=sampleTable, directory=directory, design= ~ treatment + condition)
      But got this error message:

      Error in DESeqDataSet(se, design = design, ignoreRank) :
      the model matrix is not full rank, i.e. one or more variables in the design formula are linear combinations of the others

      Comment


      • #4
        The problem you will have with modeling ~ treatment + condition is that D0 is indistinguishable from uncut, i.e. all samples which are D0 are also uncut and vice versa. That makes the model impossible to fit as specified.

        If you leave out the treatment variable, you can make comparisons of each level against the other. And you can also make the comparison of D2,Dx,Dz vs D0 (so the treatment contrast you are interested) using the list argument to the contrast argument:

        Code:
        results(dds, 
          contrast=list(c("conditionD2","conditionDx","conditionDz"),"conditionD0"), 
          listValues=c(1/3,-1))

        Comment


        • #5
          Ah, I see. That makes sense. Thanks!

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Current Approaches to Protein Sequencing
            by seqadmin


            Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
            04-04-2024, 04:25 PM
          • seqadmin
            Strategies for Sequencing Challenging Samples
            by seqadmin


            Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
            03-22-2024, 06:39 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 04-11-2024, 12:08 PM
          0 responses
          30 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 10:19 PM
          0 responses
          32 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 09:21 AM
          0 responses
          28 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-04-2024, 09:00 AM
          0 responses
          52 views
          0 likes
          Last Post seqadmin  
          Working...
          X