Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • DESeq DiffExpress Vs. Fold Change

    I don't understand the logic behind the data set DESeq provides when I perform the differential expression command. All of the numbers in the columns are in random order, there is not one column that gives the table numerical sense. What is the basis for the list of genes I am recieving? Fold change on the other hand makes more sense be it up or down regulation and the numbers in the columns have an obvious order. Is the fold change shown relative to each other, or in the perspective of the first condition over the second?

  • #2
    The first column is just a row id taken from your input file. The data is simply presented in the same order as the input data. If you don't want to sort the data in R, just write it out to a tab delimited table and parse it in Excel (the headers will be offset by one, since that first column is not data, it's just row number relative to the input data order).

    > res <- nbinomTest(cds, "Control", "Treatment")
    > write.table(res, file = "nBinom_results.txt", sep = "\t")

    Open nBinom_results.txt in Excel and parse it by tabs and sort it by whichever column is of interest. If set up as above, fold change will be for Treatment relative to control (ie. + is up-regulated in Treatement, and - is down-regulated in Treatment.
    Michael Black, Ph.D.
    ScitoVation LLC. RTP, N.C.

    Comment


    • #3
      Thanks For the quick reply Mr. Black

      Ok, I understand that the data can be sorted manually in excel, but I don't see what corelation is among the genes that are marked as differential expressed. I would assume it would be greatest to least or vice versa for at least one column of numbers but i am not seeing that relation. I'm sure its there...its just not immediately obvious.

      P.S. please excuse my spelling errors I cannot spell well and this Mac wont let me use spell check or rather i don't care enough to make it work or learn how.

      Comment


      • #4
        The default is to list the data in the same gene/transcript order as the input data. The only way to get order like you wish is to sort the data yourself, by pValue, adjPvalue, fold change, whatever you wish. But the default is just to list it by input order.

        Note too that if you use the two commands I wrote, you get the results back for all the genes in your input, not just significant genes. You would then have to sort that table and apply whatever statistical cutoffs you wish (e.g. sort by adjPvalue, then select only those less than 0.01).

        You could do something like:

        > resSig <- res[ res$padj < 0.1, ]
        > head(resSig[ order(resSig$padj), ])

        to see the most significant genes sorted by adjPvalue. But again, if you wrote those out to a file:

        > write.table(resSig, file = "nBinom_results_lessthan_.1.txt", sep = "\t")

        That file would only include those genes with an adjPvalue less than 0.1, but the table would still be sorted initially simply by the row input order of the genes. So, you would still need to sort it manually to apply whatever order you wished to see in it.
        Michael Black, Ph.D.
        ScitoVation LLC. RTP, N.C.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Strategies for Sequencing Challenging Samples
          by seqadmin


          Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
          03-22-2024, 06:39 AM
        • seqadmin
          Techniques and Challenges in Conservation Genomics
          by seqadmin



          The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

          Avian Conservation
          Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
          03-08-2024, 10:41 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Yesterday, 06:37 PM
        0 responses
        10 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, Yesterday, 06:07 PM
        0 responses
        9 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 03-22-2024, 10:03 AM
        0 responses
        50 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 03-21-2024, 07:32 AM
        0 responses
        67 views
        0 likes
        Last Post seqadmin  
        Working...
        X