Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • GAGE enrichment GO terms, KEGG pathways

    Hello,
    I am currently trying to find the enriched pathways following a RNAseq analysis using DESeq2, in a plant species where there is no reference genome. However, it does not work as I expected.

    I used the bioconductor package “mygene” in order to get entrez identifiers corresponding to my uniprot identifiers that I had in my genes list. I then constructed my fold changes table using the log2 fold change obtained by using DESeq2, and having for names the entrez identifiers.

    Code:
    head(foldchanges)
    111151 6920 100303206 24328 8668 20517
    0.5957113 -0.4976848 0.4986454 -0.1833950 -0.3897194 0.5718210

    However, then when I use the gage function, I have troubles finding the enriched pathways. My data actually comes from plants, so I thought of using the kegg’s pathways from Arabidopsis thaliana as such:

    Code:
    kegg.ath=kegg.gsets("ath",id.type="entrez")
    kegg.ath.sigmet=kegg.ath$kg.sets[kegg.ath$sigmet.idx]
    keggres=gage(foldchanges, gsets=kegg.ath.sigmet, same.dir=TRUE)
    But I get this kind of results:

    $greater
    p.geomean stat.mean p.val q.val set.size exp1
    ath00970 Aminoacyl-tRNA biosynthesis NA NaN NA NA 0 NA
    ath02010 ABC transporters NA NaN NA NA 0 NA

    However when I tried by curiosity to use the homo sapiens pathway with the following code, it seems to work better…
    Code:
    data(kegg.sets.hs)
    data(sigmet.idx.hs)
    kegg.sets.hs=kegg.sets.hs[sigmet.idx.hs]
    keggres=gage(foldchanges, gsets=kegg.sets.hs, same.dir=TRUE)
    $greater
    p.geomean stat.mean p.val q.val set.size exp1
    hsa00010 Glycolysis / Gluconeogenesis 0.1426633 1.10118079 0.1426633 0.9257147 10 0.1426633
    hsa00240 Pyrimidine metabolism 0.1566751 1.03046501 0.1566751 0.9257147 13 0.1566751

    Can someone give me a clue about what is going on?

    Best regards,
    BioLion

Latest Articles

Collapse

  • seqadmin
    Current Approaches to Protein Sequencing
    by seqadmin


    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
    04-04-2024, 04:25 PM
  • seqadmin
    Strategies for Sequencing Challenging Samples
    by seqadmin


    Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
    03-22-2024, 06:39 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, 04-11-2024, 12:08 PM
0 responses
18 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 10:19 PM
0 responses
22 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 09:21 AM
0 responses
16 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-04-2024, 09:00 AM
0 responses
47 views
0 likes
Last Post seqadmin  
Working...
X