Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #46
    Pasted wrong sessioninfo..

    # > sessionInfo()
    # R version 3.0.3 (2014-03-06)
    # Platform: x86_64-apple-darwin10.8.0 (64-bit)

    # locale:
    # [1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8

    # attached base packages:
    # [1] grid parallel stats graphics grDevices utils datasets
    # [8] methods base

    # other attached packages:
    # [1] Rgraphviz_2.6.0
    # [2] gageData_2.0.3
    # [3] pathview_1.2.4
    # [4] org.Hs.eg.db_2.10.1
    # [5] RSQLite_0.11.4
    # [6] DBI_0.2-7
    # [7] KEGGgraph_1.20.0
    # [8] graph_1.40.1
    # [9] XML_3.95-0.2
    # [10] gage_2.12.3
    # [11] Rsamtools_1.14.3
    # [12] Biostrings_2.30.1
    # [13] TxDb.Hsapiens.UCSC.hg19.knownGene_2.10.1
    # [14] GenomicFeatures_1.14.5
    # [15] AnnotationDbi_1.24.0
    # [16] Biobase_2.22.0
    # [17] GenomicRanges_1.14.4
    # [18] XVector_0.2.0
    # [19] IRanges_1.20.7
    # [20] BiocGenerics_0.8.0

    # loaded via a namespace (and not attached):
    # [1] biomaRt_2.18.0 bitops_1.0-6 BSgenome_1.30.0
    # [4] digest_0.6.4 httr_0.2 KEGGREST_1.2.0
    # [7] png_0.1-7 RCurl_1.95-4.1 rtracklayer_1.22.6
    # [10] stats4_3.0.3 stringr_0.6.2 tools_3.0.3
    # [13] zlibbioc_1.8.0


    Pathview works, but I dont get colors or up/down regulated genes...

    Comment


    • #47
      pv.out.list <- sapply(enriched_pathways, function(pid) pathview(gene.data = gene_fc, pathway.id = pid, species = "sce", gene.idtype="KEGG", same.layer = F, kegg.native = T, node.sum="median"))

      Data in pathview:
      pv.out.list[1]
      GLK1 -0.35 0.000 0.620 -1.118 -0.900

      # original data supplied for pathview
      gene_fc[1,]
      GLK 0.14 -1.6 0.62 -1.1 -0.37

      I have attached the resultant pathway image.
      Original image sce00051.png shows genes specific[in green] to yeast.

      I am wondering why pathview data differ for GLK when GLK is the only gene on that node.

      Thanks,
      Shriram
      Attached Files

      Comment


      • #48
        Another question... How to alter the heatmaps text-part size (KEGG pathway names)?
        The names are all capped, its unreadable.. The "pdf.size" - option only regualtes the graphic part.

        ###################################################
        ### significant.genesets
        ###################################################
        kegg.sig<-sigGeneSet(cnts.kegg.p, outname="~/RNAseq/13_Acute-Changes/14_GAGE_native_A1A2/A1A2All/A1A2All.kegg",pdf.size = c(7,12))

        Comment


        • #49
          Shriram,
          Here is the original graph from official KEGG website:

          With the GLK1 gene node marked by red box. Hover your mouse over there, do you see 3 budding yeast genes in this nodes: YCL040W (GLK1), YFR053C (HXK1), YGL253W (HXK2)? You may click on this node, and check the details of these genes. Again, the nodes in pathview graphs are labelled with the most representative gene name rather than all mapped gene names for clarity.


          Originally posted by shriram View Post
          pv.out.list <- sapply(enriched_pathways, function(pid) pathview(gene.data = gene_fc, pathway.id = pid, species = "sce", gene.idtype="KEGG", same.layer = F, kegg.native = T, node.sum="median"))

          Data in pathview:
          pv.out.list[1]
          GLK1 -0.35 0.000 0.620 -1.118 -0.900

          # original data supplied for pathview
          gene_fc[1,]
          GLK 0.14 -1.6 0.62 -1.1 -0.37

          I have attached the resultant pathway image.
          Original image sce00051.png shows genes specific[in green] to yeast.

          I am wondering why pathview data differ for GLK when GLK is the only gene on that node.

          Thanks,
          Shriram

          Comment


          • #50
            Originally posted by bigmw View Post
            Shriram,
            Here is the original graph from official KEGG website:

            With the GLK1 gene node marked by red box. Hover your mouse over there, do you see 3 budding yeast genes in this nodes: YCL040W (GLK1), YFR053C (HXK1), YGL253W (HXK2)? You may click on this node, and check the details of these genes. Again, the nodes in pathview graphs are labelled with the most representative gene name rather than all mapped gene names for clarity.
            That's great, it makes full sense now, thank you very much for your help.
            Shriram

            Comment


            • #51
              Well, you can check the first attached image in post 47. Obviously you do get different colors on pathview graphs. I think you might have checked the pathview output in the wrong directory:
              kegg.dir = "~/RNAseq/13_Acute-Changes/13_GAGE_native_A1A2/A1A2pT2D/Pathview"

              This is the directory storing original KEGG pathway graphs (and xml data). you should check in your current working directory for pathview output:
              getwd()


              Originally posted by sindrle View Post
              Pasted wrong sessioninfo..

              # > sessionInfo()
              # R version 3.0.3 (2014-03-06)
              # Platform: x86_64-apple-darwin10.8.0 (64-bit)
              ...

              Pathview works, but I dont get colors or up/down regulated genes...

              Comment


              • #52
                Someone asked a similar question here:


                Originally posted by sindrle View Post
                Another question... How to alter the heatmaps text-part size (KEGG pathway names)?
                The names are all capped, its unreadable.. The "pdf.size" - option only regualtes the graphic part.

                ###################################################
                ### significant.genesets
                ###################################################
                kegg.sig<-sigGeneSet(cnts.kegg.p, outname="~/RNAseq/13_Acute-Changes/14_GAGE_native_A1A2/A1A2All/A1A2All.kegg",pdf.size = c(7,12))

                Comment


                • #53
                  Hm.
                  Ok, Ill try Adobe Illustrator, I tried Preview, but there they were capped as well...

                  I ended up editing it in Excel, cropping of the names and copy them in again from the "significant-gs.txt", then saving as new PDF.

                  Comment


                  • #54
                    Why don’t you go beyond the preview step and actually open/load the file, you will see that the full graph extends beyond the artboard area.

                    You can do the following things:
                    1 Save a png file
                    File -> Save for Microsoft office

                    2 Edit artboard size and save another copy of new pdf:
                    Object -> Artboard -> Fit to Artwork Bounds->..

                    Comment


                    • #55
                      Forgot that sigGeneSet function has been updated to give users more control on the margin and font size. sigGeneSet calls a internal function heatmap2 to generate the heatmaps. So check the argument for this function
                      args(gage:::heatmap2)
                      The argument two relevant arguments here are margins and cexRow, which control the margins for column/row names and row name font size, you may do something like:
                      kegg.sig<-sigGeneSet(cnts.kegg.p,outname="~/RNAseq/13_Acute-Changes/14_GAGE_native_A1A2/A1A2All/A1A2All.kegg",pdf.size = c(7,12), margins = c(5,10))


                      Originally posted by sindrle View Post
                      Another question... How to alter the heatmaps text-part size (KEGG pathway names)?
                      The names are all capped, its unreadable.. The "pdf.size" - option only regualtes the graphic part.

                      ###################################################
                      ### significant.genesets
                      ###################################################
                      kegg.sig<-sigGeneSet(cnts.kegg.p, outname="~/RNAseq/13_Acute-Changes/14_GAGE_native_A1A2/A1A2All/A1A2All.kegg",pdf.size = c(7,12))

                      Comment


                      • #56
                        Thank you!


                        Another question, why is it that the absolute expression levels of genes is not important when doing GAGE/GOseq/DAVID etc.?

                        If you have a new cell type/tissue or even a species, how do you go about doing GAGE if you do NO COMPARISONS, only you want to get familiar with this new transcriptome?
                        You only have set of genes that expressed (including CPM/FPKM values), and you also know which genes thats not expressed.

                        Can you use this information? Basically, you just use one condition, and the expression values of the genes expressed.

                        Comment


                        • #57
                          In pathway analysis or gene set analysis, you may still use absolute expression levels rather than relative expression levels. However, two things make absolute expression analyses less desirable.

                          First, it is very hard to quantify absolute expression level. The expression measurements are affected by multiple factors other than the actual expression level or abundance, including transcript sequences, length, GC contents etc. These factors are mostly gene/transcript specific, normalization methods don’t model all these factor well.
                          Second, absolute expression levels are not very biologically relevant even if estimated accurately. Different genes, groups or pathways have very different working abundance ranges and dynamics. Therefore, it is less meaningful to compare the absolute expression levels between genes or pathways directly.

                          On the opposite, relative expression levels can be accurately estimated and are more comparable between genes because all gene specific factors are cancelled out in such measurements, i.e. a gene's expression relative to its own reference level.

                          Comment


                          • #58
                            Having that said, you can always do GAGE (or other pathway analysis) on absolute expression levels. GAGE does work on single sample or condition (single-column matrices or vectors). But very likely, you will always see some housing keep pathways/groups, like cell growth, protein synthesis and energy metabolism etc, more expressed than others.

                            Comment


                            • #59
                              Have you compared GAGE vs other gene set analysis methods like GOseq etc?

                              Originally posted by sindrle View Post
                              Thank you!


                              Another question, why is it that the absolute expression levels of genes is not important when doing GAGE/GOseq/DAVID etc.?

                              If you have a new cell type/tissue or even a species, how do you go about doing GAGE if you do NO COMPARISONS, only you want to get familiar with this new transcriptome?
                              You only have set of genes that expressed (including CPM/FPKM values), and you also know which genes thats not expressed.

                              Can you use this information? Basically, you just use one condition, and the expression values of the genes expressed.

                              Comment


                              • #60
                                I have done a quick test with GOseq, but I must admit I like GAGE better after first glance. Easy to follow, nice manual, nice plots, lots of results and possibilities. It really facilitates further analysis I think.

                                But Im going to give GOseq another go for sure!

                                Comment

                                Latest Articles

                                Collapse

                                • seqadmin
                                  Strategies for Sequencing Challenging Samples
                                  by seqadmin


                                  Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                                  03-22-2024, 06:39 AM
                                • seqadmin
                                  Techniques and Challenges in Conservation Genomics
                                  by seqadmin



                                  The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                                  Avian Conservation
                                  Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                                  03-08-2024, 10:41 AM

                                ad_right_rmr

                                Collapse

                                News

                                Collapse

                                Topics Statistics Last Post
                                Started by seqadmin, Yesterday, 06:37 PM
                                0 responses
                                10 views
                                0 likes
                                Last Post seqadmin  
                                Started by seqadmin, Yesterday, 06:07 PM
                                0 responses
                                9 views
                                0 likes
                                Last Post seqadmin  
                                Started by seqadmin, 03-22-2024, 10:03 AM
                                0 responses
                                51 views
                                0 likes
                                Last Post seqadmin  
                                Started by seqadmin, 03-21-2024, 07:32 AM
                                0 responses
                                67 views
                                0 likes
                                Last Post seqadmin  
                                Working...
                                X