  • Convert GO numbers to GO terms

    Hi all!

    Which programme is most suitable for converting a list of GO numbers to their corresponding GO terms (including assignment to MF, CC, BP and lvl)?

    I have something like:
    73562
    01276
    89003
    00271
    ...
    ...
    ...

    And I need an output like
    73562 --> MF: lvl3: oxygen transporter activity
    01276 --> CC: lvl2: mitochondrial membrane
    89003 --> CC: lvl3: inner mitochondrial membrane
    00271 --> BP: lvl4: oxygen detoxification
    ...
    ...
    ...

    Thanks for hints!

  • #2
    Hi Ramet,

    At this link you can find txt files with the info you need, although I'm not sure what the "lvl" is...
    The Gene Ontology (GO) project is a major bioinformatics initiative to develop a computational representation of our evolving knowledge of how genes encode biological functions at the molecular, cellular and tissue system levels.


    You can write a small Perl script to parse these files and combine them with your own list.
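
A minimal sketch of that parse-and-combine step, written in Python rather than Perl. It assumes the downloaded text file is in OBO format, where each `[Term]` stanza carries `id:`, `name:` and `namespace:` lines. The sample stanzas are hand-written for illustration, the helper names are hypothetical, and padding bare numbers to seven digits with a `GO:` prefix is an assumption about how the input list should be read.

```python
import io

# Hand-written sample in OBO layout; a real run would open the downloaded file.
SAMPLE_OBO = """\
format-version: 1.2

[Term]
id: GO:0005743
name: mitochondrial inner membrane
namespace: cellular_component
is_a: GO:0031966 ! mitochondrial membrane

[Term]
id: GO:0031966
name: mitochondrial membrane
namespace: cellular_component

[Typedef]
id: part_of
name: part of
"""

ABBREV = {
    "molecular_function": "MF",
    "cellular_component": "CC",
    "biological_process": "BP",
}


def _store(terms, stanza):
    # Keep a stanza only if it was a [Term] block that had an id line.
    if stanza and "id" in stanza:
        terms[stanza["id"]] = (stanza.get("namespace", ""), stanza.get("name", ""))


def parse_obo_terms(handle):
    """Return {GO id: (namespace, name)} for every [Term] stanza."""
    terms, current = {}, None
    for raw in handle:
        line = raw.strip()
        if line.startswith("["):
            _store(terms, current)
            current = {} if line == "[Term]" else None
        elif current is not None and ": " in line:
            key, value = line.split(": ", 1)
            current.setdefault(key, value)  # keep the first value per tag
    _store(terms, current)
    return terms


def format_line(number, terms):
    """Turn a bare number such as '5743' into 'GO:0005743 --> CC: name'."""
    go_id = "GO:" + str(number).strip().zfill(7)  # assumed ID normalisation
    if go_id not in terms:
        return f"{go_id} --> not found"
    namespace, name = terms[go_id]
    return f"{go_id} --> {ABBREV.get(namespace, namespace)}: {name}"


if __name__ == "__main__":
    terms = parse_obo_terms(io.StringIO(SAMPLE_OBO))
    for number in ["5743", "31966", "1"]:
        print(format_line(number, terms))
```

The level is deliberately left out here, since it is not stored in the file and has to be derived from the `is_a` links.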

    -Stef



    • #3
      I think the key is the calculation of the level!



      • #4
        Another option is to create a local instance of the GO MySQL database and use an SQL query to map IDs to descriptions. Instructions for doing this are here:
        http://www.geneontology.org/GO.datab...build-your-own
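
A rough sketch of what such an ID-to-description query might look like. It assumes a `term` table with `acc`, `name` and `term_type` columns, which should be checked against the schema documentation before use. To keep the example runnable without a MySQL server, it uses an in-memory SQLite table as a stand-in with two hand-entered rows; `lookup` is a hypothetical helper name.

```python
import sqlite3

# In-memory stand-in for the local GO database (assumed table layout).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE term ("
    "id INTEGER PRIMARY KEY, acc TEXT, name TEXT, term_type TEXT)"
)
conn.executemany(
    "INSERT INTO term (acc, name, term_type) VALUES (?, ?, ?)",
    [
        ("GO:0005743", "mitochondrial inner membrane", "cellular_component"),
        ("GO:0031966", "mitochondrial membrane", "cellular_component"),
    ],
)


def lookup(connection, accessions):
    """Return (acc, term_type, name) rows for the given GO accessions."""
    accessions = list(accessions)
    if not accessions:
        return []
    # One placeholder per accession keeps the query parameterised.
    placeholders = ", ".join("?" for _ in accessions)
    query = (
        "SELECT acc, term_type, name FROM term "
        f"WHERE acc IN ({placeholders}) ORDER BY acc"
    )
    return connection.execute(query, accessions).fetchall()


if __name__ == "__main__":
    for acc, term_type, name in lookup(conn, ["GO:0031966", "GO:0005743"]):
        print(f"{acc} --> {term_type}: {name}")
```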



        • #5
          Yes, calculating the level (or 'depth') of a term is the main challenge in obtaining the information 'Ramet' seeks. A large number of MySQL queries against the GO schema are discussed at the URL below, and that page mentions several difficulties and caveats associated with depth. Each term can have multiple associations to other terms, so one term may have more than one depth depending on the path you take through the graph. You are therefore faced with deciding which depth to use (average, minimum, maximum?). This all leads to the questions: Do you need the level? Does it really mean what you hope it does?

          From the GO documentation: "The depth of a node or distance between nodes is not necessarily biologically meaningful. Other more advanced queries are required to get biologically meaningful information-theory based distances between terms."
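
To make the ambiguity concrete, here is a small sketch that computes the shortest and longest path from a term up to the root. The graph is a hypothetical toy example, not real GO data, and `depths` is an illustrative helper name. It assumes every term lists its direct parents and that the graph has no cycles.

```python
from functools import lru_cache

# Toy graph: each term maps to its direct parents. "C" has two parents,
# one directly under the root and one three steps away from it.
PARENTS = {
    "root": [],
    "A": ["root"],
    "B": ["A"],
    "C": ["root", "B"],
}


def depths(term, parents):
    """Return (minimum depth, maximum depth) of a term; the root is depth 0."""

    @lru_cache(maxsize=None)
    def walk(node):
        direct = parents[node]
        if not direct:
            return (0, 0)
        results = [walk(parent) for parent in direct]
        shortest = 1 + min(low for low, _ in results)
        longest = 1 + max(high for _, high in results)
        return (shortest, longest)

    return walk(term)


if __name__ == "__main__":
    for term in PARENTS:
        low, high = depths(term, PARENTS)
        print(f"{term}: min depth {low}, max depth {high}")
```

In this toy graph, term "C" sits at depth 1 by one route and depth 3 by another, which is exactly the choice described above.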

          http://wiki.geneontology.org/index.p...terms_by_GO_ID
          Good luck!



          • #6
            There's quite a nice Perl API to the GO databases at:

            http://search.cpan.org/~cmungall/go-perl/go-perl.pod

            The example code at the top pretty much does what you're after.



            • #7
              Thanks to everyone!

              You have convinced me that I may not need the level (depth) after all. Thanks for that conceptual help!

              And parsing the text files from http://www.geneontology.org/GO.downloads.files.shtml with a Perl script seems to be the solution to my problem.
              Shame on me for having missed them.

              I'll have a look at the MySQL database solution later on and then decide which way to go.

              Again, thank you very much!
