Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • yu_chem
    Member
    • Mar 2015
    • 23

    Gene identifier provided by NCBI

    Hi everyone

    I ask you about gene identifier.
    I know that NCBI provide several gene identifier e.g. Entrez gene symbol, Entrez gene ID, Unigene, Official gene and so, but I don't know proper use of them.

    Entrez gene symbol (e.g. POU5F1)
    Unigene (Hs.249184)

    First question:
    I don't know what does NCBI call "Entrez gene symbol"
    I think that "Entrez gene symbol" is not official name, because I could not find the document provided by NCBI containing "Entrez gene symbol".

    Second question:
    I want a file containing "Entrez gene symbol" and any identifier e.g. refseq ID (NM_****)
    If I have the files, Mostly I can convert certain identifier to Entrez gene symbol via any identifier

    I hope you answer two questions.
    Best regards,
  • Richard Finney
    Senior Member
    • Feb 2009
    • 701

    #2
    Get this file : ftp.ncbi.nlm.nih.gov/gene/DATA/gene_info.gz

    This command will do it :wget ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/gene_info.gz

    NCBI used to promote the term "Entrez" in terms like "Entrez Gene ID" ... but they are apparently no longer emphasizing this term. "Entrez" apparently refered to the software system used to access NCBI information.

    "Gene id" or "GeneID" is the accesison(?) number used by NCBI in column 2 in the file "gene_info" ( mentioned earlier).

    The official name is in the "Full_name_from_nomenclature_authority" field.

    Example for human TP53 gene ...

    grep -P "\tTP53\t" gene_info | grep "^9606" | cut -f1-13
    9606 7157 TP53 - BCC7|LFS1|P53|TRP53 MIM:191170|HGNC:HGNC:11998|Ensembl:ENSG00000141510|HPRD:01859|Vega:OTTHUMG00000162125 17 17p13.1 tumor protein p53 protein-coding TP53 tumor protein p53

    NCBI GeneID is 7157 and offical (HUGO) name is TP53 : https://www.ncbi.nlm.nih.gov/gene/?term=7157

    "NM_" identfiers or "RNA_nucleotide_accession.version" are in the file "gene2accession" , available at from he same place:
    wget nc ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/gene2accession.gz
    Last edited by Richard Finney; 11-01-2016, 06:53 AM.

    Comment

    • yu_chem
      Member
      • Mar 2015
      • 23

      #3
      Originally posted by Richard Finney View Post
      Get this file : ftp.ncbi.nlm.nih.gov/gene/DATA/gene_info.gz

      This command will do it :wget ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/gene_info.gz

      NCBI used to promote the term "Entrez" in terms like "Entrez Gene ID" ... but they are apparently no longer emphasizing this term. "Entrez" apparently refered to the software system used to access NCBI information.

      "Gene id" or "GeneID" is the accesison(?) number used by NCBI in column 2 in the file "gene_info" ( mentioned earlier).

      The official name is in the "Full_name_from_nomenclature_authority" field.

      Example for human TP53 gene ...

      grep -P "\tTP53\t" gene_info | grep "^9606" | cut -f1-13
      9606 7157 TP53 - BCC7|LFS1|P53|TRP53 MIM:191170|HGNC:HGNC:11998|Ensembl:ENSG00000141510|HPRD:01859|Vega:OTTHUMG00000162125 17 17p13.1 tumor protein p53 protein-coding TP53 tumor protein p53

      NCBI GeneID is 7157 and offical (HUGO) name is TP53 : https://www.ncbi.nlm.nih.gov/gene/?term=7157

      "NM_" identfiers or "RNA_nucleotide_accession.version" are in the file "gene2accession" , available at from he same place:
      wget nc ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/gene2accession.gz
      Thank you for answer
      I understood about Entrez ID and checked gene_info

      If possible, I hope you answer following question.
      Is Entrez gene symbol official gene symbol provided by HGNC and MGI?

      I finished the quetions
      So Best regards,

      Comment

      • GenoMax
        Senior Member
        • Feb 2008
        • 7142

        #4
        Originally posted by yu_chem View Post

        If possible, I hope you answer following question.
        Is Entrez gene symbol official gene symbol provided by HGNC and MGI?

        So Best regards,
        It should be for human genes because of this.

        Even though there is a separate committee for mouse, it appears that the process for gene name assignment for many vertebrates is moving under a new committee VGNC.

        Comment

        • yu_chem
          Member
          • Mar 2015
          • 23

          #5
          Originally posted by GenoMax View Post
          It should be for human genes because of this.

          Even though there is a separate committee for mouse, it appears that the process for gene name assignment for many vertebrates is moving under a new committee VGNC.
          Thank you for answer.
          I understood. I'll look up more information based on the above.

          Comment

          Latest Articles

          Collapse

          • SEQadmin2
            Nine Things a Sample Prep Scientist Thinks About Before Sequencing
            by SEQadmin2


            I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


            Here are nine questions we think about, in roughly the order they matter, before...
            06-18-2026, 07:11 AM
          • SEQadmin2
            From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
            by SEQadmin2


            Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


            The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
            ...
            06-02-2026, 10:05 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by SEQadmin2, 06-17-2026, 06:09 AM
          0 responses
          30 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-09-2026, 11:58 AM
          0 responses
          96 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-05-2026, 10:09 AM
          0 responses
          115 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-04-2026, 08:59 AM
          0 responses
          108 views
          0 reactions
          Last Post SEQadmin2  
          Working...