Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • entamoeba histolytica reference genome (gene annotation file) in GTF format

    Hi,
    I am working with Entamoeba data. I have downloaded the genome data of entamoeba from NCBI in genebank format but did not get in GTF format. I have searched it in UCSC genome browser but unable to find out. Can any one suggest me in either find out it in GTF format or converting from genebank t GTF format ?

  • #2
    If you are able to do a bit of Python coding on your own you could use the bcbio BioPython module for GFF parsing/writing from the following page; there is a description on how to convert GenBank flat format annotation files into GFF3:


    Then you could use the GenomeTools (http://genometools.org/) gff3_to_gtf tool to get proper GTF (if necessary, many tools can use GFF3 as well).
    I've done this before though I've seen troubles with certain GenBank files, as some features are difficult to correctly translate into GFF3. You might need to manually check that the contents are correct, or filter out stuff that isn't needed in the end, to avoid problems with the GenomeTools converter.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Essential Discoveries and Tools in Epitranscriptomics
      by seqadmin


      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
      Yesterday, 07:01 AM
    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    39 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    41 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 09:21 AM
    0 responses
    35 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-04-2024, 09:00 AM
    0 responses
    55 views
    0 likes
    Last Post seqadmin  
    Working...
    X