
  • Having trouble converting genbank format to gff format

    I have a fairly large number of Sequin annotation files generated by GenBank's automated PGAP pipeline that I want to convert into BED format. I've been able to use the asn2gb program from the NCBI toolkit to generate GenBank-format files from those Sequin files, but so far I haven't had any luck converting from GenBank into GFF. Once the data is in GFF, conversion into BED won't be a problem.
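
    For reference, the GFF-to-BED step mostly comes down to a coordinate shift: GFF3 coordinates are 1-based and inclusive, while BED is 0-based and half-open, so only the start needs to be decremented. A minimal Python sketch, assuming tab-separated nine-column GFF3 input; the function name `gff3_line_to_bed` and the choice to use the `ID` attribute as the BED name are illustrative, not taken from any existing tool:

```python
def gff3_line_to_bed(line):
    """Convert one GFF3 feature line to a BED6 line.

    Returns None for comments, blank lines, and lines without 9 columns.
    """
    if not line.strip() or line.startswith("#"):
        return None
    cols = line.rstrip("\n").split("\t")
    if len(cols) != 9:
        return None
    seqid, _source, ftype, start, end, _score, strand, _phase, attrs = cols

    # GFF3 is 1-based inclusive; BED is 0-based half-open.
    bed_start = int(start) - 1
    bed_end = int(end)

    # Illustrative choice: use the ID attribute as the name, else the feature type.
    name = ftype
    for field in attrs.split(";"):
        if field.startswith("ID="):
            name = field[3:]
            break

    # BED expects an integer score (0-1000); 0 is used here as a placeholder.
    bed_strand = strand if strand in ("+", "-") else "."
    return "\t".join([seqid, str(bed_start), str(bed_end), name, "0", bed_strand])
```

    Applying this to each line of a GFF3 file and dropping the `None` results gives a six-column BED file.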

    I am trying to use bp_genbank2gff3.pl, but am getting this error message:

    --------------------- WARNING ---------------------
    MSG: Bad LOCUS name? Changing [Staphylococcus_spHMPREF3292-1.0_Cont0>11789] to 'unknown' and length to Staphylococcus_spHMPREF3292-1.0_Cont0>11789
    ---------------------------------------------------

    I thought maybe it was unhappy because of the greater-than symbol in the LOCUS line, so I tried removing it. That produced a new error message:

    ------------- EXCEPTION: Bio::Root::Exception -------------
    MSG: asking for tag value that does not exist date
    STACK: Error::throw
    STACK: Bio::Root::Root::throw /gsc/scripts/opt/genome/current/user/lib/perl/Bio/Root/Root.pm:357
    STACK: Bio::SeqFeature::Generic::get_tag_values /gsc/scripts/opt/genome/current/user/lib/perl/Bio/SeqFeature/Generic.pm:498
    STACK: main::gff_header /gsc/bin/bp_genbank2gff3.pl:895
    STACK: /gsc/bin/bp_genbank2gff3.pl:406
    -----------------------------------------------------------
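
    One possible reading of the first warning is that the locus name and the sequence length ended up joined by `>` in a single whitespace-delimited token, so the parser could not find a separate length field. If that is the case, deleting the `>` would still leave the two fused, whereas replacing it with a space would separate them. This is an assumption based only on the warning text, not a confirmed diagnosis. A small Python sketch of that edit (the function name `split_fused_locus` is hypothetical):

```python
import re

# Matches a LOCUS line whose name token ends in ">" followed by digits,
# e.g. "LOCUS       some_name>11789 bp ...". Assumes the digits are the length.
_FUSED_LOCUS = re.compile(r"^(LOCUS[ \t]+\S+?)>(\d+)(?=\s)", re.MULTILINE)


def split_fused_locus(genbank_text):
    """Replace the '>' between locus name and length with a single space.

    Only LOCUS lines are touched; every other line is returned unchanged.
    """
    return _FUSED_LOCUS.sub(r"\1 \2", genbank_text)
```

    Whether the converter then accepts the file is untested here; the `date` exception may have a separate cause.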


    All my input files were generated by GenBank itself, and I used a GenBank script to generate the .gbk files I now have, so I would expect my .gbk files to be correctly formatted. Can anyone advise me on what I'm doing wrong with the 'bp_genbank2gff3.pl' script? Alternatively, if there is a better way to convert from .gbk into .gff, that would work too.


    Thanks,
    John Martin
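
    As a fallback for simple cases, the feature table of a GenBank flat file can be read directly. The sketch below is not a replacement for bp_genbank2gff3.pl: it assumes the standard flat-file layout (feature key starting at column 6, location at column 22), handles only single-line `start..end` and `complement(start..end)` locations, skips everything else (such as `join(...)`), and ignores qualifiers. The function name `genbank_features_to_gff3` and the `source` default are hypothetical:

```python
import re

# Simple locations only: "10..50", "<10..>50", "complement(10..50)".
_SIMPLE_LOC = re.compile(r"^(complement\()?<?(\d+)\.\.>?(\d+)\)?$")


def genbank_features_to_gff3(genbank_text, source="GenBank"):
    """Emit one GFF3 line per simple-location feature in a GenBank flat file."""
    out = ["##gff-version 3"]
    seqid = "unknown"
    in_features = False
    for line in genbank_text.splitlines():
        if line.startswith("LOCUS"):
            parts = line.split()
            if len(parts) > 1:
                seqid = parts[1]  # second token of the LOCUS line
            in_features = False
        elif line.startswith("FEATURES"):
            in_features = True
        elif line.startswith("ORIGIN") or line.startswith("//"):
            in_features = False
        elif in_features and len(line) > 21 and line.startswith("     ") and line[5] != " ":
            # Feature key in columns 6-21, location from column 22 onward.
            ftype = line[5:21].strip()
            match = _SIMPLE_LOC.match(line[21:].strip())
            if match is None:
                continue  # compound or multi-line location: skipped in this sketch
            strand = "-" if match.group(1) else "+"
            # Phase and attributes are left as "."; strict GFF3 validators
            # may reject "." as the phase of a CDS feature.
            out.append("\t".join(
                [seqid, source, ftype, match.group(2), match.group(3), ".", strand, ".", "."]
            ))
    return "\n".join(out) + "\n"
```

    Feature keys are copied as-is (for example `source`, `gene`, `CDS`), so the output may need type names adjusted for tools that expect Sequence Ontology terms.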

  • #2
    The "bp_" prefix stands for "BioPerl". You might want to ask the BioPerl programmers what the problem might be. They have an email list. Possibly there are other ways to query them as well.

    --
    Phillip



    • #3
      From memory, I think readseq.jar might be able to help; it is simple to use, too.
