Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • GeneMark-ES exits with "error, file not found: info/training.fna"

    Dear All,
    I am trying to use GeneMark-ES Suite 4.32 to predict genes from a fungal genome. But GeneMark exits with:

    error, file not found: info/training.fna
    To run the program, I am using the following command:

    PHP Code:
    ./gmes_petap.pl --ES --fungus --sequence CBS_contigs.fasta.masked 
    The log file 'gmes.log' contains the following:

    gmes_petap.pl : [Mon Jul 17 14:19:04 2017] /home/gaurav/2_GeneMark_results/gmes_petap/probuild --reformat_fasta --uppercase --allow_x --letters_per_line 60 --out data/dna.fna --label _dna --trace info/dna.trace --in /home/gaurav/1_Masking_Genome/CBS/CBS_contigs.fasta.masked

    gmes_petap.pl : [Mon Jul 17 14:19:04 2017] /home/gaurav/2_GeneMark_results/gmes_petap/probuild --seq data/dna.fna --allow_x --stat info/dna.general

    gmes_petap.pl : [Mon Jul 17 14:19:05 2017] /home/gaurav/2_GeneMark_results/gmes_petap/probuild --seq data/dna.fna --allow_x --stat_fasta info/dna.multi_fasta

    gmes_petap.pl : [Mon Jul 17 14:19:05 2017] /home/gaurav/2_GeneMark_results/gmes_petap/probuild --seq data/dna.fna --allow_x --substring_n_distr info/dna.gap_distr

    gmes_petap.pl : [Mon Jul 17 14:19:06 2017] /home/gaurav/2_GeneMark_results/gmes_petap/gc_distr.pl --in data/dna.fna --out info/dna.gc.csv --w 1000,8000

    gmes_petap.pl : [Mon Jul 17 14:19:06 2017] /home/gaurav/2_GeneMark_results/gmes_petap/probuild --seq /home/gaurav/2_GeneMark_results/gmes_petap/data/dna.fna --split dna.fa --max_contig 5000000 --min_contig 50000 --letters_per_line 100 --split_at_n 5000 --split_at_x 5000 --allow_x --x_to_n --trace ../../info/training.trace
    I have tried troubleshooting this error, but it did not resolved. Please help me resolve this error. Let me know if you need any further information. (I have even configured GeneMark-ET with the Braker pipeline, it works fantastic there. But I don't know what's the problem with GeneMark-ES over here.) Any help is appreciated. Thanks in advance.

  • #2
    The error is about the program not finding this file "../../info/training.trace". Is that file present in location relative to the directory you are running your command from?

    Comment


    • #3
      Originally posted by GenoMax View Post
      The error is about the program not finding this file "../../info/training.trace". Is that file present in location relative to the directory you are running your command from?
      Hi GenoMax,

      After posting, I have used the same command on another assembly which has better genome statistics than the previous one and it worked !!! It seems the problem is with 'probuild', as it requires atleast 10Mb of good data for training (ref: [https://www.researchgate.net/post/Genemark-ES_error]), which could not be retrieved from the poor assemblies. So I used AUGUSTUS instead for the poor ones and it is working. Anyway, thats the max I can make out of poor ones.

      Thanks.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Strategies for Sequencing Challenging Samples
        by seqadmin


        Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
        03-22-2024, 06:39 AM
      • seqadmin
        Techniques and Challenges in Conservation Genomics
        by seqadmin



        The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

        Avian Conservation
        Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
        03-08-2024, 10:41 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, Yesterday, 06:37 PM
      0 responses
      10 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, Yesterday, 06:07 PM
      0 responses
      9 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 03-22-2024, 10:03 AM
      0 responses
      51 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 03-21-2024, 07:32 AM
      0 responses
      67 views
      0 likes
      Last Post seqadmin  
      Working...
      X