  • Problem with job execution when more than 4 jobs are submitted to condor

    We are building an application for analyzing NGS data. The application uses the Kepler workflow engine (v2.4) for workflow execution, and we have integrated HTCondor (v8.2.3) with Kepler to enable grid execution. The application is installed on Ubuntu 12.04 and currently runs as a single-node deployment on a high-end AWS instance.

    The problem we are facing is this:
    When we submit a workflow containing multiple files to Kepler, it creates multiple generic job launcher actors for each step. Each generic job launcher actor has a JDL file, which it submits to the Condor manager. We have observed that when more than 4 JDLs are present, one of them is intermittently not processed (i.e. one of the actors is not executed). The job remains in the idle state for some time and is then evicted. Our JDL is set up to generate three files on execution (.out, .log and .err), but in our case a few of the JDLs do not execute at all, so no files are generated for them.
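One way to pin down which jobs never ran is to check for the three expected files per job. The following is only a minimal sketch: it assumes each job writes `<job_name>.out`, `<job_name>.log` and `<job_name>.err` into a single directory, which the post does not state, and the function name `find_jobs_without_output` is hypothetical.

```python
from pathlib import Path

# Suffixes each job is expected to produce, as described in the post.
EXPECTED_SUFFIXES = (".out", ".log", ".err")


def find_jobs_without_output(output_dir, job_names):
    """Return {job_name: [missing suffixes]} for jobs missing any expected file.

    Assumption (not from the post): files are named <job_name><suffix>
    and all live directly inside output_dir.
    """
    output_dir = Path(output_dir)
    missing = {}
    for name in job_names:
        # Collect every expected suffix for which no file exists.
        absent = [suffix for suffix in EXPECTED_SUFFIXES
                  if not (output_dir / f"{name}{suffix}").exists()]
        if absent:
            missing[name] = absent
    return missing
```

A job that appears with all three suffixes missing most likely never started, whereas a job with only a `.log` file was at least registered with the scheduler.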


    We have tried to troubleshoot this by changing the memory parameters (heap size) for Condor and Kepler, but that has not helped.

    Please note that we can run workflows containing up to 2 files successfully. Our requirement is to run 1000 or more jobs simultaneously.




    Our thoughts on this issue:
    1) There might be a problem with resource (core) allocation.
    2) The jobs are being evicted, or possibly pre-empted.
    3) We suspect a condor_suspend signal is being sent to the job. If the job stays in the “wait” state for too long, Condor evicts it automatically after a specified time.
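To tell hypotheses 2 and 3 apart, the job event log can be tallied per job. This is a rough sketch, not a definitive parser: it assumes the classic HTCondor user log layout in which each event starts with a header line such as `004 (123.000.000) 11/20 10:15:02 Job was evicted.`, and the helper names `summarize_events` and `never_started` are hypothetical.

```python
import re
from collections import defaultdict

# Event numbers used in HTCondor user logs (only the subset relevant here).
EVENT_NAMES = {
    "000": "submitted",
    "001": "executing",
    "004": "evicted",
    "005": "terminated",
    "009": "aborted",
    "010": "suspended",
    "011": "unsuspended",
    "012": "held",
}

# Assumed header form: "<event> (<cluster>.<proc>.<subproc>) <date> <time> <text>"
EVENT_HEADER = re.compile(r"^(\d{3}) \((\d+)\.(\d+)\.\d+\) ")


def summarize_events(log_text):
    """Map "cluster.proc" to the ordered list of event names seen for that job."""
    history = defaultdict(list)
    for line in log_text.splitlines():
        match = EVENT_HEADER.match(line)
        if match:
            code, cluster, proc = match.groups()
            job_id = f"{int(cluster)}.{int(proc)}"
            # Unknown codes are kept as-is rather than dropped.
            history[job_id].append(EVENT_NAMES.get(code, f"event {code}"))
    return dict(history)


def never_started(history):
    """Return job ids that have no "executing" event in their history."""
    return sorted(job for job, events in history.items()
                  if "executing" not in events)
```

A job with "suspended" followed by "evicted" would point towards hypothesis 3, while a job that only ever shows "submitted" suggests it was never matched to a slot, which fits hypothesis 1 better.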


    Has anybody else faced a similar problem with Condor?


    Vaibhav Kulkarni
