Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • blast+ on a grid

    Hi,

    After reading a lot manual pages I am still uncertain how to properly schedule a multithreaded blast run with SGE, so please help. I wanted to run blastx with option of 16 threads, but I cannot find how to request resources for these 16 threads in a qsub script. The manual gives openmpi examples, followed by either mpiexec or mpirun, but blast+ is not said to be compiled for the openmpi environment, is it? Another available option would be '-pe smp 16', but is not that requesting one node with 16 cores, which may not exist? Other described arguments to qsub for grids elsewhere mention slots=n, cores=n, low* n, high n, threaded n, orte n, orte_fillup - all these return error on our grid.
    Another option would be running blasx with GNU parallel, but again it is not clear how to request number of threads/cores/CPUs in such case.

  • #2
    On your head node type 'man qsub' to get the man page you need. Its been a while since I've used SGE. I think you need a line like this

    #$ -pe 16

    in your submit script.

    I think you'll get much better performance by splitting your query sequences into 16 separate fasta files and submitting an array job.

    Comment


    • #3
      If you have 16-core machines on the cluster, use -pe smp 16, and call BLAST+ with 16 threads. If you only have 4-core machines on the cluster, use -pe smp 4, and call BLAST+ with 4 threads.

      As Mike points out, when you query file is made up of lots of sequences, splitting this into separate FASTA files and running separate BLAST processes on different cluster nodes makes sense (you can then combine their output files).

      BLAST+ itself has no built in capabilities like this, something like MPI-BLAST does but is based on the legacy C BLAST suite and quite old now.
      Last edited by maubp; 03-15-2013, 03:28 AM. Reason: Fixing touch screen typos

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Advancing Precision Medicine for Rare Diseases in Children
        by seqadmin




        Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
        12-16-2024, 07:57 AM
      • seqadmin
        Recent Advances in Sequencing Technologies
        by seqadmin



        Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

        Long-Read Sequencing
        Long-read sequencing has seen remarkable advancements,...
        12-02-2024, 01:49 PM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 12-17-2024, 10:28 AM
      0 responses
      25 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 12-13-2024, 08:24 AM
      0 responses
      42 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 12-12-2024, 07:41 AM
      0 responses
      28 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 12-11-2024, 07:45 AM
      0 responses
      42 views
      0 likes
      Last Post seqadmin  
      Working...
      X