  • GPU CUDA BLAST Efficiency

    I currently have the following hardware:

    7x CPU
    16 GB memory

    and a GeForce GTX 670 GPU

    I have been testing CUDA BLAST. As far as I could find, the database can be set up with three different parameters:

    -gpu_blocks
    -gpu_threads
    -max_file_sz

    I had a script that tried various combinations of these settings and then ran BLAST. The run time for an all-vs-all search of 1308 proteins varied from 55 seconds down to 19 seconds. How can I find the best setting without trying every possible combination?

    I should have 1344 CUDA Cores on the GPU.
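A minimal sketch of the kind of sweep script described above, in Python. The three flag names come from the post; everything else is an assumption for illustration: the helper names `time_command`, `measure_blast`, and `sweep` are hypothetical, the `flag value` argument form is a guess, and the command prefix is a harmless placeholder that must be replaced with the real CUDA BLAST invocation.

```python
import itertools
import subprocess
import sys
import time


def time_command(cmd):
    """Run cmd (a list of strings) and return its wall-clock time in seconds."""
    start = time.perf_counter()
    subprocess.run(cmd, check=True, stdout=subprocess.DEVNULL)
    return time.perf_counter() - start


def measure_blast(blocks, threads, max_file_sz):
    """Time one run with the given settings (hypothetical command line)."""
    # Placeholder prefix so the sketch runs anywhere; replace it with
    # your actual database-setup and search commands.
    prefix = [sys.executable, "-c", "pass"]
    # Assumed argument form: "-flag value". Check your tool's usage text.
    flags = ["-gpu_blocks", str(blocks),
             "-gpu_threads", str(threads),
             "-max_file_sz", str(max_file_sz)]
    return time_command(prefix + flags)


def sweep(measure, blocks, threads, file_sizes):
    """Time every combination; return (seconds, settings) pairs, fastest first."""
    results = []
    for combo in itertools.product(blocks, threads, file_sizes):
        results.append((measure(*combo), combo))
    return sorted(results)
```

Calling `sweep(measure_blast, [64, 128], [256, 512], [1, 10])` would time all eight combinations. Since single timings are noisy, repeating each run and keeping the median is probably worthwhile before trusting small differences.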

  • #2
    Hi jjk - I can't answer your questions, but I was wondering what your benchmark time is with standalone blast without the GPU.

    Just a guess, but I would think that -gpu_threads should be fixed at 1344, since that is how many cores you have. That would reduce the number of combinations you have to search. You could do a grid search or, since you have three parameters, a cube search. Picture the search space as a cube: start in a smaller sub-cube and exhaustively test every combination inside it. If the optimum lands on an edge of that sub-cube, search the adjacent region next, and continue until you reach an optimum that is not on an edge. That may save you from searching the whole space.

    You could also try a genetic algorithm...and there are other optimization strategies out there.
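The expanding cube search suggested above could be sketched roughly as follows. This is only an illustration under assumptions: `cube_search` and `measure` are hypothetical names, `measure(blocks, threads, max_file_sz)` is assumed to return a run time in seconds, and each parameter is assumed to be limited to a sorted list of candidate values.

```python
import itertools


def cube_search(measure, axes, start, radius=1):
    """Local search over a 3-D (or N-D) grid of candidate settings.

    axes:   one sorted list of candidate values per parameter.
    start:  starting index into each axis.
    radius: half-width of the sub-cube, in grid steps.
    Returns (best_values, best_time).
    """
    cache = {}

    def cost(idx):
        # Each grid point is measured once and then reused.
        if idx not in cache:
            cache[idx] = measure(*(axis[i] for axis, i in zip(axes, idx)))
        return cache[idx]

    center = tuple(start)
    while True:
        # Sub-cube around the current center, clipped to the grid bounds.
        ranges = [range(max(0, c - radius), min(len(axis), c + radius + 1))
                  for axis, c in zip(axes, center)]
        best = min(itertools.product(*ranges), key=cost)
        if cost(best) >= cost(center):
            break  # nothing in the sub-cube beats the center
        center = best  # move the sub-cube toward the better point
    return tuple(axis[i] for axis, i in zip(axes, center)), cost(center)
```

With `radius=1`, any better point is necessarily on the edge of the sub-cube, so the search keeps shifting until the best point sits in the middle. Like any local search it can stop at a local optimum, and noisy timings can mislead it, so it is a way to cut down the number of runs rather than a guarantee of the global best.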

    • #3
      I ran into similar issues. I found that the best speed was obtained by maxing out -gpu_threads at 1024, reducing the file size to 1 MB, and then finding the largest -gpu_blocks value permitted by the memory limits of the GPU.
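One possible way to find the largest workable -gpu_blocks value without trying every number is a doubling probe followed by a binary search. The sketch below assumes a hypothetical predicate `fits(n)` that returns True when a run with n blocks succeeds and False when it fails (for example, because the GPU runs out of memory), and it assumes that once a value fails, all larger values fail too. The name `largest_feasible` and the upper cap are illustrative, not part of CUDA BLAST.

```python
def largest_feasible(fits, lo=1, hi_limit=1 << 16):
    """Return the largest n in [lo, hi_limit] with fits(n) True.

    Assumes fits is monotone: True up to some threshold, False above it.
    Returns None if even lo does not fit. hi_limit is an arbitrary cap.
    """
    if lo < 1:
        raise ValueError("lo must be at least 1")
    if not fits(lo):
        return None

    good, bad = lo, None
    # Phase 1: double the probe until a run fails or the cap is passed.
    probe = lo * 2
    while probe <= hi_limit:
        if fits(probe):
            good = probe
            probe *= 2
        else:
            bad = probe
            break

    if bad is None:
        # No failure seen yet: test the cap itself.
        if good == hi_limit or fits(hi_limit):
            return hi_limit
        bad = hi_limit

    # Phase 2: binary search between a known-good and a known-bad value.
    while bad - good > 1:
        mid = (good + bad) // 2
        if fits(mid):
            good = mid
        else:
            bad = mid
    return good
```

The number of test runs grows with the logarithm of the answer, so a limit in the hundreds takes on the order of twenty runs rather than hundreds.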

      One caveat: I have come to the conclusion that, for BLAST, a single GPU is no faster than 30-60 CPU cores, and probably fewer than that.

      Best of luck

      Andrew
