Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #16
    Originally posted by dawe View Post
    Agree. Also, the bottleneck will always be I/O.
    Not sure what kind of processing you're doing, but for most sequencing stuff, it would appear that IO shouldn't be the bottleneck, at least until you're into large scale clusters / grids.

    If you're disk throughput bound, add spindles. If it's disk IOPS, SSDs make a massive difference (and IOPS-bound data tends to be relatively small). If it's networking, go 10Gbit.

    Comment


    • #17
      Originally posted by tonybolger View Post
      I suspect the 80:20 rule (or more like 99:1 rule) will be the saviour here - you don't need to optimize the performance of all the code, just the key nasty part that takes up the vast majority of the execution time.
      For all BWT-based mappers, "the key nasty part" is to count the occurrences in BWT. This may take 80% CPU time. For BWA, another nasty part is to maintain a priority queue used by BFS.

      Comment


      • #18
        I think the problem is that GPU's don't scale very well, and consume too much electricity. IE most people who do bioinformatics have access to a cluster, and seldom is it cost effective to enable GPU's on a cluster even when the cluster motherboards come with them installed, since the GPU will use much more power than the CPU itself, and for relatively little benefit, and for the most part alignment jobs are embarrassingly parallel, so they scale with well with the number of processors that are added.

        GPU's are for kids and noobs, IMO.

        Comment


        • #19
          BGI is reporting substantial speed ups from their GPU-enabled aligner and SNP detector.

          Comment


          • #20
            Originally posted by gregg.tehennepe View Post
            BGI is reporting substantial speed ups from their GPU-enabled aligner and SNP detector.

            See there it takes a whole compute farm of GPU's and it still takes six hours, and probably uses more watts too.

            Comment


            • #21
              Originally posted by dukzilla View Post
              Thanks Nils. I thought there was a standard such as OpenGL or CUDA that would be cross platform/GPU cards so that such code may be portable across GPUs.

              @Richard, does this mean than increasing RAM has more realized improvements than optimized/parallelized code?
              There is a standard. It is OpenCL and most hardware and operating systems support it. In fact MacOSX is built using OpenCL. While universal, the trouble with OpenCL is that it is not as fast as hardware optimized technologies such as CUDA.

              I personally am a fan of OpenCL and I believe even the new processors such as Knights Corner will utilize OpenCL to farm threads to the coprocessor.

              Comment


              • #22
                icic. Looks like it is not yet the right time to buy a 3GB GTX 580

                Comment


                • #23
                  Those interested could also have a look at

                  MummerGPU
                  Download MUMmerGPU for free. MUMmerGPU is a high-throughput DNA sequence alignment program that runs on nVidia G80-class GPUs. It aligns sequences in parallel on the video card to accelerate the widely used serial CPU program MUMmer.


                  SARUMAN short read mapper

                  Comment


                  • #24
                    You can use a 3GB GTX 580 with BarraCUDA, other than that you just need a standard workstation with 4GB RAM and some disk space.

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Current Approaches to Protein Sequencing
                      by seqadmin


                      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                      04-04-2024, 04:25 PM
                    • seqadmin
                      Strategies for Sequencing Challenging Samples
                      by seqadmin


                      Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                      03-22-2024, 06:39 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, 04-11-2024, 12:08 PM
                    0 responses
                    27 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 04-10-2024, 10:19 PM
                    0 responses
                    30 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 04-10-2024, 09:21 AM
                    0 responses
                    26 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 04-04-2024, 09:00 AM
                    0 responses
                    52 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X