Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • What would be recommended hardware (computing) for a NGS lab?

    Hi All,

    We are in process of expanding our high-throughput Next Generation Sequencing facility. The sequencer we have. But we are thinking of going for better computing hardware. As far as the budget is concerned we are not running on a 'shoe-string' budget but we will have to justify all the expenses.

    Any pointers to number of nodes, amount of RAM per node, amount of scratch space per node, total storage and any specific pointers are welcome.

    Any suggestions?
    Sameet Mehta (Ph.D.),
    Visiting Fellow,
    National Cancer Insitute,
    Bethesda,
    US.

  • #2
    I think it will help to know how much data you for see generating on a weekly basis? How long will you need to store the data for your users? The amount of memory required by the tools you are using for analysis.

    We are also thinking about setting up hardware and are doing preliminary analysis of our needs (which seems very variable at this point). So, I would like input from the community on this as well.

    Comment


    • #3
      We are looking at about 2 SOLiD runs per week that will generate to the order of 8 TB data per week, but after post-processing it should reduce to about 100 - 200 GB of usable data. Most of the raw (image) data need not be stored for long term, it may be stored for at the most a month after preliminary analysis.

      The tools are basically open source tools like the velvet pipeline and other chip-sep, rna-seq tools that are standard for the SOLiD. The memory requirements are variable, but they require decently high amounts of memory.
      Sameet Mehta (Ph.D.),
      Visiting Fellow,
      National Cancer Insitute,
      Bethesda,
      US.

      Comment


      • #4
        Is SAS hard drive necessary for NGS???

        Is SATA 3Gb/s fast enough for NGS???

        Comment


        • #5
          AFAIK the SAS is preferred because of the sheer size of data that needs to be copied across the compute nodes. I think 3 Gbps is just not enough speed. But as a disclaimer, i am not really an expert and probably experts should comment on this, i am making this comment from some experience.
          Sameet Mehta (Ph.D.),
          Visiting Fellow,
          National Cancer Insitute,
          Bethesda,
          US.

          Comment


          • #6
            Originally posted by sameet View Post
            AFAIK the SAS is preferred because of the sheer size of data that needs to be copied across the compute nodes. I think 3 Gbps is just not enough speed. But as a disclaimer, i am not really an expert and probably experts should comment on this, i am making this comment from some experience.
            What do you think about the Seagate 7200RPM SAS 6Gb/s hard drives? Are they better than their SATA 3Gb/s equivalent?

            Or do you think everything should be 15000RPM SAS 6Gb/s in NGS? No place for slow HDD???

            Comment


            • #7
              Building a computer. Please advise

              Hi I am building a system to do SNP calling from raw data and SNP imputation and maybe GWAS also. This is the system I have in mind:

              2 x Intel Xeon E5520
              2 x Intel BXSTS100C 130W CPU Fan for Xeon 5500s
              Intel S5500HCV Board
              3 x Kingston 8GB DDR3-1333 Registered ECC
              LSI Mega RAID SAS 9260-8i
              OS/Apps Storage (RAID0, 2 stripes)
              2 x OCZ 30GB SATA II 64MB Cache SSD OCZSSD2-1VTX30G
              Main Storage (RAID10, 2 stripes, 2 mirrors)
              4 x Hitachi 1TB HDS721010CLA332 SATA II/32MB HDD
              Swap Drive (RAID0, 2 stripes)
              2 x Hitachi 300GB HUS153030VLS300 15K SAS HDD
              Enermax FMA II 535W EG565P-VE DXX 2.2 ATX
              Chenbro CA-SR20964 4-Bays E-ATX Server Case
              -----------------------------------------------
              Total US$4,800

              Do you think this system is an overkill for my purposes? If you were me, how would you fix it? Thanks in advance!

              Comment


              • #8
                Do people want to keep hardware discussions in this forum? Do we need a hardware/infrastructure subforum?

                Comment


                • #9
                  Merged ymc's separate thread with this. Seems I got an answer to my own question.

                  Comment


                  • #10
                    I also would like to set up our own analysis platform.
                    More detailed information about the whole hardware system provided by company is greatly appreciated.

                    Comment


                    • #11
                      Originally posted by sameet View Post
                      Hi All, We are in process of expanding our high-throughput Next Generation Sequencing facility. The sequencer we have. But we are thinking of going for better computing hardware. As far as the budget is concerned we are not running on a 'shoe-string' budget but we will have to justify all the expenses. Any pointers to number of nodes, amount of RAM per node, amount of scratch space per node, total storage and any specific pointers are welcome. Any suggestions?
                      As an aside, don't forget that without good staff members to actually analyse the data, a big compute system won't be worth much to you.

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Essential Discoveries and Tools in Epitranscriptomics
                        by seqadmin




                        The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                        04-22-2024, 07:01 AM
                      • seqadmin
                        Current Approaches to Protein Sequencing
                        by seqadmin


                        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                        04-04-2024, 04:25 PM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, Today, 08:47 AM
                      0 responses
                      10 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-11-2024, 12:08 PM
                      0 responses
                      60 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-10-2024, 10:19 PM
                      0 responses
                      57 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-10-2024, 09:21 AM
                      0 responses
                      53 views
                      0 likes
                      Last Post seqadmin  
                      Working...
                      X