  • sameet
    Member
    • Apr 2010
    • 34

    What computing hardware would be recommended for an NGS lab?

    Hi All,

    We are in the process of expanding our high-throughput Next Generation Sequencing facility. We already have the sequencer, but we are thinking of getting better computing hardware. As far as the budget is concerned, we are not on a shoestring, but we will have to justify all the expenses.

    Any pointers on the number of nodes, the amount of RAM per node, the amount of scratch space per node, and the total storage, or any other specific advice, are welcome.

    Any suggestions?
    Sameet Mehta (Ph.D.),
    Visiting Fellow,
    National Cancer Institute,
    Bethesda,
    US.
  • thinkRNA
    Member
    • Jan 2010
    • 94

    #2
    I think it would help to know how much data you foresee generating on a weekly basis, how long you will need to store the data for your users, and how much memory the analysis tools you use require.

    We are also thinking about setting up hardware and are doing a preliminary analysis of our needs (which seem very variable at this point). So I would like input from the community on this as well.


    • sameet
      Member
      • Apr 2010
      • 34

      #3
      We are looking at about 2 SOLiD runs per week, which will generate on the order of 8 TB of data per week; after post-processing this should reduce to about 100 - 200 GB of usable data. Most of the raw (image) data does not need to be stored long term; it may be kept for at most a month after preliminary analysis.

      The tools are mostly open source, such as the Velvet pipeline and the ChIP-seq and RNA-seq tools that are standard for the SOLiD. Their memory requirements vary, but they generally need fairly large amounts of memory.
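The figures in this post can be turned into a rough steady-state storage estimate. The sketch below is only an illustration. It takes the 8 TB of raw data per week and the one-month raw retention from the post, and reads the 100 - 200 GB of usable data as a weekly figure. It also assumes, which the thread does not state, that a month is 4 weeks and that processed data is kept for 52 weeks. The function name `storage_estimate_tb` is hypothetical.

```python
def storage_estimate_tb(raw_tb_per_week, raw_retention_weeks,
                        processed_gb_per_week, processed_retention_weeks):
    """Return (raw_tb, processed_tb, total_tb) held on disk at steady state.

    Uses decimal units (1 TB = 1000 GB) and ignores filesystem,
    RAID, and backup overhead.
    """
    raw_tb = raw_tb_per_week * raw_retention_weeks
    processed_tb = processed_gb_per_week * processed_retention_weeks / 1000.0
    return raw_tb, processed_tb, raw_tb + processed_tb


if __name__ == "__main__":
    RAW_TB_PER_WEEK = 8             # from the post
    RAW_RETENTION_WEEKS = 4         # assumption: "a month" taken as 4 weeks
    PROCESSED_RETENTION_WEEKS = 52  # assumption: not stated in the thread

    for processed_gb in (100, 200):  # range given in the post
        raw, processed, total = storage_estimate_tb(
            RAW_TB_PER_WEEK, RAW_RETENTION_WEEKS,
            processed_gb, PROCESSED_RETENTION_WEEKS)
        print(f"{processed_gb} GB/week processed: raw {raw:.1f} TB, "
              f"processed {processed:.1f} TB, total {total:.1f} TB")
```

Changing the two retention arguments is the quickest way to see how much the estimate depends on how long the raw image data is kept.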


      • ymc
        Senior Member
        • Mar 2010
        • 496

        #4
        Are SAS hard drives necessary for NGS?

        Is SATA 3Gb/s fast enough for NGS?


        • sameet
          Member
          • Apr 2010
          • 34

          #5
          AFAIK, SAS is preferred because of the sheer volume of data that needs to be copied across the compute nodes. I think 3 Gbps is just not fast enough. As a disclaimer, I am not really an expert, and experts should probably comment on this; I am speaking only from some experience.


          • ymc
            Senior Member
            • Mar 2010
            • 496

            #6
            Originally posted by sameet
            AFAIK, SAS is preferred because of the sheer volume of data that needs to be copied across the compute nodes. I think 3 Gbps is just not fast enough. As a disclaimer, I am not really an expert, and experts should probably comment on this; I am speaking only from some experience.
            What do you think about the Seagate 7200RPM SAS 6Gb/s hard drives? Are they better than their SATA 3Gb/s equivalents?

            Or do you think everything should be 15000RPM SAS 6Gb/s for NGS? Is there no place for slower HDDs?
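One way to frame the interface question is to compare the link's payload rate with what a single drive can sustain. The sketch below is a back-of-the-envelope calculation, not a benchmark. It relies on the fact that 3 Gb/s SATA and 6 Gb/s SAS links use 8b/10b encoding, so about 80% of the line rate carries data. The sustained rate of 120 MB/s for a 7200 RPM drive is a placeholder assumption, not a figure from this thread, and the 8 TB size is the weekly raw volume mentioned earlier. The function names are hypothetical.

```python
def link_payload_mb_s(line_rate_gbit_s, efficiency=0.8):
    """Payload rate in MB/s of a serial link.

    efficiency=0.8 models 8b/10b encoding (10 bits on the wire
    for every 8 bits of data).
    """
    return line_rate_gbit_s * 1000 * efficiency / 8


def copy_hours(size_tb, rate_mb_s):
    """Hours needed to move size_tb (decimal TB) at rate_mb_s."""
    return size_tb * 1_000_000 / rate_mb_s / 3600


ASSUMED_DISK_MB_S = 120  # assumption: sustained rate of one 7200 RPM drive
WEEKLY_RAW_TB = 8        # from the thread

if __name__ == "__main__":
    for name, gbit in (("SATA 3Gb/s", 3), ("SAS 6Gb/s", 6)):
        link = link_payload_mb_s(gbit)
        # A single drive cannot go faster than the slower of link and platter.
        effective = min(link, ASSUMED_DISK_MB_S)
        print(f"{name}: link {link:.0f} MB/s, effective {effective:.0f} MB/s, "
              f"{copy_hours(WEEKLY_RAW_TB, effective):.1f} h for "
              f"{WEEKLY_RAW_TB} TB")
```

If the assumed drive rate is below both link rates, the two interfaces give the same result for a single spinning disk, and the link speed only starts to matter for faster devices or for several drives sharing one link.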


            • ymc
              Senior Member
              • Mar 2010
              • 496

              #7
              Building a computer. Please advise

              Hi, I am building a system to do SNP calling from raw data, SNP imputation, and maybe GWAS as well. This is the system I have in mind:

              2 x Intel Xeon E5520
              2 x Intel BXSTS100C 130W CPU Fan for Xeon 5500s
              Intel S5500HCV Board
              3 x Kingston 8GB DDR3-1333 Registered ECC
              LSI MegaRAID SAS 9260-8i
              OS/Apps Storage (RAID0, 2 stripes):
                2 x OCZ 30GB SATA II 64MB Cache SSD OCZSSD2-1VTX30G
              Main Storage (RAID10, 2 stripes, 2 mirrors):
                4 x Hitachi 1TB HDS721010CLA332 SATA II/32MB HDD
              Swap Drive (RAID0, 2 stripes):
                2 x Hitachi 300GB HUS153030VLS300 15K SAS HDD
              Enermax FMA II 535W EG565P-VE DXX 2.2 ATX
              Chenbro CA-SR20964 4-Bays E-ATX Server Case
              -----------------------------------------------
              Total US$4,800

              Do you think this system is overkill for my purposes? If you were me, how would you change it? Thanks in advance!
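The usable capacity of the three arrays in the parts list follows from their RAID levels. The sketch below assumes nominal decimal drive sizes and ignores controller and filesystem overhead. `usable_gb` is a hypothetical helper written for this illustration, not part of any RAID tool.

```python
def usable_gb(level, n_drives, drive_gb):
    """Nominal usable capacity in GB of an array of identical drives."""
    if level == "RAID0":
        # Striping only: all capacity is usable, with no redundancy.
        return n_drives * drive_gb
    if level == "RAID10":
        # Striped mirrors: every drive has a mirror, so half is usable.
        if n_drives < 4 or n_drives % 2:
            raise ValueError("RAID10 needs an even number of drives, at least 4")
        return n_drives * drive_gb // 2
    raise ValueError(f"unsupported RAID level: {level}")


# Arrays as listed in the post: (label, level, drive count, GB per drive)
ARRAYS = [
    ("OS/Apps storage", "RAID0", 2, 30),
    ("Main storage", "RAID10", 4, 1000),
    ("Swap drive", "RAID0", 2, 300),
]

if __name__ == "__main__":
    for label, level, n, size in ARRAYS:
        print(f"{label}: {level}, {n} x {size} GB -> "
              f"{usable_gb(level, n, size)} GB usable")
    print("Total drives:", sum(n for _, _, n, _ in ARRAYS))
```

Because RAID0 has no redundancy, losing either drive in the OS/Apps or swap array loses that whole array, while the RAID10 main storage survives the loss of one drive in each mirror pair.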


              • ECO
                --Site Admin--
                • Oct 2007
                • 1360

                #8
                Do people want to keep hardware discussions in this forum? Do we need a hardware/infrastructure subforum?


                • ECO
                  --Site Admin--
                  • Oct 2007
                  • 1360

                  #9
                  Merged ymc's separate thread with this. Seems I got an answer to my own question.


                  • drlukun
                    Junior Member
                    • Jul 2009
                    • 2

                    #10
                    I would also like to set up our own analysis platform.
                    More detailed information about complete hardware systems offered by vendors would be greatly appreciated.


                    • Torst
                      Senior Member
                      • Apr 2008
                      • 275

                      #11
                      Originally posted by sameet
                      Hi All, We are in the process of expanding our high-throughput Next Generation Sequencing facility. We already have the sequencer, but we are thinking of getting better computing hardware. As far as the budget is concerned, we are not on a shoestring, but we will have to justify all the expenses. Any pointers on the number of nodes, the amount of RAM per node, the amount of scratch space per node, and the total storage, or any other specific advice, are welcome. Any suggestions?
                      As an aside, don't forget that without good staff members to actually analyse the data, a big compute system won't be worth much to you.

