  • How to build a server (hardware requirements)?

    Hello,
    I have been analysing my NGS (Illumina) microbiome data on a 4-core i5 Windows desktop with a 1TB disk, using the Bio-Linux package in a VM (virtual machine), and have hit a snag due to limited virtual memory for alignment.

    So we have decided to build a better machine suited to NGS analysis. I am not sure about the hardware requirements. Our budget is modest since I am trying this for the first time.

    I would appreciate any help with this query.

    Thank you
    Naren

  • #2
    1 TB is actually pretty big, but 4 cores is really small. What kind of budget do you have?



    • #3
      It is a desktop with 1TB HDD (SSD)...

      I would like to see a core i5 desktop supporting 1TB of RAM....

      I think here a user has 4GB - 32GB of RAM and 1TB of HDD/SSD...



      • #4
        Originally posted by Markiyan
        I would like to see a core i5 desktop supporting 1TB of RAM....

        I think here a user has 4GB - 32GB of RAM and 1TB of HDD/SSD...
        Ahh, good point! Also, Naren, it would be helpful if you could mention your sequencing platform, how many gigabases of data you want to use in a typical assembly, and what kind of microbiome you're sequencing. Please also give the actual numbers for physical RAM on your current machine versus hard-drive space; the two are completely different for most purposes.
        Last edited by Brian Bushnell; 05-19-2017, 03:06 AM.
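For anyone unsure how to get those numbers, here is a minimal sketch for a Linux system (such as the Bio-Linux VM mentioned above). It assumes `/proc/meminfo` is available; the helper names `kb_to_gib` and `meminfo_kb` are made up for this example. Note that inside a VM it reports the memory assigned to the VM, not the host's.

```shell
#!/usr/bin/env bash
# Report physical RAM, swap and free disk space as separate numbers.

# Convert a kB count (the unit used in /proc/meminfo) to whole GiB, rounded down.
kb_to_gib() {
    echo $(( ${1:-0} / 1024 / 1024 ))
}

# Print one field (e.g. MemTotal or SwapTotal) from a meminfo-style file, in kB.
meminfo_kb() {
    local field="$1" file="${2:-/proc/meminfo}"
    awk -v f="$field:" '$1 == f { print $2 }' "$file"
}

if [ -r /proc/meminfo ]; then
    echo "Physical RAM: $(kb_to_gib "$(meminfo_kb MemTotal)") GiB"
    echo "Swap:         $(kb_to_gib "$(meminfo_kb SwapTotal)") GiB"
fi

# Disk space on the filesystem holding the current directory.
df -h .
```

The RAM line is the figure that matters for alignment; the `df` line is the hard-drive space.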



        • #5
          Thank you, Markiyan and Brian.
          Markiyan is right. My desktop has 12GB RAM, a 1TB hard disk and a quad-core processor (i5).

          Our budget is only between 1000 and 1500 USD (I know that is on the cheap side, but this range is all we can afford for now).

          The microbiome we work with is a soil microbiome. I use the SILVA database (~10GB) as the reference, and my data set is not very large: I have around 80,000 sequences to align against the reference.

          In the future we may decide to do more NGS analysis as well.

          I appreciate your reply.

          Thank you
          Naren



          • #6
            Virtual memory (swap space) is configurable in Linux. It is usually sized at initial setup, although a swap file can also be added later.
            Often only a little virtual memory is configured, because once the system is relying on it, it is usually struggling and a hardware upgrade is the better fix.
            If you're using a virtual machine on Windows, you might want to see if you can dual-boot instead.

            It looks like a 2TB SSD is in your budget. You can install Linux on it and (via your BIOS) boot from that device. Set up Linux with lots of virtual memory, perhaps most of the SSD. (You'll need a little disk space for the OS and /tmp.)
            To read the existing hard disk (formatted as NTFS for Windows), you'll need to install and configure NTFS support.
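As a rough illustration of the swap setup, a swap file can also be added to a running Linux system with standard util-linux tools. The sketch below only prints the commands, since they need root; the function name `swapfile_commands`, the 16G size and the `/swapfile` path are assumptions for this example, not recommendations from the thread. On some filesystems `fallocate` is not suitable for swap files, so treat this as a starting point.

```shell
#!/usr/bin/env bash
# Print the commands that would create and enable a swap file.
# Nothing is executed here; review the output, then run it as root.
swapfile_commands() {
    local size="${1:-16G}" path="${2:-/swapfile}"
    cat <<EOF
fallocate -l $size $path
chmod 600 $path
mkswap $path
swapon $path
EOF
}

swapfile_commands 16G /swapfile
```

Afterwards, `swapon --show` lists the active swap areas. Swap on an SSD is still far slower than RAM, so this is a workaround for occasional overflows and does not replace physical memory.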



            • #7
              Just to clarify - is this shotgun sequencing or 16S amplicon sequencing?



              • #8
                Brian,
                It is 16S amplicon sequencing.

                Thank you
                Naren



                • #9
                  Originally posted by Naren
                  Brian,
                  It is 16S amplicon sequencing.

                  Thank you
                  Naren
                  Oh, perfect! Well, before you go out and buy a new computer, please try this:

                  1) Download BBMap
                  2) Run this command:
                  sendsketch.sh in=reads.fq out=results.txt mode=sequence silva size=1000000

                  This assumes you've already merged your reads; if not, you should run something like:

                  bbmerge.sh in1=read1.fq in2=read2.fq out=merged.fq outu=unmerged.fq

                  ... and just use the merged reads.

                  This uses an alignment-free, k-mer-based 16S comparison method. I am eager to hear what kind of results you get; in my tests, it works really well, and it takes only a second for a single sequence. I'm not sure how long it takes for 80k ribosomal sequences, so it would be great if you could tell me.
                  Last edited by Brian Bushnell; 05-19-2017, 02:56 PM.
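The two steps above can be chained in a small wrapper. This is only a sketch: it reuses exactly the flags quoted in the post, the file names are placeholders, and the helper names (`merge_cmd`, `sketch_cmd`, `run_or_print`) and the `RUN` variable are invented for this example. By default it prints the commands; with `RUN=1` it executes them, which assumes the BBMap scripts are on your `PATH`.

```shell
#!/usr/bin/env bash
# Dry-run wrapper: merge paired reads, then sketch the merged reads against SILVA.

# Build the bbmerge.sh command for a pair of read files.
merge_cmd() {
    echo "bbmerge.sh in1=$1 in2=$2 out=merged.fq outu=unmerged.fq"
}

# Build the sendsketch.sh command for an input file and a results file.
sketch_cmd() {
    echo "sendsketch.sh in=$1 out=$2 mode=sequence silva size=1000000"
}

# Print the command, or execute it when RUN=1 is set in the environment.
run_or_print() {
    if [ "${RUN:-0}" = "1" ]; then
        sh -c "$1"
    else
        echo "$1"
    fi
}

run_or_print "$(merge_cmd read1.fq read2.fq)"
run_or_print "$(sketch_cmd merged.fq results.txt)"
```

Only `merged.fq` is passed to the second step, matching the advice to just use the merged reads.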



                  • #10
                    I would recommend at least a 12-core Xeon and a 1.5 TB SSD. You can also get a high-end GPU, but not many programs make use of one. Most Xeons have no integrated GPU, so you may have to get a separate GPU anyway if you'll use the computer for other tasks. All this might set you back around $3000.
                    Last edited by rnablix; 07-12-2018, 02:39 AM.


                    • #11
                      I would suggest learning about cloud computing on a platform such as Amazon Web Services or the like. The days of needing to build hardware infrastructure are over. Just build up your own AMI (Amazon machine image), then spin it up on demand. You can provision whatever hardware configuration you need for any given task. Storing your data on Amazon S3 is probably convenient too for many reasons.



                      • #12
                        Hmm... What about AMD Ryzen 7 + 16 GB RAM + SSD?


                        • #13
                          Originally posted by dericklineker
                          Hmm... What about AMD Ryzen 7 + 16 GB RAM + SSD?
                          That is what I am using (with 2 x 16 GB RAM); excellent performance for the price.



                          • #14
                            Originally posted by dericklineker
                            Hmm... What about AMD Ryzen 7 + 16 GB RAM + SSD?
                            RAM is often the limiting factor in bioinformatics computing. I would not recommend buying a computer for bioinformatics with only 16 GB RAM unless it will be dedicated to a specific program that does not need much memory. SSDs are typically much less important. If you can't afford both more RAM and an SSD, I'd suggest dropping the SSD. Spinning disks are fine and offer much more capacity per unit cost anyway.



                            • #15
                              Unfortunately, prices for DDR4 RAM have gone through the roof, so expect to pay top dollar. Two 16 GB desktop sticks cost about $359 in the US (plus tax) as I write this.
