Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • sklages
    Senior Member
    • May 2008
    • 628

    Data Storage after HiSeq Upgrade

    Hi Folks,

    after the (upcoming) upgrade of the HiSeq the local harddisks are too small to hold the data for the whole run locally; data needs to be written to some kind of external storage devices (e.g. the Illumina-recommended Isilon systems).

    How are you managing the data storage for a running HiSeq?
    Are you using Isilon systems or some home-made solutions (Linux/Windows)?
    The old iPARs (SAS) can only be upgraded to 7.5TB (less than 7TB with RAID6) which is too small ...
    Any experiences and comments on pro/contra of home-made solutions?

    just curious :-)

    Sven
  • HESmith
    Senior Member
    • Oct 2009
    • 512

    #2
    Hi Sven,

    We run our HiSeq 2000 on a Dell T7500 installed with two 2.7 TB hard drives (one for each flow cell), which is sufficient local storage for two PE-101bp runs on each (at least with the current chemistry). We copy to an Isilon system for data storage, and (after compression) backup on external hard drives (an inelegant solution, but cheap).

    Harold

    Comment

    • sklages
      Senior Member
      • May 2008
      • 628

      #3
      Originally posted by HESmith View Post
      Hi Sven,

      We run our HiSeq 2000 on a Dell T7500 installed with two 2.7 TB hard drives (one for each flow cell), which is sufficient local storage for two PE-101bp runs on each (at least with the current chemistry). We copy to an Isilon system for data storage, and (after compression) backup on external hard drives (an inelegant solution, but cheap).

      Harold
      Hi Harold,

      that's how we do it currently, local data storage for one run, copying to a server after the run has finished. But after the upgrade (600G) we get more than 7TB data per run. So we need to write on a dedicated (external) system (most people will prefer commercial solutions from e.g. 'isilon' or 'bluearc'). I am curious about some advantages/pitfalls using non-commercial systems ...

      thanks, Sven

      Comment

      • HESmith
        Senior Member
        • Oct 2009
        • 512

        #4
        Hi Sven,

        I didn't realize that the data size per run would increase that much after upgrading. Even with compression, that's going to fill up most storage systems in relatively short order. Perhaps it's worthwhile to consider cloud computing solutions...

        Harold

        Comment

        • AijazS
          Junior Member
          • Jul 2010
          • 3

          #5
          I am assuming you are wanting to store CIF files on the disk. We configured RTA to delete the CIF files from the instrument after successful transfer to a remote (Isilon) data storage disk. By doing this you possibly wouldnt need large diskspace. 2.7 TB should suffice..

          Comment

          • sklages
            Senior Member
            • May 2008
            • 628

            #6
            Originally posted by AijazS View Post
            I am assuming you are wanting to store CIF files on the disk. We configured RTA to delete the CIF files from the instrument after successful transfer to a remote (Isilon) data storage disk. By doing this you possibly wouldnt need large diskspace. 2.7 TB should suffice..
            Data produced during the run is somewhat around 6-8TB; too much for local storage on the machine itself. Just another error by design :-)
            Deleting files after transfer to whatever system is not a problem (though experience as a sequencing core has tought us to keep more files on disk as may be "necessary") ...

            Sven
            Last edited by sklages; 06-07-2011, 10:04 PM. Reason: TB, not GB :-)

            Comment

            • lletourn
              Member
              • Oct 2009
              • 63

              #7
              We plugged the hiseqs in a Pillar Axiom SAN. Our runs of the v3 kit for 100PE (207cycles, 7 for the index) have an average size of about 4.5Tb, no images, cifs and bcls.

              With the v2 kits we got about 4.1Tb, the 400Gb difference is all in the gzipped fastqs.

              Althought I must admit we haven't pushed the cluster density as high as the V3 allows yet. We still get ~220million reads per lane though.

              We might hit 6Tb when we do...we'll see.

              Comment

              • sklages
                Senior Member
                • May 2008
                • 628

                #8
                Originally posted by lletourn View Post
                We plugged the hiseqs in a Pillar Axiom SAN. Our runs of the v3 kit for 100PE (207cycles, 7 for the index) have an average size of about 4.5Tb, no images, cifs and bcls.

                With the v2 kits we got about 4.1Tb, the 400Gb difference is all in the gzipped fastqs.

                Althought I must admit we haven't pushed the cluster density as high as the V3 allows yet. We still get ~220million reads per lane though.

                We might hit 6Tb when we do...we'll see.
                Interesting .. you only stick with the fastq files, deleting cif/bcl? What if you need to re-basecall or re-convert from bcl for whatever reason?

                You're probably right, with increasing cluster densities you'll get pretty fast to 6GB or more ..

                Comment

                • lletourn
                  Member
                  • Oct 2009
                  • 63

                  #9
                  What I meant was:
                  We don't keep images
                  We *do* keep cifs and bcls, but only for a month or 2.

                  If after a month no problems were seen in the run we delete everything but the fastqs.

                  so my 4.5Tb is cifs+bcls+fastqs

                  Sorry for the confusion.

                  Comment

                  • sklages
                    Senior Member
                    • May 2008
                    • 628

                    #10
                    Ah, .. ok. Now I got it. :-)
                    Same here (except we don't delete). Thanks for clarification ..
                    Last edited by sklages; 06-08-2011, 07:50 AM. Reason: :-)

                    Comment

                    Latest Articles

                    Collapse

                    • SEQadmin2
                      Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                      by SEQadmin2


                      I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                      Here are nine questions we think about, in roughly the order they matter, before...
                      06-18-2026, 07:11 AM
                    • SEQadmin2
                      From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                      by SEQadmin2


                      Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                      The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                      ...
                      06-02-2026, 10:05 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by SEQadmin2, 06-26-2026, 11:10 AM
                    0 responses
                    12 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-17-2026, 06:09 AM
                    0 responses
                    48 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-09-2026, 11:58 AM
                    0 responses
                    106 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-05-2026, 10:09 AM
                    0 responses
                    125 views
                    0 reactions
                    Last Post SEQadmin2  
                    Working...