I am in the process of setting up an NGS core facility. I will be starting with a single HiSeq 1000 with an IlluminaCompute Tier0 analysis server. In a past life, I ran an NGS facility in which we had a "medium-term" storage server and a long-term tape backup system. File sizes have gotten so large that I'm not sure how practical it is to back up data to tape, or to deal with the hassle of writing data to tape and retrieving it if it is needed again in the future.
A few questions for all of you:
1. What data are you keeping?
-- keeping BCL = 330 GB
-- keeping BAM = 330 GB
-- total = 660 GB per run (paired end, 2 x 101 bp)
2. What long-term data storage media are you using?
3. I am a geneticist/biologist, not an IT professional. What would be the easiest solution for me? (At some point, I will be hiring an informaticist/computational biologist.)
4. Would it be easier to store data on external drives?
5. Do any of you back up data and send it to another facility for storage, such as Iron Mountain?
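For a rough sense of scale, the per-run figures above can be projected over a number of runs. This is only a back-of-the-envelope sketch: the run counts, the number of copies kept, and the 1 TB = 1000 GB convention are assumptions for illustration, not figures from my facility.

```python
# Per-run sizes quoted in the post, in gigabytes.
BCL_GB = 330
BAM_GB = 330


def storage_needed_tb(runs, copies=1, keep_bcl=True, keep_bam=True):
    """Estimate total storage in terabytes (1 TB = 1000 GB).

    runs     -- number of sequencing runs to store (hypothetical input)
    copies   -- how many copies of each file are kept (e.g. 2 for an
                on-site copy plus an off-site backup; an assumption)
    keep_bcl -- whether BCL files are retained
    keep_bam -- whether BAM files are retained
    """
    per_run_gb = (BCL_GB if keep_bcl else 0) + (BAM_GB if keep_bam else 0)
    return runs * per_run_gb * copies / 1000


if __name__ == "__main__":
    # The run counts below are made-up examples, not measured throughput.
    for runs in (10, 25, 50):
        both = storage_needed_tb(runs, copies=2)
        bam_only = storage_needed_tb(runs, copies=2, keep_bcl=False)
        print(f"{runs} runs, 2 copies: {both:.1f} TB (BCL+BAM), "
              f"{bam_only:.1f} TB (BAM only)")
```

Dropping the BCL files after conversion halves the estimate, which is why question 1 matters so much for the choice of storage media.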
Any advice you can give would be appreciated.
Thank you,
Michael