Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • HiSeq 2000 File Sizes

    I was wondering if anyone that is working with a HiSeq 2000 could provide some general file size estimates. I am looking for total file sizes
    - per lane, per flow-cell, or per run
    - intensity files, .bcl files, FASTq files, and/or BAM files
    - for 50, 75, and 100 bp PE and SE runs.

    Thanks,

  • #2
    Attached is the site prep guide. It doesn't give the file sizes you want but does recommend computing specs and archiving space.

    Have you gotten your HiSeq? I have heard that it can take 6 months or more. Our official ship date is over 3 months from placing the order.
    Attached Files

    Comment


    • #3
      I have a booklet from a user group meeting that lists:


      Hiseq 200G run
      Image Data 32TB (not tfrd)
      Intensity data 2TB (optionally tfrd)
      BaseCall / Quality score data .25 TB
      Final Alignment output 1.2 TB

      GAIIx 50G run
      Image Data 5.6TB (optionally tfrd)
      Intensity data .5 TB (optionally tfrd)
      BaseCall / Quality score data .06 TB
      Final Alignment output .3 TB

      Comment


      • #4
        Our first HiSeq is being validated now

        For 1 36X36 flowcell:

        48G per lane .cif data (Intensities/L00X folders)
        3.3G per lane of _pos.txt files
        200M per lane of .filter files
        6.1G per lane of .bcl data (BaseCalls/L00x folders)
        40G per lane of _qseq.txt files

        Size of BaseCalls folder after BclConverter: 193G
        Total size (Intensities/BaseCalls) 728G

        You don't have to keep the .cif and the .bcl files, but I wanted to test if running Bustard from .cif gave the same results as Bclconverter -- it did almost exactly.
        Christine Brennan
        UM DNA Sequencing Core
        Ann Arbor, MI 48109

        [email protected]

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin




          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
          04-22-2024, 07:01 AM
        • seqadmin
          Current Approaches to Protein Sequencing
          by seqadmin


          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
          04-04-2024, 04:25 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Yesterday, 08:47 AM
        0 responses
        16 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-11-2024, 12:08 PM
        0 responses
        60 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 10:19 PM
        0 responses
        60 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 09:21 AM
        0 responses
        54 views
        0 likes
        Last Post seqadmin  
        Working...
        X