Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • HiSeq 4000 : typical RAW data output in GBytes

    Hi,

    can someone provide some info on how much raw data (in terms of GBytes, not Gbases) is generated for some typical HiSeq 4000 WGS runs (150,75, whatever)? How much fastq data (GBytes) is generated roughly/typically?

    We need to plan computing and storage resources accordingly.

    best,
    Sven

  • #2
    From the Illumina FAQ:

    The run folder size for the maximum read length of 2 x 150 is about 0.6 TB.

    Comment


    • #3
      Originally posted by kcchan View Post
      From the Illumina FAQ:
      Sure. For our HiSeq 2000 a few years ago, we had folder sizes up to 4TB; Illumina stated in their FAQ that a run folder is roughly 1TB.

      I am just curious how real-world numbers align with Illumina's FAQ.

      Comment


      • #4
        Some recent run data folder sizes for HiSeq 4000.

        SE 50 bp - 90G
        PE 50 bp - 170G
        PE 75 bp - 250G
        PE 150 bp - 500G

        Comment


        • #5
          GenoMax, thanks for the numbers. That makes me optimistic :-)

          Comment


          • #6
            Should have said that those are raw data folder sizes before bcl2fastq v.2.x conversion.

            Comment


            • #7
              Originally posted by GenoMax View Post
              Should have said that those are raw data folder sizes before bcl2fastq v.2.x conversion.
              That's what I was interested in :-)

              But btw, what's the fastq output for such runs (roughly)?

              Comment


              • #8
                Originally posted by sklages View Post
                That's what I was interested in :-)

                But btw, what's the fastq output for such runs (roughly)?
                Fastq folder size (sample dependent)

                PE 50 bp - ~195 G
                PE 75 bp - ~275 G
                PE 150 bp - ~350-400 G

                Comment


                • #9
                  Are those numbers for gzipped or raw fastqs?

                  Comment


                  • #10
                    For gzipped fastq's.

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Essential Discoveries and Tools in Epitranscriptomics
                      by seqadmin


                      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
                      Yesterday, 07:01 AM
                    • seqadmin
                      Current Approaches to Protein Sequencing
                      by seqadmin


                      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                      04-04-2024, 04:25 PM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, 04-11-2024, 12:08 PM
                    0 responses
                    55 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 04-10-2024, 10:19 PM
                    0 responses
                    52 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 04-10-2024, 09:21 AM
                    0 responses
                    45 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 04-04-2024, 09:00 AM
                    0 responses
                    55 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X