Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Real performance of HiSeq 2000

    Hello, what is the 'real world' performance of HiSeq 2000 using 2x100 PE?

    Illumina says 374 million/lane; so 374e6 x 2 x 100 = 74.80 Gb/lane.

    According to Illumina: 80% percentage of Bases > Q30

    74.80*0.80 =~ 60Gb.

    However looking in web pages of sequencing centers the 'real value' ranges between 40-60Gb/lane.

    Thank you very much.

  • #2
    When they say 374 million/lane, they mean 374 millions of 100bp reads, or 187 millions of 100+100 reads. This gives ~ 37 Gb. In our hands, we usually get 30-40 Gb, with Q30 ~90%.
    It is possible to get a bit more, 40-45 Gb, but I would not recommend this.

    Comment


    • #3
      We would normally hope to get 200 million reads from a lane these days. We've had quite a few runs from the Sanger centre lately which have been averaging nearer 100 million though. On a good day you can get 250+ million so the yield you get can vary by a factor of 3 depending on how optimally everything worked. Quality to the end of a 2x100bp run is pretty consistently good these days and we remove very little by quality trimming.

      Comment


      • #4
        Hi simonandrews, thank you.

        When you say 200 million reads, do you mean paired reads (100 + 100) or single reads (100bps)?

        Thanks.

        Comment


        • #5
          200 million clusters, so 400 million reads from a 2x100 paired end run.

          Comment


          • #6
            The numbers Simon is presenting are for well constructed libraries.

            Every library run on a HiSeq does not yield this many reads per lane.

            Comment


            • #7
              Ok I think I understand you now. But please correct me if I am wrong. So if you have 400 million reads from a 2x100 paired end run (from 200 million clusters) ... You would get 40Gb from a lane?

              However your output from a lane can range between 20-50 Gb (100-250 million reads) depending on how optimally everything worked.

              Comment


              • #8
                Originally posted by GenoMax View Post
                The numbers Simon is presenting are for well constructed libraries.
                We only make well constructed libraries

                If only...

                Comment


                • #9
                  Originally posted by cip View Post
                  Ok I think I understand you now. But please correct me if I am wrong. So if you have 400 million reads from a 2x100 paired end run (from 200 million clusters) ... You would get 40Gb from a lane?

                  However your output from a lane can range between 20-50 Gb (100-250 million reads) depending on how optimally everything worked.
                  Output will range from 0 - 50 Gb (specially if you had not QC'ed your library before running it)

                  That aside the estimate you are using is reasonable.

                  Comment

                  Latest Articles

                  Collapse

                  • seqadmin
                    Essential Discoveries and Tools in Epitranscriptomics
                    by seqadmin


                    The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
                    Yesterday, 07:01 AM
                  • seqadmin
                    Current Approaches to Protein Sequencing
                    by seqadmin


                    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                    04-04-2024, 04:25 PM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by seqadmin, 04-11-2024, 12:08 PM
                  0 responses
                  55 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 10:19 PM
                  0 responses
                  51 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 09:21 AM
                  0 responses
                  45 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-04-2024, 09:00 AM
                  0 responses
                  55 views
                  0 likes
                  Last Post seqadmin  
                  Working...
                  X