  • Fancy a peek inside Sanger's Illumina GA Pipeline?



    So it has been brought to my attention that the Sanger Institute has a publicly accessible "stats" page with quite a few statistics about their Illumina short-read pipeline. The stats give a very interesting look into the daily operations of perhaps the highest-throughput genome center in the world (...if I had a nickel for every PM I will get correcting me!).

    Screenshot of the public page, containing a dropdown menu for different stats:


    I have reproduced all the available data below, unchanged (with the exception of blanking out someone's email address) as of this evening. I am hesitant to post the URL only because I don't want to cause undue ruckus, or cost anyone their job. I know these big genome centers are fiercely competitive...

    With that. Enjoy.

    Eighty percent of their 28 Genome Analyzers translates to about 22 of them running all the time!


    Just scanned through Google Analytics, and realized that Sanger sends a fair amount of traffic here...appears they have a link to a popular thread on their intranet! Greetings Sanger-folk!

  • #2
    Graphs

    I see our graphs are getting around.

    Couple of things not clear from them as shown.

    The yields are PF (pass-filter) yields, i.e. from non-overlapping clusters. Typically this is half of all the clusters on a dense chip. Some people quote yields as total bases.

    Per-run numbers are used. For paired-end runs, which are about 90% of the total, two runs need to be summed to give the yield per flowcell.
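
As a rough illustration of the per-run versus per-flowcell point, here is a minimal Python sketch. The record layout (flowcell id, read number, PF yield in gigabases) and all the numbers are invented for the example; none of it comes from the Sanger stats page or pipeline.

```python
from collections import defaultdict

# Hypothetical per-run records: (flowcell id, read number, PF yield in gigabases).
# The ids and yields are made up for illustration only.
runs = [
    ("FC001", 1, 1.5),
    ("FC001", 2, 1.25),  # second run of a paired-end flowcell
    ("FC002", 1, 2.0),   # single-end flowcell, one run only
]


def yield_per_flowcell(run_records):
    """Sum per-run PF yields so that each flowcell gets a single total."""
    totals = defaultdict(float)
    for flowcell, _read, pf_gb in run_records:
        totals[flowcell] += pf_gb
    return dict(totals)


flowcell_yields = yield_per_flowcell(runs)
print(flowcell_yields)
```

With records like these, a paired-end flowcell contributes two rows that collapse into one total, while a single-end flowcell keeps its one per-run figure.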

    Error rates are estimated from control lanes and very often are an average of first and second read rates for a flowcell with 2 runs. Second reads often have worse data quality than first reads (this is being fixed in collaboration with Illumina). Early data is clearly from a very small number of runs with highly variable success rates, hence the mountains. Error bars are not on these graphs, but they would be very broad for early data and very narrow for later data.

    Some of the graphs are under development.

    c.



    • #3


      In fact this article is a little off: the 300 gigabases already submitted is bigger than GenBank.



      • #4
        A new page is coming from Roger...



        • #5



          • #6
            I think Sanger will hit 1 terabase (PF) by the end of June.



            • #7
              ta daaaaa
