Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • ECO
    --Site Admin--
    • Oct 2007
    • 1360

    Fancy a peek inside Sanger's Illumina GA Pipeline?



    So it has been brought to my attention that the Sanger has a publicly accessible "stats" page that contains quite a few statistics about their Illumina short read pipeline. The stats give a very interesting look into the daily operations of perhaps the highest throughput genome center in the world (...if I had a nickel for every PM I will get correcting me! ).

    Screenshot of the public page, containing a dropdown menu for different stats:


    I have reproduced all the available data below, unchanged (with the exception of blanking out someone's email address) as of this evening. I am hesitant to post the URL only because I don't want to cause undue ruckus, or cost anyone their job. I know these big genome centers are fiercely competitive...

    With that. Enjoy.











































    Eighty percent of the 28 genome analyzers that they have translates to 22 of them running all the time!


    Just scanned through Google Analytics, and realized that Sanger sends a fair amount of traffic here...appears they have a link to a popular thread on their intranet! Greetings Sanger-folk!
  • cgb
    Member
    • May 2008
    • 50

    #2
    Graphs

    I see our graphs are getting around.

    Couple of things not clear from them as shown.

    The yields are PF yields, i.e from non-overlapping clusters. typically this is half of all of the clusters on a dense chip. Some people quote yields as total bases.

    Per run numbers are used, for paired end runs - which are about 90% - two runs needs to be summed to give yield per flowcell.

    Error rates are estimated fro control lanes and very often are an average of first and second read rates for a flowcell with 2 runs. Second reads often have worse data quality that first (this is being fixed in collaboration with illumina). Early data is clearly from a very small number of runs with high variable success rates - hence the mountains - error bars are not on these graphs but the would be very broad for early data, and very narrow for later data.

    Some of the graphs are under development.

    c.

    Comment

    • cgb
      Member
      • May 2008
      • 50

      #3


      in fact this article is a little off, the 300 Gigabases already submitted is bigger than Genbank.

      Comment

      • cgb
        Member
        • May 2008
        • 50

        #4
        a new page is coming fro Roger....

        Comment

        • cgb
          Member
          • May 2008
          • 50

          #5
          I'm sorry, your page cannot be found on this site.

          Comment

          • cgb
            Member
            • May 2008
            • 50

            #6
            I think Sanger will hit 1 Terabase (PF) by the end of june

            Comment

            • cgb
              Member
              • May 2008
              • 50

              #7
              ta daaaaa

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Pathogen Surveillance with Advanced Genomic Tools
                by seqadmin




                The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
                03-24-2025, 11:48 AM
              • seqadmin
                New Genomics Tools and Methods Shared at AGBT 2025
                by seqadmin


                This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

                The Headliner
                The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
                03-03-2025, 01:39 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 03-20-2025, 05:03 AM
              0 responses
              41 views
              0 reactions
              Last Post seqadmin  
              Started by seqadmin, 03-19-2025, 07:27 AM
              0 responses
              51 views
              0 reactions
              Last Post seqadmin  
              Started by seqadmin, 03-18-2025, 12:50 PM
              0 responses
              38 views
              0 reactions
              Last Post seqadmin  
              Started by seqadmin, 03-03-2025, 01:15 PM
              0 responses
              193 views
              0 reactions
              Last Post seqadmin  
              Working...