Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • afalvarez
    Junior Member
    • Apr 2013
    • 5

    Illumina GAIIx sequecing error rate (background)

    Hi guys,

    We are using NGS to detect non-clonal, low frequency mutations, so we are quite concerned about the background the sequencer introduces, meaning the amount of inespecific mutations coming from the sequencing process rather than from our samples.

    In that sense, all the information we've found refers to the error rates coming from base calling (99.9% accuracy for GAIIx, 99.99% for SOLiD, etc.), but nothing about the errors generated during the sequencing process per se (library preparation, cluster generation, SBS...).

    Do you know if there is any experimental data available? For instance, a measure of the mutational load of a known sequence (let's say the phage PhiX genome -usually loaded as a calibration control-) in subsequent technical replicates, or something like that.

    Thanks a lot!
  • nucacidhunter
    Jafar Jabbari
    • Jan 2013
    • 1250

    #2
    Here is a link for an earlier publication: http://genomebiology.com/2011/12/11/R112. There might be more information in papers citing this one.

    Comment

    • Brian Bushnell
      Super Moderator
      • Jan 2014
      • 2709

      #3
      Originally posted by afalvarez View Post
      ...all the information we've found refers to the error rates coming from base calling (99.9% accuracy for GAIIx, 99.99% for SOLiD, etc.)
      I think you should be very wary of that particular source of information

      Comment

      • afalvarez
        Junior Member
        • Apr 2013
        • 5

        #4
        Manufacturer's enthusiasm

        Originally posted by Brian Bushnell View Post
        I think you should be very wary of that particular source of information
        For sure. Anyway, look at what AB claims in the SOLiD 5500 whitepaper:

        "As the results demonstrate, Exact Call Chemistry determines the
        template sequence with extremely high accuracy, the majority
        of base calls achieving accuracy in excess of 99.9999%"

        And 99,99% came from the specification sheet of the very same sequencer...

        Manufacturers are usually too enthusiastic about their products

        Comment

        • afalvarez
          Junior Member
          • Apr 2013
          • 5

          #5
          Originally posted by nucacidhunter View Post
          Here is a link for an earlier publication: http://genomebiology.com/2011/12/11/R112. There might be more information in papers citing this one.
          Thanks!

          As a quick reference to the data that can be found in the paper, these are the substitution rates found in PhiX genome:

          1) HiSeq

          a. 2x95 (4.3e6 pair reads; spiked) --> 0.11% substitution errors
          b. 2x100 (8.87e5 pair reads; spiked) --> 0.12% substitution errors

          2) GAIIx

          2x150 (6.4e6 pair reads; one lane) --> 0.28% substitution errors

          These numbers are worked out as (Total errors) / (Uniquely aligned bases) using B-tail trimmed reads.

          Error rates with different quality filters can be found in Table6, it worths having a look at them as well.

          Comment

          Latest Articles

          Collapse

          • SEQadmin2
            Nine Things a Sample Prep Scientist Thinks About Before Sequencing
            by SEQadmin2


            I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

            Here are nine questions we think about, in roughly the order they matter, before...
            06-18-2026, 07:11 AM
          • SEQadmin2
            From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
            by SEQadmin2


            Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


            The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
            ...
            06-02-2026, 10:05 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by SEQadmin2, 06-26-2026, 11:10 AM
          0 responses
          16 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-17-2026, 06:09 AM
          0 responses
          49 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-09-2026, 11:58 AM
          0 responses
          108 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-05-2026, 10:09 AM
          0 responses
          125 views
          0 reactions
          Last Post SEQadmin2  
          Working...