Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Luvs_2_sequence
    Junior Member
    • Feb 2014
    • 1

    MiSeq error rate for 600 cycle PE

    Was a bit alarmed at the run for 600 cycle chemistry we did the other day. Error profile shows that along the read, the error rate increases dramatically. Even only 200 cycles into the first read, the error rate has already surpassed 1%
    The error rate for read 2 is similar.

    Is this expected? We were a bit shocked to see this high of any error rate and was wondering if others were seeing that doing 600 cycle runs.

    Let me know
  • nickloman
    Senior Member
    • Jul 2009
    • 355

    #2
    Yes, we are too.

    Comment

    • DJ Flowcell
      Member
      • Apr 2013
      • 13

      #3
      Yeah, we are experiencing the same thing. The chemistry has been improved though, although the cycle time is back up to 5min (up from 4.3min for the v2 kits). By the end of the first 300bp read, for %perfect reads, the 0 error metric is down to about 45%

      Comment

      • WhiteSeal
        Member
        • Jul 2013
        • 14

        #4
        We are seeing a lot of issues with these 600 runs. There is a significant decrease in Qscores around 200 cycles (first read) and then around cycle 450 of the second read (a picture of %>Q30 is attached). We've had a lot of data that just isn't good enough for data analysis.. and I mean a lot of data. Certainly it is not what the kit should return as a result. I mean it is a 2x300 kit. Not a 2x maybe 300 if you are lucky. We've contacted illumina about this and have received compensation for 3 kits already (hoping for more) but has anyone else tried this.

        Comment

        • GenoMax
          Senior Member
          • Feb 2008
          • 7142

          #5
          Drop-off in quality accompanies presence of short inserts (when one would start reading to the adapter on other end). Have you checked/eliminated that possibility for this data set. It seems to happen more often that one would expect.

          Comment

          • NextGenSeq
            Senior Member
            • Apr 2009
            • 482

            #6
            IMHO the v4 chemistry isn't worth the extra money.

            How many reads did you get?

            Comment

            • WhiteSeal
              Member
              • Jul 2013
              • 14

              #7
              @GenoMax: samples were checked on BioAnalyzer to check the length of the fragments (all >700bp) so this should not happen. Also, we have seen this with almost all our 600PE v3 runs on the MiSeq.

              @NextGenSeq: MiSeq run 600PEv3 with 566k clusters gave - according to illumina output - ~14M reads, but in read two at cycle 550 already 5M reads have a Qscore <Q30 (not good), and at cycle 600 12M reads have a Qscore <Q30 (really not good). And then this was a 'good' run.

              Comment

              • nucacidhunter
                Jafar Jabbari
                • Jan 2013
                • 1250

                #8
                The explanation so far leaves two possibilities open. One is library diversity (particularly in the positions with low Q scores) and the other is faulty reagents. Illumine has had some recalls recently. I wonder what was %PF for the run.

                Comment

                • GenoMax
                  Senior Member
                  • Feb 2008
                  • 7142

                  #9
                  @WhiteSeal: Has this data been analyzed and you are sure the sequence (irrespective of Q-scores) looks ok (no adapters etc).

                  Points mentioned by nucacidhunder are very relevant. If the nucleotide diversity drops late into the read then that would also lead to Q-score drops. Since you have been in touch with Illumina tech support if you used reagents that were recalled they should alert you. Wouldn't hurt to proactively check on that possibility too.

                  Comment

                  • avo
                    Member
                    • Sep 2013
                    • 14

                    #10
                    I'm a bit confused by the %Q30 graph shown here. I have seen 600V3 runs on our MiSeq that looked much worse. Yet we generally cluster well above 566k, so that might be a factor. Nevertheless I would be perfectly satisfied to see a Q30 graph like this. Even the example runs I looked at on Illuminas BaseSpace for the 600V3 kit are worse (SmallGenomesBcereusRhodo).

                    Because all our runs met the Illumina specs of a Total % ≥ Q30 of 70%, i never thought about complaining.
                    We observe this for a variety of different samples using TruSeq LT as well as TruSeq PCR-free libraries. Based on the %Base plots and the output of the adapter trimming there shouldn't be a significant problem with the fragment size (>800 bp) either.

                    Despite that we lose bases from the ends of both reads during the quality trimming it doesn't have a major impact on our analysis.

                    Does anyone experience similar issues?

                    Comment

                    • WhiteSeal
                      Member
                      • Jul 2013
                      • 14

                      #11
                      @nucacidhunter: indexing was fine. 76 samples, barcode variation for each base was evenly distributed. So no issue there.

                      @GenoMax: I have not had the opportunity to look into the data myself, but customer is complaining that Qscores at final 50 cycles is well below expected.

                      @avo: I was wondering.. you say you have seen worse and you just trim the data, but with so many reads not making it till the end then what is the use of running a 600PE kit. I (and customer also) expect at least 70% of all data to have a >Q30. In this case 70% is around cycle 550...

                      As for faulty reagents. This was actually a replacement kit from Illumina for one of their faulty ones. Also, I was proactive and complained and replacement has been received to run the pool again.
                      Last edited by WhiteSeal; 09-26-2014, 06:48 AM.

                      Comment

                      • NextGenSeq
                        Senior Member
                        • Apr 2009
                        • 482

                        #12
                        Originally posted by WhiteSeal View Post
                        @GenoMax: samples were checked on BioAnalyzer to check the length of the fragments (all >700bp) so this should not happen. Also, we have seen this with almost all our 600PE v3 runs on the MiSeq.

                        @NextGenSeq: MiSeq run 600PEv3 with 566k clusters gave - according to illumina output - ~14M reads, but in read two at cycle 550 already 5M reads have a Qscore <Q30 (not good), and at cycle 600 12M reads have a Qscore <Q30 (really not good). And then this was a 'good' run.
                        This is really low. Did you spike in PhiX? If so what are those reads like?

                        If they are good it's your library.

                        Comment

                        • GenoMax
                          Senior Member
                          • Feb 2008
                          • 7142

                          #13
                          Originally posted by WhiteSeal View Post
                          @GenoMax: I have not had the opportunity to look into the data myself, but customer is complaining that Qscores at final 50 cycles is well below expected.
                          Should one overly worry about Q-scores, specially for a re-sequencing run (not sure what kind of a run this is)? If a reference genome was available (and you are not trying to call SNP's in that region) the sequence may be still usable (irrespective of the Q-scores). If this was a de novo run then the worry would be justifiable.

                          Comment

                          • pmiguel
                            Senior Member
                            • Aug 2008
                            • 2328

                            #14
                            Originally posted by NextGenSeq View Post
                            IMHO the v4 chemistry isn't worth the extra money.
                            40% more in reagent costs gets you roughly twice the data. Generally see >5 billion >=Q30 bases for a 500 cycle run and >10 billion >=Q30 bases for a 600 cycle run.

                            --
                            Phillip

                            Comment

                            • NextGenSeq
                              Senior Member
                              • Apr 2009
                              • 482

                              #15
                              On the MiSeq we get maybe 24 million versus 20 million for v3. That's not worth it for an extra $400.

                              Comment

                              Latest Articles

                              Collapse

                              • SEQadmin2
                                Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                                by SEQadmin2


                                I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


                                Here are nine questions we think about, in roughly the order they matter, before...
                                06-18-2026, 07:11 AM
                              • SEQadmin2
                                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                                by SEQadmin2


                                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                                ...
                                06-02-2026, 10:05 AM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by SEQadmin2, 06-17-2026, 06:09 AM
                              0 responses
                              26 views
                              0 reactions
                              Last Post SEQadmin2  
                              Started by SEQadmin2, 06-09-2026, 11:58 AM
                              0 responses
                              43 views
                              0 reactions
                              Last Post SEQadmin2  
                              Started by SEQadmin2, 06-05-2026, 10:09 AM
                              0 responses
                              48 views
                              0 reactions
                              Last Post SEQadmin2  
                              Started by SEQadmin2, 06-04-2026, 08:59 AM
                              0 responses
                              49 views
                              0 reactions
                              Last Post SEQadmin2  
                              Working...