Hey, any one came across the upper boundary of desired cluster density on Hiseq2000. Above what level of clusters there will not be any intensity detection?
Unconfigured Ad
Collapse
X
-
-
You seriously have to overload to get no data at all.
What happens as you increase concentration is that more and more reads (as a percentage of raw reads) fail the filters and you also start to get quality problems. The Q-score is based a number of statistics, some of which depend on being able to distinguish clusters from each other (which obviously is more difficult the more clusters there are).
Optimum cluster density is around 800-850k/mm^2 according to Illumina, but we've had over 1M/mm^2 and still got data loads of data out at the end with acceptable Q-scores. I guess some of that also depends on the nature of the library - a more biased or lower diversity library may cause problems at high density.
Routinely, I'd aim for 800k/mm^2 as you should get decent data regardless of the library.
Comment
-
-
Thanks to all for the comments. basically we got cluster density in the range of 1100k+/-50/mm-2 on hiseq2000 with flw cell v3 and got 0% pass filter for all the sample except one with32% PF and PF reads 102M. samples dilution of10pM was used and Phix 8pM. Phix gave cluster density of 350k/mm2. it must be overloading but all Qc were performed on Bioanalyzer DNA1000 and qubit. it seems that it is important to run a titration flow cell before running the experiment.
Comment
-
-
Yes, it is a problem with defining of concentration. I used 16 pM (Qubit) on all lanes and some lanes gave a 1M, phiX gave a 350k, and library on self-made adapters gave 400 K.
Real-time gave a some-fold concentration higher than Qubit. And we observe a different concentration of phiX libraries from some deliveries (eight-fold difference on Qubit and real-time).
TruSeq Sample Prep kits give very high cluster density from libraries, so be care of high concentration of libraries.
Comment
-
-
You also can't really trust cluster scores that are over 1 million. The last time I overloaded a lane (after trusting the Qubit and not qPCR) I had clusters of ~1.2 million but ~60% passing filter. In talking with someone at illumina apparently the cluster finding algorithm starts not working properly at densities over ~ 1 million, and you may actually have a significantly higher cluster density that what it reports, it just can't handle it all.
Comment
-
-
I remember Illumina mentioning in one of their talks that qPCR is the "gold" standard in terms of quantifying.
For the HiSeq 2000 that we are using, having a 2nM (from qPCR) and loading with 12pM on the cBot usually hits the 1 mil mark. It's risky hitting near max thought; you can get more data from higher cluster densities or nothing at all if you over cluster.
Comment
-
-
You should be aware that high cluster densities (900-1000K) have a more deleterious effect on index reads than inserts. We've had several flow cells with good cluster calling (80-90% PF) and high quality scores (mean ~38), yet fewer than 50% of the indices were called accurately. In some cases, pseudotiles at the inflow side (which contain higher cluster densities) have completely dropped out (i.e., no basecalling) during the index read after producing high-quality insert reads. The problem can be mitigated by balancing the ratio of index bases at each position. It is not solved merely by having different bases at each position if those bases are excited by the same laser (A/C or G/T).
Comment
-
-
I have some hope that HCS 1.5.0 has improved index calling rates. (On the down side we were shut down for 3 weeks after installing 1.5.0. The upgrade failed to update a critical line in a config file. The result of this was that flow cells frequently, but not always, failed to find the edge of lane 8 and refused to start cycle 1 scanning.)Originally posted by HESmith View PostYou should be aware that high cluster densities (900-1000K) have a more deleterious effect on index reads than inserts. We've had several flow cells with good cluster calling (80-90% PF) and high quality scores (mean ~38), yet fewer than 50% of the indices were called accurately. In some cases, pseudotiles at the inflow side (which contain higher cluster densities) have completely dropped out (i.e., no basecalling) during the index read after producing high-quality insert reads. The problem can be mitigated by balancing the ratio of index bases at each position. It is not solved merely by having different bases at each position if those bases are excited by the same laser (A/C or G/T).
We always try to "MK balance" (M=A,C ; K=G,T) our indexes in a lane. By which I mean (as you imply above) you want a good number of clusters in the A or C channel AND plenty in the G or T channels as well. But a response to my post about this made me think this was not really an issue for the HiSeq, only for HiScanSQs.
Anyway, we have a lane with a mean cluster density of 950 K/mm2. 94% of the PF reads demultiplexed. (3 indexes.) That does seem better than what we were getting with previous versions of HCS.
--
Phillip
Comment
-
Latest Articles
Collapse
-
by SEQadmin2
I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.
Here are nine questions we think about, in roughly the order they matter, before...-
Channel: Articles
06-18-2026, 07:11 AM -
-
by SEQadmin2
Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.
The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
...-
Channel: Articles
06-02-2026, 10:05 AM -
ad_right_rmr
Collapse
News
Collapse
| Topics | Statistics | Last Post | ||
|---|---|---|---|---|
|
Started by SEQadmin2, 06-26-2026, 11:10 AM
|
0 responses
15 views
0 reactions
|
Last Post
by SEQadmin2
06-26-2026, 11:10 AM
|
||
|
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population
by SEQadmin2
Started by SEQadmin2, 06-17-2026, 06:09 AM
|
0 responses
49 views
0 reactions
|
Last Post
by SEQadmin2
06-17-2026, 06:09 AM
|
||
|
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism
by SEQadmin2
Started by SEQadmin2, 06-09-2026, 11:58 AM
|
0 responses
107 views
0 reactions
|
Last Post
by SEQadmin2
06-09-2026, 11:58 AM
|
||
|
Started by SEQadmin2, 06-05-2026, 10:09 AM
|
0 responses
125 views
0 reactions
|
Last Post
by SEQadmin2
06-05-2026, 10:09 AM
|
Comment