Seqanswers Leaderboard Ad

**krobison** · 05-07-2011, 04:05 AM

A lot of this is going to depend on how the library was prepared in terms of size selection.

For example, one technique is to use electrophoresis to sort by size & then cut out a specific band. These sorts of libraries may have a size distribution which is very close to uniform within very specific bands -- i.e. you might have essentially nothing larger or smaller than a defined range. The reality is probably a little bit of blurring of that boundary, but I'm guessing not a lot.

Size selection with beads, on the other hand, probably isn't quite as sharp and perhaps is more like a normal (I haven't looked). Nextera would probably be different again. Some libraries prep protocols I think rely solely on the shearing device to generate a population.

Too many papers fail to report how this is done, so if you wanted to study this you'll need to dig through a bunch of papers to find those that report their methods. But I would guess if you looked through a lot of papers, you'd find a bunch of different distributions. Perhaps if you can identify the center which did each sequence in 1K genomes, you'd see a different distribution which corresponds to their method.

**delphi_ote** · 05-07-2011, 07:58 AM

Thank you so much, krobison! That was incredibly informative, and pointed me toward a lot of good resources. I really appreciate it.

**gogreen** · 05-08-2011, 05:57 AM

hi delphi_ote, Krobison was right with the point. If one uses Gel selection or other automated size selection methods, the size selected fragments are mostly in X±30 bp where X is the selected size.
But when beads are used for size selection, this can be quite a large distribution typically ranging over a 100 bp or more of the desired size.
Attached are 2 bioanalyzer profiles of two libraries. One using Gel size selection and other using beads (The bead size selection can do a better job than this, I just found this one first)

Attached Files

**pmiguel** · 05-09-2011, 04:05 AM

To add another twist, sometimes one of the methods we use to size fractionate DNA, E-gel, has too narrow a size window, so we do a few collections.

Well, that may not be clear... These E-gels have a slot in the gels with no agarose in it--just water or buffer some distance from the loading well. The DNA migrates first into the agarose of the gel, where it migrates at differential rates largely determined by length of the DNA fragment. When it reaches the collection slot it migrates through this window, continuing on back into the gel on the other side. Once the desired size range of DNA is migrating through the window the gel is stopped and the fraction is pulled out with a pipette.

But the well can be filled back in and electrophoresis continued, and then another fraction taken at a later time. This can easily result in bimodal (or multi-modal) size distributions if the resulting fractions are pooled at a later point.

I don't know how common this practice is, but in cases where there is concern for the limited amount of library being produced I would imagine it would be common.

--
Phillip

**delphi_ote** · 05-09-2011, 10:31 AM

Every time I ask people who do the hard work, I always learn the real story. Thanks so much, gogreen and pmiguel. Clearly, this community was the right one to ask!

Do you know if any of these library preparation techniques would cause the desired fragment lengths to be 100bp or more less than the desired size? A few of the libraries I've been examining seem like they're not only bimodal, but also significantly shorter. For example, here's a graph I made for a library that was designed to be 614bp:

Any idea what would cause this?

**gogreen** · 05-09-2011, 01:01 PM

When you say 612 bp, is it the mean insert size or the size that was gel selected? If it was the selected size, you'd lose around 120 bp for the adapters on both ends which would explain why you get insert size of 440-500 bp. The smaller ones could be the self ligated adapters which typically appears at 120-135 bp (although theoretically not possible, it does happen!). Is this from some modified RNAseq or chipseq??

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Yesterday, 11:49 AM	0 responses 13 views 0 likes	Last Post by seqadmin Yesterday, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

Are Illumina library fragment lengths actually normally distributed?

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News