Using Restriction Enzymes pre library prep for targeted sequencing.

Hoban

Junior Member

Join Date: Mar 2015

Posts: 3
- Share
- Tweet
#1

Using Restriction Enzymes pre library prep for targeted sequencing.

03-23-2015, 12:27 PM

Hey Everyone,

I'm using a targeted capture approach for sequencing the variable regions in Ig, we're trying to develop a basis for genotyping experiments for Ig. We're going with HiSeq, we have too many samples for PacBio. Ig is hard because of the huge amounts of structural variations that can occur (frequent insertions, deletions, duplications, and 'complex' events) which make sequencing with NGS difficult.

Here's my plan, design an enzyme cocktail that will chop at defined regions in the Ig region from one sample into ~1kb+ segments with the majority of segments in the 1kb-20kb range. Size select (I'm proposing <1kb, 1kb-5kb, 5kb-15kb, 15kb+) and separately fragment -> index each size pool. Then pool everything together (and include a non-restricted regular library prepped genome), and do the capture, amp, and sequencing.

The thought is that the extra information (if the read came from a 1kb, 5kb, 15kb, or 15kb+ region) will help differentiate reads during alignment, indel, and read depth analysis. An ex: Reads 1-2-3-4-5 all align to the same area, reads 1-2-5 are 15kb indexed, 3-4 are 1kb indexed, so it is likely 1-2-5 and 3-4 are from separate areas of the region.

I haven't been able to find anything too similar. I've mostly been going off of RADseq papers but since I still want to sequence the whole region I'm just size selecting and indexing separately then pooling, nothing gets thrown out from the region (assuming everything in the region is capture-able by our custom capture). I've considered doing PacBio for 5-10% of the samples to do de novo assembly and align the HiSeq data to that reference (hg19 is a very poor reference for variable regions in Ig).

Any thoughts/advice?
Tags: capture, igh, ngs, restriction enzyme, targed sequencing
SNPsaurus

Registered Vendor

Join Date: May 2013

Posts: 525
- Share
- Tweet
#2

03-28-2015, 09:35 AM

That's an interesting idea, although I'm not convinced it would get you much past "so it is likely 1-2-5 and 3-4 are from separate areas of the region" although maybe that is all you need?

I'm not fully understanding the costs of PacBio sequencing of small regions. My intuition is that capture of long DNA fragments and PacBio sequencing will be cheaper and more directly return applicable data compared to individually digesting, size selecting, and fragmenting with indexing of each sample followed by capture. I guess multiplexing the capture is a big help, though. On the other hand, the cost of analysis would be much higher with your protocol and require lots of manual annotation.

There were some genome assembly papers that partitioned the genome into different size restriction fragments to limit the reads for de novo assembly. I can't remember them exactly, but they might have some useful perspective on how much such information can help if you can search them out. One more recent one is http://genome.cshlp.org/content/20/2/249.full but I think some other papers were in this vein earlier on.

Providing nextRAD genotyping and PacBio sequencing services. http://snpsaurus.com
Comment

Previous template Next

Essential Discoveries and Tools in Epitranscriptomics

by seqadmin

The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
- Channel: Articles
04-22-2024, 07:01 AM
Current Approaches to Protein Sequencing

by seqadmin

Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
- Channel: Articles
04-04-2024, 04:25 PM

Topics	Statistics	Last Post
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Today, 08:47 AM	0 responses 10 views 0 likes	Last Post by seqadmin Today, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 57 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 53 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM

Seqanswers Leaderboard Ad

Announcement

Using Restriction Enzymes pre library prep for targeted sequencing.

Comment

Latest Articles

ad_right_rmr

News