Seqanswers Leaderboard Ad

**nucacidhunter** · 05-06-2014, 01:14 AM

If we take higher end of your fragment number estimate and assume that by size selection half of the resultant double digested fragments are present in your libraries, there will 50Kx120=6 M fragments. A lane of HiSeq with 150 M reads would give an average 25x coverage. If 25% of reads are not informative, the coverage will be around 18X. Depending on your intended downstream application, ploidy and relatedness of samples this coverage might be enough. However, a tighter gel cut still would be better option than double SPRI size-selection.

**SNPsaurus** · 05-06-2014, 09:19 AM

I find people usually underestimate the number of reads needed when multiplexing. A typical issue is that the number of reads per sample may have a 4-fold range (some get 500k reads, some 2M reads). There is also a wide range of depths at different loci (some will have 10 reads, some 200 reads). The locus variation is usually consistent across samples though. And then, as nucacidhunter mentioned, some % of reads are lower quality, don't align or have other issues that prevent them front being used productively.

It may not matter for your analysis, but of the 120 samples, 40 may have 9X depth on average instead of 18X. And of your 50k loci in those 40 samples, 25k of the loci may have 4X depth. Before you start is the best time to check if your statistics are robust to missing alleles and other issues this variation will create.

**Todd McLay** · 05-06-2014, 08:31 PM

Thanks for both of your responses.

I have been fiddling with calculations for a while now, it's a daunting task to try and determine the best way to do it when a failed run costs so much.

I think I will take your advice and use a gel cut in the final library to narrow the size range.

I am intending to use ddRAD for phylogenetic purposes. Other papers I have read set the minimum coverage for loci as low as 4x, but I don't find that as satisfying as a coverage greater than 10, which was why I decided to multiplex 120 or so for the 50-100k loci I assumed.

Regards,
Todd

Topics	Statistics	Last Post
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Today, 08:47 AM	0 responses 12 views 0 likes	Last Post by seqadmin Today, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 59 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 54 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM

Seqanswers Leaderboard Ad

Announcement

ddRAD size selection with Ampure beads

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News