**SNPsaurus** · 02-19-2020, 11:18 PM

That's a pretty good idea! We published an approach like that :-) https://bmcgenomics.biomedcentral.co...864-016-2669-3

However, it would be hard to drive it down to 1 in 1 million. I think something like duplex-sequencing https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4271547/ does it better for ultra-low frequency changes.

I don't know if anyone has tried PacBio HiFi reads on short fragments, say 1 kb. You could assay random chunks of the genome without amplification (removing PCR-induced changes) and generate an accurate consensus sequence for each 1 kb fragment after getting 100 or more passes on it. If PacBio errors are as random as they say, then 100 passes should give an amazing consensus quality.
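As a rough back-of-the-envelope check on the multi-pass idea above, here is a minimal sketch of how often a simple per-base majority vote would be wrong if every pass erred independently. The 10% per-pass error rate and the function name `majority_error` are illustrative assumptions, not PacBio figures, and real consensus callers do more than a majority vote.

```python
from math import comb

def majority_error(p, n):
    """Probability that strictly more than half of n independent passes
    are wrong at one base, given per-pass error probability p.
    Simplification: every wrong pass is treated as voting for the same
    wrong base, which overstates the true consensus error."""
    k_min = n // 2 + 1
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k)
               for k in range(k_min, n + 1))

# Assumed per-pass error rate of 10% (hypothetical value for illustration).
for passes in (5, 11, 25, 101):
    print(passes, majority_error(0.10, passes))
```

The result only holds if errors really are independent between passes; any systematic, sequence-dependent error would survive the vote no matter how many passes are taken.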

**torben** · 02-25-2020, 12:58 AM

Originally posted by SNPsaurus:

However, it would be hard to drive it down to 1 in 1 million. I think something like duplex-sequencing https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4271547/ does it better for ultra-low frequency changes.

There is also an improved version of the duplex sequencing method that uses CRISPR/Cas9 for better enrichment:
Targeted genome fragmentation with CRISPR/Cas9 enables fast and efficient enrichment of small genomic regions and ultra-accurate sequencing with low DNA input (CRISPR-DS)

**maxz411** · 03-04-2020, 07:43 PM

Originally posted by SNPsaurus:

That's a pretty good idea! We published an approach like that :-) https://bmcgenomics.biomedcentral.co...864-016-2669-3

However, it would be hard to drive it down to 1 in 1 million. I think something like duplex-sequencing https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4271547/ does it better for ultra-low frequency changes.

I don't know if anyone has tried PacBio HiFi reads on short fragments, say 1 kb. You could assay random chunks of the genome without amplification (removing PCR-induced changes) and generate an accurate consensus sequence for each 1 kb fragment after getting 100 or more passes on it. If PacBio errors are as random as they say, then 100 passes should give an amazing consensus quality.

Thank you! And sorry for the delayed response; I wanted to read through your PELE-seq paper as well as the one on ENU-induced mutagenesis, since that is what I am doing (in cells though, not in fish).

This approach is exactly the one I would like to take, so it is very helpful that you replied with this information.

My primary question is whether you think the barcoding is necessary. I do not necessarily need to eliminate false positives; all I am really looking for is a statistically significant difference in the number of mutants between treated and untreated populations. From what I have read about the error rate of Q5 polymerase, which I am using, that level of error should also be tolerable for this purpose, so I would think this is still possible with a small number of false positives. I am also interested in variants that may be present in only one or a few cells within a population of millions, and as I understand it, the barcoding process would filter out most of these ultra-rare variants.
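The treated-versus-untreated comparison described above could be sketched as an exact conditional test on mutant read counts. This is only one possible choice of test, and it assumes mutant calls behave like independent Poisson events with the same false-positive background in both samples; the function name and the example counts are hypothetical.

```python
from math import comb

def rate_excess_pvalue(mut_treated, depth_treated, mut_control, depth_control):
    """One-sided exact test that the treated sample has a higher mutant rate.
    Conditional on the total number of mutant calls, the treated count is
    Binomial(total, depth_treated / total_depth) under the null hypothesis
    of equal rates."""
    total = mut_treated + mut_control
    share = depth_treated / (depth_treated + depth_control)
    return sum(comb(total, k) * share**k * (1 - share) ** (total - k)
               for k in range(mut_treated, total + 1))

# Hypothetical counts: mutant calls and read depth at the amplicon.
p = rate_excess_pvalue(mut_treated=40, depth_treated=1_000_000,
                       mut_control=10, depth_control=1_000_000)
print(p)
```

A shared background of false positives inflates both counts and so reduces power, but it should not by itself create a difference between the two samples, provided both are prepared and sequenced the same way.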

**SNPsaurus** · 03-05-2020, 12:35 PM

Right, the barcoding does require a certain level of presence in the population; otherwise one pool has it and the other does not. 1 in a million still sounds too rare to be able to identify. Can you bottleneck the cell populations to increase the presence of some rare mutations (and eliminate most)?

**maxz411** · 03-09-2020, 01:22 PM

Originally posted by SNPsaurus:

Right, the barcoding does require a certain level of presence in the population; otherwise one pool has it and the other does not. 1 in a million still sounds too rare to be able to identify. Can you bottleneck the cell populations to increase the presence of some rare mutations (and eliminate most)?

That is a possibility. I suppose the reason I thought 1 in a million would be reasonable is that, if I set a Phred cutoff of 30 (an error probability of 1 in 1,000 per base), the probability of both paired-end reads having an incorrect base at the same position would be 1 in 1,000,000 (excluding PCR errors). Or is there a reason it doesn't work like this in practice?
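The arithmetic behind that estimate can be written out as a small sketch. It assumes the two reads err independently and that the quality scores are accurate; the helper names are made up for illustration. Note that for a false variant call both reads must also show the same wrong base, which under a uniform choice among the three alternatives lowers the figure by a further factor of three.

```python
def phred_to_error(q):
    """Convert a Phred quality score to a per-base error probability."""
    return 10 ** (-q / 10)

def both_wrong(q1, q2):
    """Probability that both reads are wrong at the same position,
    assuming the two errors are independent."""
    return phred_to_error(q1) * phred_to_error(q2)

def same_wrong_base(q1, q2):
    """Probability that both reads show the SAME wrong base, assuming each
    error picks one of the three other bases uniformly at random."""
    return both_wrong(q1, q2) / 3

print(phred_to_error(30))       # per-read error at Q30
print(both_wrong(30, 30))       # both reads wrong, independent errors
print(same_wrong_base(30, 30))  # both reads agree on the same wrong base
```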

**SNPsaurus** · 03-09-2020, 02:27 PM

I think the problem comes from Illumina errors not being perfectly random, and this bias is not reflected in the quality scores. At 1 in a million there will be a higher background of artifacts than real changes.
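One way to picture this is a toy model in which a small share of errors is systematic (for example, tied to sequence context) and therefore shows up in both reads of a pair, while the rest is random. The model and the rates below are illustrative assumptions only, not measured Illumina values.

```python
def pair_error_floor(p_random, p_systematic):
    """Probability that both reads of a pair are wrong at one position.
    Systematic errors are assumed to hit both reads together; random
    errors must coincide by chance."""
    return p_systematic + (1 - p_systematic) * p_random**2

# Purely random errors at Q30 versus the same with a hypothetical
# 1-in-100,000 systematic component added.
print(pair_error_floor(1e-3, 0.0))
print(pair_error_floor(1e-3, 1e-5))
```

In this toy model, even a systematic component far below the per-read error rate dominates the paired-read floor, because it is not squared away by requiring agreement between the two reads.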

**maxz411** · 03-12-2020, 01:50 PM

Thanks, that makes sense.

Since I am only doing a single amplicon, I am wondering if there is a way to do something like this through PCR alone. I would think that if I barcoded each primer, I might be able to do something similar, assuming that in subsequent rounds of PCR the primer with the matching barcode was strongly favored over a mismatched barcode. Are you aware of any examples of something like this being done? Or do you think the genome fragmentation and barcode ligation before PCR is unavoidable?
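For reference, the downstream analysis that any such barcoding scheme would feed into can be sketched as follows: reads are grouped by barcode and a consensus is called per family, so that errors arising after barcoding are outvoted. This is a minimal illustration, not the method from any of the papers linked in this thread; the function name, thresholds, and toy reads are hypothetical, and it assumes the reads are already aligned to the same start.

```python
from collections import Counter, defaultdict

def consensus_by_barcode(reads, min_family=3, min_agree=0.9):
    """Group (barcode, sequence) pairs by barcode and call a per-position
    consensus for each family. Families smaller than min_family are
    dropped; positions without min_agree agreement are masked as 'N'."""
    families = defaultdict(list)
    for barcode, seq in reads:
        families[barcode].append(seq)

    consensus = {}
    for barcode, seqs in families.items():
        if len(seqs) < min_family:
            continue
        length = min(len(s) for s in seqs)
        bases = []
        for i in range(length):
            base, count = Counter(s[i] for s in seqs).most_common(1)[0]
            bases.append(base if count / len(seqs) >= min_agree else "N")
        consensus[barcode] = "".join(bases)
    return consensus

# Toy input: three barcode families over a 4 bp amplicon.
reads = (
    [("AAGT", "ACGT")] * 3
    + [("CCTA", "ACGT"), ("CCTA", "ACGT"), ("CCTA", "ACTT")]
    + [("GGGG", "ACGT")]
)
print(consensus_by_barcode(reads))
```

The `min_family` cutoff is also where rare variants get lost: a molecule that is sequenced only once or twice never forms a family large enough to be called.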

Overlapping paired-end reads for rare mutation detection
