Seqanswers Leaderboard Ad

**nilshomer** · 04-30-2010, 11:58 AM

Originally posted by krobison View Post

Question arising from another thread in the forums; I'm posting this separately (and cross-referencing) because I want to ask the more bioinformatics and less sample-prep oriented subset.

An apparently property of at least some hybridization capture methods is a tendency to reduce the library size. As a result, with long Illumina reads the paired end reads may overlap in the middle.

How do the variant callers out there handle this? If a variant is found in the overlap & the two reads agree, then that is clearly stronger evidence that a given variant is present in that DNA fragment. BUT, if you are worried about PCR (or sample damage such as FFPE) artifacts then you may want some separate accounting for having actually seen the same variant in two different fragments.

The short answer is it treats both ends as if they were from independent fragments.

What is more powerful/accurate, having two observations of the same DNA fragment (read pairs overlapping), or two observations of the same haplotype (sampling with two fragment reads)?

The former may improve the consensus call of the read, but does not increase the number of times an allele is sampled. The latter is independent of sequencing error artifacts from the same fragment and provides an independent observation of the same haplotype. Basically, if you need 30x diploid coverage to guarantee to see both alleles of a heterozygous variant, the former should count only once while the latter should count twice.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 59 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 57 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 53 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 56 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

How do variant callers deal with overlapping paired end reads?

Comment

Latest Articles

ad_right_rmr

News