Hello everyone,
we're starting to work with deep sequencing data of target regions that were enriched by Agilent's Haloplex kit (Illumina MiSeq, paired-end, often >500x mean seq. depth).
How would you handle duplicate reads after alignment? The initial DNA fragmentation by restriction enzymes seems to make it impossible to remove duplicates (from PCR mainly) just like for random fragmentation. If you kept all the duplicates, does the amplification bias or other biases like strand bias hamper mutation detection significantly?
Would be great if someone had some experience he'd like to share and discuss here or knew some literature that presents some work with Haloplex sequencing data (which seems to be quite rare).
Thanks in advance,
Chris
we're starting to work with deep sequencing data of target regions that were enriched by Agilent's Haloplex kit (Illumina MiSeq, paired-end, often >500x mean seq. depth).
How would you handle duplicate reads after alignment? The initial DNA fragmentation by restriction enzymes seems to make it impossible to remove duplicates (from PCR mainly) just like for random fragmentation. If you kept all the duplicates, does the amplification bias or other biases like strand bias hamper mutation detection significantly?
Would be great if someone had some experience he'd like to share and discuss here or knew some literature that presents some work with Haloplex sequencing data (which seems to be quite rare).
Thanks in advance,
Chris
Comment