SEQanswers

SEQanswers (http://seqanswers.com/forums/index.php)
-   Bioinformatics (http://seqanswers.com/forums/forumdisplay.php?f=18)
-   -   Duplicates and targeted sequencing (http://seqanswers.com/forums/showthread.php?t=27361)

chrna 02-11-2013 01:27 PM

Duplicates and targeted sequencing
 
I am trying to analyze data that sequenced targeted regions of the genome (about 500 Mb in total). In the GATK pipeline it suggests marking duplicates/removing duplicates. Is this important to do with targeted sequencing? Wouldn't you expect duplicates because of the experimental design? Thanks!

Bukowski 02-11-2013 01:46 PM

Yes. If you have PCR or optical duplicates you will want to remove them before variant calling or you may introduce spurious variant calls.

It does depend what method you have used to do the capture, some methods are not amenable to deduplication - amplicon based approaches etc. A straightforward in-solution capture approach will be fine because you've randomly sheared your DNA before capture, and therefore you shouldn't expect lots of read pairs with the same coordinates when mapped.


All times are GMT -8. The time now is 10:43 PM.

Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2019, vBulletin Solutions, Inc.