I tend to think I should run picard to remove duplicate after I use GATK to do realign and recalibrate, then remove duplicate. Or should I remove duplicate first.
I fail to understand the algorithm Picard uses to remove duplicates. Can someone explains how does Picard determine if a read is duplicate?
Thanks
I fail to understand the algorithm Picard uses to remove duplicates. Can someone explains how does Picard determine if a read is duplicate?
Thanks
Comment