Does anyone have any experience with removing duplicate reads from Ion Torrent data? Any idea if the 454-designed programs work well with Torrent (e.g., cd-hit-454)? Or any suggestions for other programs to use? The basis for this question is PCR-derived duplicates which I am anticipating will skew our analysis, and I would therefore like to remove them.
I should also mention that we do not have a reference genome to align reads to, as we are working with WGS metagenomic data.
I should also mention that we do not have a reference genome to align reads to, as we are working with WGS metagenomic data.
Comment