OK, I did install and run this clumpify tool,
and it did not make any real difference. It found some 300 duplicates. After removing them I assemble slightly fewer reads, to a similar N50 and total nt count, and of the contigs produced I had one fewer contig match than before.
I have heard the terms redundant vs. non-redundant. Just to make sure I understand: would removing these duplicates make my set of reads non-redundant? Or am I completely lost?
