Hi all
When I received my reads they had a significant overrepresentation of 5-base sequences (listed by FASTQC but not described as adaptors). I then used NGS Toolkit and IlluQC to filter the reads (IlluQC is supposed to remove adaptors even when a library isn't available). My FASTQC reports improved a lot, but I still have some kmer overrepresentation and there's a somewhat "wavy behaviour" in the first few bases. Anyway, I trimmed the reads (10 bases from the 3' end) and assembled them.
So, my questions are: should I have trimmed the reads from the 5' end also? Looking at the images, how can I tell if I still have a contamination? My assembly wasn't fantastic, but the coverage is relatively low, so I don't know if it's the best I can get with these reads. And a truly silly question: aren't the adaptors supposed to be in the ends of the reads? I'm now starting to think that they might be in the middle also, but in that case they can't be removed by simply trimming/clipping the ends.
Thanks a lot
Sandra
When I received my reads they had a significant overrepresentation of 5-base sequences (listed by FASTQC but not described as adaptors). I then used NGS Toolkit and IlluQC to filter the reads (IlluQC is supposed to remove adaptors even when a library isn't available). My FASTQC reports improved a lot, but I still have some kmer overrepresentation and there's a somewhat "wavy behaviour" in the first few bases. Anyway, I trimmed the reads (10 bases from the 3' end) and assembled them.
So, my questions are: should I have trimmed the reads from the 5' end also? Looking at the images, how can I tell if I still have a contamination? My assembly wasn't fantastic, but the coverage is relatively low, so I don't know if it's the best I can get with these reads. And a truly silly question: aren't the adaptors supposed to be in the ends of the reads? I'm now starting to think that they might be in the middle also, but in that case they can't be removed by simply trimming/clipping the ends.
Thanks a lot
Sandra
Comment