Hi,
I have one lane (~360 million reads) of Illumina transcriptome data which I am trying to assemble. Having only a tiny server, I am attempting to reduce this dataset with digital normalization. However, before doing this, I have run fastqc which flags up some over-represented sequences. When I BLAST these, I get matches to my organism's 16S, 18S and 28S rRNA genes.
As a bit of a beginner, I was wondering whether this was normal and whether I need to worry or do anything about this?
Any help or advice very much appreciated!!
I have one lane (~360 million reads) of Illumina transcriptome data which I am trying to assemble. Having only a tiny server, I am attempting to reduce this dataset with digital normalization. However, before doing this, I have run fastqc which flags up some over-represented sequences. When I BLAST these, I get matches to my organism's 16S, 18S and 28S rRNA genes.
As a bit of a beginner, I was wondering whether this was normal and whether I need to worry or do anything about this?
Any help or advice very much appreciated!!
Comment