Hello,
I have paired end data from Illumina hi seq for a bacterial genome that has been sequenced using three different insert sizes, 160, 305 and 505 respectively. My task is to perform de novo assembly of the genome but the problem is that every single file contains more than 60 million reads and its not possible to run assembly of this much large file. Is there any way I can reduce the size of the file, by removing some reads?? or performing some kind of filteration?
I have paired end data from Illumina hi seq for a bacterial genome that has been sequenced using three different insert sizes, 160, 305 and 505 respectively. My task is to perform de novo assembly of the genome but the problem is that every single file contains more than 60 million reads and its not possible to run assembly of this much large file. Is there any way I can reduce the size of the file, by removing some reads?? or performing some kind of filteration?
Comment