Dear all,
I'm looking into splitting a FASTQ read file into several smaller sized files. It's basically just distributing batches of 4 lines into a certain number of files.
I'm trying with
which works of course, but is too slooooooooooow on a HiSeq read file.
Any recommendations for faster splitting? awk, sed?
Thanks!
I'm looking into splitting a FASTQ read file into several smaller sized files. It's basically just distributing batches of 4 lines into a certain number of files.
I'm trying with
Code:
split -l <number of lines per file> <FASTQ>
Any recommendations for faster splitting? awk, sed?
Thanks!

Though... 2mins on a 16 Gb file?
Tried splitting a 32Gb file and it took HOURS! There must have been something seriously wrong with our file system server...
yes you can use awk, how big you want your small files?
Comment