G'day Everyone,
I am trying to split six plates of data by the barcodes of 576 indviduals. We are split our data using 'fastx_barcode_splitter.pl', but unfortunately this tool isn't able to work with a barcodes file where barcodes have different lengths. To combat this we decided to feed one barcode in at a time which works fine.
Our problem is that during this process 'fastx_barcode_splitter.pl' also writes out an unmatched.fq file which is about 50X larger than the file we are interested in and is taking up a lot of the processing time. It took longer than an hour to split out 1 individual, multiplied by 576 means it will take way to long to run.
Is there a way to stop 'fastx_barcode_splitter.pl' producing an unmatched.fq file? I think this would help us reduce a lot of unnecessary read and writing processing time.
Thanks for your help in advance.
Cheers,
Adam
I am trying to split six plates of data by the barcodes of 576 indviduals. We are split our data using 'fastx_barcode_splitter.pl', but unfortunately this tool isn't able to work with a barcodes file where barcodes have different lengths. To combat this we decided to feed one barcode in at a time which works fine.
Our problem is that during this process 'fastx_barcode_splitter.pl' also writes out an unmatched.fq file which is about 50X larger than the file we are interested in and is taking up a lot of the processing time. It took longer than an hour to split out 1 individual, multiplied by 576 means it will take way to long to run.
Is there a way to stop 'fastx_barcode_splitter.pl' producing an unmatched.fq file? I think this would help us reduce a lot of unnecessary read and writing processing time.
Thanks for your help in advance.
Cheers,
Adam
Comment