![]() |
BBDuk java error when filtering using entropy?
Hello,
I am having an issue with a java error when trying to use BBDuk to remove low entropy sequences from a fastq file. The libraries were made using ribozero so there are a number of polyT sequences I would like to remove. I have previously used BBDuk on the same library to remove PhiX an adapter sequences with no problem. The file has ~135 million 100bp SE reads. I am running on a node with 24 cores and 128 GiB RAM running CentOS Linux release 7.3.1611 and java version "1.7.0_131". I get this error with or without the -Xmx flag. Code:
$ bbduk.sh in=seq.fq out=seq_0-1-entrop-filtered.fq outm=low_complexity-0-1.fq entropy=0.1 Thank you. |
Can you only try "-Xmx24052m threads=12" ? Don't use -Xms=.
|
Quote:
Thank you for the suggestion. With the "-Xmx24052m threads=12" flag it runs with 12 threads and the memory but still has an ArrayIndexOutOfBoundsException in multiple threads... :confused: Code:
Set threads to 12 @D00278:496:CC4LRANXX:7:1109:7642:2397 1:N:0:1 AAGACGGGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT + BBBBBFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFBFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF<FFFFFFFFFFFFFFFFFFFFFFFFFF Thanks, Dave |
You can use something like
Code:
literal=TTTTTTTT k=5 |
worked finally - but had to increase kmer and literal..
Hi,
Thank you, I finally got it to work with literal, but I had to increase the length of the literal to 20 X T and increase the kmer to 25. With literal=TTTTTTTT k=5 I got 98% of the reads filtered out, even with mm=false and hdist=0... Ok, another question - how can I use bbduk to split my file into multiple files based on a inline barcode of 8 bp at the 5' end? I have 100 bp SE end reads and they are multiplexed using a 32 X 8 base barcodes. I have used sabre before but it is on another machine and I would like to avoid transferring files between different servers if possible. Cheers, Dave |
If you know the barcode sequences then you could run bbduk in "match" mode and require the match to be strict of 8 bp on 5'-end of the read (restrictleft=7). You may have to try a few command options out to see what works best.
|
All times are GMT -8. The time now is 08:17 AM. |
Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2021, vBulletin Solutions, Inc.