Hello Members,
I'm working with E. coli, paired end, Illumina data. Using SPAdes 3.5 version.
SLURM Environment. 128 gb RAM node.
I checked out QUAST report for one of isolate which was alarmingly high for an E. coli:- 6.592727 mb, with 918 contigs (1000>= bp).
Read length: 103
I traced back to warnings.log file for its assembly run, it says:
No idea how and what is causing it.
Meanwhile, I shall check fastqc report, see if that would make sense to me.
Any guidance would be of great help.
Thank you.
I'm working with E. coli, paired end, Illumina data. Using SPAdes 3.5 version.
SLURM Environment. 128 gb RAM node.
I checked out QUAST report for one of isolate which was alarmingly high for an E. coli:- 6.592727 mb, with 918 contigs (1000>= bp).
Code:
zcat file_R1_001.fastq.gz | awk '{if(NR%4==2) print length($1)} ' | head -1
I traced back to warnings.log file for its assembly run, it says:
=== Error correction and assembling warnings:
* 0:03:13.851 484M / 9G WARN General (kmer_coverage_model.cpp : 327) Valley value was estimated improperly, reset to 1
* 0:02:26.558 620M / 9G WARN General (kmer_coverage_model.cpp : 327) Valley value was estimated improperly, reset to 4
* 0:02:26.565 620M / 9G WARN General (kmer_coverage_model.cpp : 366) Failed to determine erroneous kmer threshold. Threshold set to: 4
======= Warnings saved to $HOME/Docs/warnings.log
* 0:03:13.851 484M / 9G WARN General (kmer_coverage_model.cpp : 327) Valley value was estimated improperly, reset to 1
* 0:02:26.558 620M / 9G WARN General (kmer_coverage_model.cpp : 327) Valley value was estimated improperly, reset to 4
* 0:02:26.565 620M / 9G WARN General (kmer_coverage_model.cpp : 366) Failed to determine erroneous kmer threshold. Threshold set to: 4
======= Warnings saved to $HOME/Docs/warnings.log
Meanwhile, I shall check fastqc report, see if that would make sense to me.
Any guidance would be of great help.
Thank you.
Comment