Hi,
I was simulating a .fastq read set from a genome by reading random locations of the genome. My objective here was to test some read error correction software like Quake. The .fastq file I created looks something like this:
@SRR566546.970 HWUSI-EAS1673_11067_FC7070M:4:1:2299:1109 length=100
TACGTGACTGGATCAAAACTCACAAGGACTTTAATGGCCGCCGCTATACACTGCATCATTGCGTAGTCAGCTAATGCCGGGCGACTGGTTGGCTATTGTA
+SRR566546.970 HWUSI-EAS1673_11067_FC7070M:4:1:2299:1109 length=100
IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII
@SRR566546.970 HWUSI-EAS1673_11067_FC7070M:4:1:2299:1109 length=100
TGAAACATGGGTATTTCGTGACTCTGGTCTAAAGAGGGACGTGAGAGGGCAGCGCTACCTATTGACCTGTTGTGAATTTGCGATTGTCAGGCATGATAAA
+SRR566546.970 HWUSI-EAS1673_11067_FC7070M:4:1:2299:1109 length=100
IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII
However, when I run jellyfish (while running SEECER), it returns warnings like the following
Warn: Bad character in sequence: :
Warn: Bad character in sequence: 1
Warn: Bad character in sequence: :
Warn: Bad character in sequence: 2
Warn: Bad character in sequence: 2
Warn: Bad character in sequence: 9
It looks like it is considering the control lines in the .fastq file as reads. I would appreciate any help with regard to what I am missing here.
I was simulating a .fastq read set from a genome by reading random locations of the genome. My objective here was to test some read error correction software like Quake. The .fastq file I created looks something like this:
@SRR566546.970 HWUSI-EAS1673_11067_FC7070M:4:1:2299:1109 length=100
TACGTGACTGGATCAAAACTCACAAGGACTTTAATGGCCGCCGCTATACACTGCATCATTGCGTAGTCAGCTAATGCCGGGCGACTGGTTGGCTATTGTA
+SRR566546.970 HWUSI-EAS1673_11067_FC7070M:4:1:2299:1109 length=100
IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII
@SRR566546.970 HWUSI-EAS1673_11067_FC7070M:4:1:2299:1109 length=100
TGAAACATGGGTATTTCGTGACTCTGGTCTAAAGAGGGACGTGAGAGGGCAGCGCTACCTATTGACCTGTTGTGAATTTGCGATTGTCAGGCATGATAAA
+SRR566546.970 HWUSI-EAS1673_11067_FC7070M:4:1:2299:1109 length=100
IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII
However, when I run jellyfish (while running SEECER), it returns warnings like the following
Warn: Bad character in sequence: :
Warn: Bad character in sequence: 1
Warn: Bad character in sequence: :
Warn: Bad character in sequence: 2
Warn: Bad character in sequence: 2
Warn: Bad character in sequence: 9
It looks like it is considering the control lines in the .fastq file as reads. I would appreciate any help with regard to what I am missing here.
Comment