Hey all, newbie here. I'm not sure if this is the appropriate place to post this, but I was wondering if I could get some help with an issue involving Illumina deepseq data. I'm trying to run a batch of deepseq data that we recently received through CASAVA v1.7 and align it to a genome. The file is in .fastq format, and the reads look like this:
@6:1:1410:944:N
NNNNCAAACACAAAGTTACCTAAACTATAGAAGTCAAACA
+
####&&()''@@@@@@8@@@31888@@@@@3885817775
However, when I try to run it through the program, it gives the following error:
Could not identify index of the following line:
*********************************
6:1:1410:944:N
*********************************
Please check your files, we expect the following syntax:
<machine-id>_<run-number>(flow_cell-id):lane:tile:x:y#<index>:<pair>
machine-id: all characters except '_'
I realize this is a formatting issue, as CASAVA wants the read header in the format of:
@<machine_id>:<lane>:<tile>:<x_coord>:<y_coord>#<index>/<read_#>
But I'm unsure how to go about fixing it. I'm pretty sure the machine_id is missing, as well as any information about the index and read number. Any help would be much appreciated. Thanks!
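For illustration, here is a rough Python sketch of the kind of header rewrite I think is needed. It rests on several assumptions that I have not verified: that the five existing fields are lane, tile, x, y and a trailing flag that can be dropped; that every record is exactly four lines; and that placeholder values ("MACHINE", run 1, index 0, read 1) are acceptable stand-ins for the missing machine_id, index and read number. The function names `fix_header` and `fix_fastq` are made up for this example, and the separator before the read number ("/") follows the format quoted above rather than the ":" shown in the error message.

```python
def fix_header(header, machine_id="MACHINE", run_number=1, index="0", read=1):
    """Rewrite '@lane:tile:x:y:flag' as '@machine_run:lane:tile:x:y#index/read'.

    machine_id, run_number, index and read are placeholder values (assumptions),
    since the original headers do not carry this information.
    """
    fields = header.rstrip("\n").lstrip("@").split(":")
    if len(fields) != 5:
        raise ValueError(f"unexpected header layout: {header!r}")
    # Assumed field order; the trailing flag (e.g. 'N') is discarded.
    lane, tile, x, y, _flag = fields
    return f"@{machine_id}_{run_number}:{lane}:{tile}:{x}:{y}#{index}/{read}"


def fix_fastq(lines, **kwargs):
    """Yield FASTQ lines with every header (first line of each 4-line record) rewritten."""
    for i, line in enumerate(lines):
        line = line.rstrip("\n")
        yield fix_header(line, **kwargs) if i % 4 == 0 else line


if __name__ == "__main__":
    record = [
        "@6:1:1410:944:N",
        "NNNNCAAACACAAAGTTACCTAAACTATAGAAGTCAAACA",
        "+",
        "####&&()''@@@@@@8@@@31888@@@@@3885817775",
    ]
    for out_line in fix_fastq(record):
        print(out_line)
```

To process a real file, the same generator could be fed an open file handle and its output written line by line to a new .fastq file. Only the header lines change; sequence and quality lines pass through untouched.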