Hello all,
I am trying to convert a DIAMOND output from a reference genome into a fasta file so I can use it as a database for subsequent DIAMOND annotations. Currently, the file is in .txt format with the columns going
Gene ID -- CAZy ID -- % Identical -- Length -- Mismatches -- Gap Open -- Gene Start -- Gene End -- CAZy Start -- CAZy End -- E Value -- Bit Score
As of now, I don't necessarily have an idea of how to figure out the sequences that correspond to each row, so I'm not sure how to really parse the output file to create a fasta file. Any help or advice would be much appreciated.
I am trying to convert a DIAMOND output from a reference genome into a fasta file so I can use it as a database for subsequent DIAMOND annotations. Currently, the file is in .txt format with the columns going
Gene ID -- CAZy ID -- % Identical -- Length -- Mismatches -- Gap Open -- Gene Start -- Gene End -- CAZy Start -- CAZy End -- E Value -- Bit Score
As of now, I don't necessarily have an idea of how to figure out the sequences that correspond to each row, so I'm not sure how to really parse the output file to create a fasta file. Any help or advice would be much appreciated.