I am BLASTing a transcriptome dataset against some vertebrate UniGene sets using standalone BLAST+. I want my output in a tab-delimited file, so I have been using -outfmt 6 or 7. However, this only returns the ID (or accession or GI) of the hit, i.e. something like gnl|UG|Gga#S19183375. What I would really like is the description of what the gene actually is. For the entry above, that is "Gallus gallus breast cancer 2, early onset (BRCA2), mRNA" (i.e. the sequence DEFINITION line in GenBank format).
The XML output has this information under Hit_def, but how can I get it into the tab-delimited output?
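To make it concrete, here is a rough sketch of the kind of post-processing I am hoping to avoid: reading the XML output (-outfmt 5) and writing one tab-delimited row per HSP with the Hit_def text included. The function names (blast_xml_to_rows, rows_to_tsv) are made up for illustration, and in the embedded sample record the query name, e-value and bit score are placeholder values, not real results. Only the hit ID and description come from my example above.

```python
import xml.etree.ElementTree as ET

# Tiny stand-in for a BLAST+ XML report (-outfmt 5).
# The query name, e-value and bit score are placeholders for illustration.
SAMPLE_XML = """<BlastOutput>
  <BlastOutput_iterations>
    <Iteration>
      <Iteration_query-def>contig_1</Iteration_query-def>
      <Iteration_hits>
        <Hit>
          <Hit_id>gnl|UG|Gga#S19183375</Hit_id>
          <Hit_def>Gallus gallus breast cancer 2, early onset (BRCA2), mRNA</Hit_def>
          <Hit_hsps>
            <Hsp>
              <Hsp_bit-score>100.0</Hsp_bit-score>
              <Hsp_evalue>1e-20</Hsp_evalue>
            </Hsp>
          </Hit_hsps>
        </Hit>
      </Iteration_hits>
    </Iteration>
  </BlastOutput_iterations>
</BlastOutput>"""


def blast_xml_to_rows(xml_text):
    """Return one (query, hit_id, hit_def, evalue, bit_score) tuple per HSP."""
    root = ET.fromstring(xml_text)
    rows = []
    for iteration in root.iter("Iteration"):
        query = iteration.findtext("Iteration_query-def", default="")
        for hit in iteration.iter("Hit"):
            hit_id = hit.findtext("Hit_id", default="")
            hit_def = hit.findtext("Hit_def", default="")
            for hsp in hit.iter("Hsp"):
                rows.append((
                    query,
                    hit_id,
                    hit_def,
                    hsp.findtext("Hsp_evalue", default=""),
                    hsp.findtext("Hsp_bit-score", default=""),
                ))
    return rows


def rows_to_tsv(rows):
    """Join the rows into tab-delimited text, one line per row."""
    return "\n".join("\t".join(row) for row in rows)


if __name__ == "__main__":
    print(rows_to_tsv(blast_xml_to_rows(SAMPLE_XML)))
```

For a real report I would read the file contents in place of SAMPLE_XML. This loads the whole document into memory, so for a large transcriptome run something like ET.iterparse would probably be the better fit.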