hi,
I have a fasta file including arabidopsis protein sequences like attached below. The uniprot AC is provided after the ">sp|", like ">sp|A8MQL7|". How can I get all their AGI ID like "AT3G04715" in an excel file. Thanks!
Attached examples:
>sp |A8MQL7| Protein RALF-like 20
MVLSKKTIMSFALFGHHERJYEDFSFHSHSHJJSJRJYU
>sp |A8MQM2| protein RALF-lile 6
AKHJSJHAFGAHFGLAGLNGAIYUEFNVJGIOAOPGAOJSFO
I have a fasta file including arabidopsis protein sequences like attached below. The uniprot AC is provided after the ">sp|", like ">sp|A8MQL7|". How can I get all their AGI ID like "AT3G04715" in an excel file. Thanks!
Attached examples:
>sp |A8MQL7| Protein RALF-like 20
MVLSKKTIMSFALFGHHERJYEDFSFHSHSHJJSJRJYU
>sp |A8MQM2| protein RALF-lile 6
AKHJSJHAFGAHFGLAGLNGAIYUEFNVJGIOAOPGAOJSFO