This may sound like a trivial question for most folks on this forum. Apologies, I am a newbie.
I have FASTA sequences from GenBank with unique sequence identifiers. For example, one looks something like this:
>gi|74026815|gb|DQ107070.1| Feline immunodeficiency virus isolate Ac002pA3 pol protein (pol) gene, partial cds
AGAGCAGATCCTAACAATCCCTGGAATACCCCTATATTTTGTATAAAGAAGAAATCAGGAAAATGGAGAATGTTAATAGATTTTAGAGAATTGAATGCAAAGACTGAGAAAGGAGCAGAAGTACAGTTAGGATTGCCTCA.....
I would like to replace the header above, and ideally the headers of all my sequences, with a name that is unique to where each sample was isolated, e.g.:
>Yellowstone
AGAGCAGATCCTAACAATCCCTGGAATACCCCTATATTTTGTATAAAGAAGAAATCAGGAAAATGGAGAATGTTAATAGATTTTAGAGAATTGAATGCAAAGACTGAGAAAGGAGCAGAAGTACAGTTAGGATTGCCTCA.....
Is there a script out there that allows one to do this all simultaneously?
Thank you kindly,
Nick
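For anyone with the same question, here is a minimal sketch of one possible approach in Python. It assumes you have already built a lookup from each GenBank accession (the fourth `|`-separated field of the header, e.g. `DQ107070.1`) to the location name you want. The names `accession_from_header`, `rename_headers`, and `new_names` are made up for illustration and do not come from any existing tool.

```python
def accession_from_header(header):
    """Pull the accession (e.g. 'DQ107070.1') out of a FASTA header line."""
    fields = header[1:].split("|")
    # Old-style GenBank headers: gi|<number>|gb|<accession>| description
    if len(fields) >= 4 and fields[0] == "gi":
        return fields[3]
    # Otherwise fall back to the first whitespace-delimited token
    parts = header[1:].split()
    return parts[0] if parts else ""


def rename_headers(lines, new_names):
    """Yield FASTA lines, swapping each header for its entry in new_names.

    new_names is a hypothetical dict mapping accession -> new header name.
    Headers with no entry in new_names are passed through unchanged.
    """
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith(">"):
            accession = accession_from_header(line)
            if accession in new_names:
                yield ">" + new_names[accession]
            else:
                yield line
        else:
            yield line


# Small in-memory example using the header from the question
# (the sequence is shortened here for illustration)
new_names = {"DQ107070.1": "Yellowstone"}
example = [
    ">gi|74026815|gb|DQ107070.1| Feline immunodeficiency virus isolate Ac002pA3",
    "AGAGCAGATCCTAACAATCCC",
]
renamed = list(rename_headers(example, new_names))
```

For a real file you would pass an open file handle as `lines` and write each yielded line to a new file. Keying on the accession rather than on the full header text means the description after the last `|` does not have to match exactly.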