Hello again (2nd post in one day, the other one is here)
On this thead the issue is as follows:
I have a table corresponding to a multifasta protein file which contains one row per predicted motif consisting on the name of the sequence, the motif predicted, and the coordinates.
What do I want to do? Convert the coordinates to the corresponding ones in the CDS multifasta file and extract them.
The thing is, just multiplying by 3 doesn't do the work because, for example, if the motif starts in the aminoacid 1, the corresponding sequence in the CDS file starts in the first position, not the third one
Any ideas? BTW, I work on python3 and bash but at this point, any solution will do
Thanks in advance
On this thead the issue is as follows:
I have a table corresponding to a multifasta protein file which contains one row per predicted motif consisting on the name of the sequence, the motif predicted, and the coordinates.
What do I want to do? Convert the coordinates to the corresponding ones in the CDS multifasta file and extract them.
The thing is, just multiplying by 3 doesn't do the work because, for example, if the motif starts in the aminoacid 1, the corresponding sequence in the CDS file starts in the first position, not the third one
Any ideas? BTW, I work on python3 and bash but at this point, any solution will do
Thanks in advance