Hi,
I have a GenBank file, that contains several mRNA and CDS.
I'd like to pull from that file pairs of mRNA and CDS.
For example:
Having a .gb file for NF1 gene
http://www.ncbi.nlm.nih.gov/nuccore/...report=genbank
I know, that mRNA with ID NM_000267.3 has corresponding CDS with ID NP_000258.1
I know it because a tag of mRNA: /product="neurofibromin 1, transcript variant 2"
describes "/product" tag of CDS: /product="neurofibromin isoform 2"
I use Perl's Bio::SeqIO for parsing .gb files. I can pull all main tags like mRNA and CDS, but I don't see any way how to combine them in pairs.
Thanks for suggestions.
I have a GenBank file, that contains several mRNA and CDS.
I'd like to pull from that file pairs of mRNA and CDS.
For example:
Having a .gb file for NF1 gene
http://www.ncbi.nlm.nih.gov/nuccore/...report=genbank
I know, that mRNA with ID NM_000267.3 has corresponding CDS with ID NP_000258.1
I know it because a tag of mRNA: /product="neurofibromin 1, transcript variant 2"
describes "/product" tag of CDS: /product="neurofibromin isoform 2"
I use Perl's Bio::SeqIO for parsing .gb files. I can pull all main tags like mRNA and CDS, but I don't see any way how to combine them in pairs.
Thanks for suggestions.
Comment