Hello,
I apologize if this question is too simple, I am new to bioinformatics and am trying to completely my first independent project. I am trying to retrieve DNA sequences from a set of random organisms within a given taxonomic group. For example, I want to be able to input "Mammalia" and retrieve subsets of say, 5 mammalian genomes. I have been looking into the NCBI resources including the taxdump files, the Taxonomy database, and RefSeq, but am struggling to put these resources together in order to traverse a taxonomy and retrieve random sequences from different taxonomic levels.
Any hints on how/where to begin would be appreciated so much! Thank you!!
I apologize if this question is too simple, I am new to bioinformatics and am trying to completely my first independent project. I am trying to retrieve DNA sequences from a set of random organisms within a given taxonomic group. For example, I want to be able to input "Mammalia" and retrieve subsets of say, 5 mammalian genomes. I have been looking into the NCBI resources including the taxdump files, the Taxonomy database, and RefSeq, but am struggling to put these resources together in order to traverse a taxonomy and retrieve random sequences from different taxonomic levels.
Any hints on how/where to begin would be appreciated so much! Thank you!!
Comment