Hi,
I currently am looking at two different strains of the same bacteria. I have RNA-seq data and reference genomes from NCBI. What I want to do is find the following information:
Very venn diagram sort of stuff here.
I tried making a blast database from A and B, then blasting them against each other, and selecting the set of genes that met criteria like high identity and significant length (over 200), but this way I'm only able to identify 100-200 genes, when there are around 5000 in these genomes.
Can anyone offer some advice or point me toward a tool that will help me accomplish this? Thanks very much in advance.
I currently am looking at two different strains of the same bacteria. I have RNA-seq data and reference genomes from NCBI. What I want to do is find the following information:
- Genes that exist in A, but not in B.
- Genes that exist in both A and B. (Including start/end positions)
- Genes that do not exist in A, but exist in B.
Very venn diagram sort of stuff here.
I tried making a blast database from A and B, then blasting them against each other, and selecting the set of genes that met criteria like high identity and significant length (over 200), but this way I'm only able to identify 100-200 genes, when there are around 5000 in these genomes.
Can anyone offer some advice or point me toward a tool that will help me accomplish this? Thanks very much in advance.
Comment