SEQanswers (
-   Bioinformatics (
-   -   Trouble with Orthomcl Clusters (

jflowers002 10-28-2013 09:16 PM

Trouble with Orthomcl Clusters
I recently did analysis on over 100 genomes within the same phylum using Orthomcl. Sadly, once I finally got the results a couple of weeks later, I discovered that several clusters appeared to have the same function. In fact, in one instance, there were 11 clusters that were likely fructose-2,6-bisphosphatase. I played around with the inflation value a little and found that by it resulted in clusters that appeared to be too mixed in terms of function yet there were still several repeat function clusters. Clearly, this could be a result of bad annotation, but I wanted to see if anyone has had similar problems with Orthomcl cluster prediction.


jflowers002 10-31-2013 11:55 AM

Well, I will answer my own question. One inherent issue that i believe caused this trouble was my blast parameters. I had 100 genomes and I only allowed 250 hits since I was concerned about diskspace and time. Orhtomcl acutally recommends not limiting this (

I figured that 2.5 orthologs in each genome for the same gene was enough, but maybe not. I am currently rerunning the process with a larger allowed hits and hopefully this will fix it.

bckirkup 01-28-2014 10:24 PM

following up...
I'm interested to see how this has worked out for you now. Did it solve the problem?

rhinoceros 01-29-2014 01:42 AM

Depending on your research question, it might be a better idea to cluster your proteins based on hmmer searches against Pfam. This is something that can take minutes on a 4 core laptop (with say 100 bacterial genomes) vs. days on a 100 core cluster (when going with something blast-based).

All times are GMT -8. The time now is 05:00 AM.

Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2021, vBulletin Solutions, Inc.