Go Back   SEQanswers > Bioinformatics > Bioinformatics

Similar Threads
Thread Thread Starter Forum Replies Last Post
Relating OrthoMCL clusters with Phylogenetics Shishir Bioinformatics 5 02-10-2015 08:08 AM
Orthomcl for more then 2 species figo1019 Bioinformatics 0 01-08-2013 01:51 AM
If orthomcl is right anyone1985 Bioinformatics 0 03-22-2012 09:37 PM
clusters per tile to clusters mm2 niceday General 3 07-27-2011 06:35 AM
Orthomcl Installation Canadian_philosophy Bioinformatics 0 07-29-2010 07:36 AM

Thread Tools
Old 10-28-2013, 08:16 PM   #1
Junior Member
Location: Seattle

Join Date: Jan 2013
Posts: 6
Default Trouble with Orthomcl Clusters

I recently did analysis on over 100 genomes within the same phylum using Orthomcl. Sadly, once I finally got the results a couple of weeks later, I discovered that several clusters appeared to have the same function. In fact, in one instance, there were 11 clusters that were likely fructose-2,6-bisphosphatase. I played around with the inflation value a little and found that by it resulted in clusters that appeared to be too mixed in terms of function yet there were still several repeat function clusters. Clearly, this could be a result of bad annotation, but I wanted to see if anyone has had similar problems with Orthomcl cluster prediction.

jflowers002 is offline   Reply With Quote
Old 10-31-2013, 10:55 AM   #2
Junior Member
Location: Seattle

Join Date: Jan 2013
Posts: 6

Well, I will answer my own question. One inherent issue that i believe caused this trouble was my blast parameters. I had 100 genomes and I only allowed 250 hits since I was concerned about diskspace and time. Orhtomcl acutally recommends not limiting this (

I figured that 2.5 orthologs in each genome for the same gene was enough, but maybe not. I am currently rerunning the process with a larger allowed hits and hopefully this will fix it.
jflowers002 is offline   Reply With Quote
Old 01-28-2014, 09:24 PM   #3
Location: Washington DC

Join Date: Jan 2011
Posts: 17
Default following up...

I'm interested to see how this has worked out for you now. Did it solve the problem?
bckirkup is offline   Reply With Quote
Old 01-29-2014, 12:42 AM   #4
Senior Member
Location: sub-surface moon base

Join Date: Apr 2013
Posts: 372

Depending on your research question, it might be a better idea to cluster your proteins based on hmmer searches against Pfam. This is something that can take minutes on a 4 core laptop (with say 100 bacterial genomes) vs. days on a 100 core cluster (when going with something blast-based).
rhinoceros is offline   Reply With Quote

bioinformatics analysis, orthomcl, protein clustering

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

All times are GMT -8. The time now is 06:46 PM.

Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2021, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO