![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
orthomcl error | zxl124 | Bioinformatics | 7 | 11-10-2015 09:29 PM |
Fasta with one entry per chromosome (for methylation analysis and others) | mixter | Epigenetics | 0 | 06-28-2011 02:52 AM |
Orthomcl Installation | Canadian_philosophy | Bioinformatics | 0 | 07-29-2010 08:36 AM |
MosaikBuild entry point missing | ursaan | Bioinformatics | 0 | 04-23-2010 07:08 AM |
How to estimate error rate for short-reads and base-calling duplicate? | zchou | Illumina/Solexa | 10 | 01-20-2010 09:13 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Member
Location: Queensland Join Date: Feb 2011
Posts: 14
|
![]()
I have made it up to step 10 of the orthomcl process, finding protein pairs, but have become stuck with a duplicate entry error:
Duplicate entry '5206|CNAG_00003-5206|CNE00390' for key 'better_hit_ix' at /data/ngs/apps/orthomclSoftware/bin/orthomclPairs line 693, <F> line 14 I thought maybe it was something to do with the way I ran BLAST so I reran with parameters recommended but still get an error, although a different one: Duplicate entry '5206|CNAG_00006-5206--181-0-5206-5206|CNAG_00006' for key 'ss_qtaxexp_ix' at /data/ngs/apps/orthomclSoftware/bin/orthomclPairs line 693, <F> line 14 Has anyone had a similar issue? Thanks for your help |
![]() |
![]() |
![]() |
#2 |
Junior Member
Location: los angeles Join Date: May 2012
Posts: 3
|
![]()
the duplicate entry error can be removed by selecting only the distinct rows in the similarSequence table
|
![]() |
![]() |
![]() |
#3 | ||
Junior Member
Location: London Join Date: Mar 2012
Posts: 6
|
![]() Quote:
Quote:
Last edited by guyleonard; 01-07-2013 at 03:39 AM. |
||
![]() |
![]() |
![]() |
#4 |
Junior Member
Location: los angeles Join Date: May 2012
Posts: 3
|
![]()
to check if you have duplicate entries use the following command
select * from similarSequences group by query_id,subject_id having count(*)>1; this command will give you the rows that are duplicated. Then you can create a new table that will have only distinct rows. create table holdup as select distinct * from similarSequences; |
![]() |
![]() |
![]() |
#5 | |
Junior Member
Location: Wageningen Join Date: Jan 2013
Posts: 6
|
![]()
I am also having the same errors as the OP.
Apart from trying ways to fix it I am wondering what causes the duplicate error in the orthoMCL pipeline. Given that duplicates are to be expected after an all-vs-all blast I would expect that the orthoMCL scripts would appropriately deal with them. Is this a matter of orthomclBlastParser, orthomclLoadBlast or orthomclPairs not doing a proper job? Quote:
1. Looking at those commands I assume these are to be executed within mysql, is that correct? 2. Will they simply replace the similarSequences table with itself with duplicates removed, or do I need to do more to be able to continue the analysis? 3. Will this (changing the table somewhere midway the orthoMCL pipeline) not compromise the analysis? Thanks again! |
|
![]() |
![]() |
![]() |
#6 |
Junior Member
Location: los angeles Join Date: May 2012
Posts: 3
|
![]()
Yes, the commands have to be executed on mysql.
first you need to select distinct rows in a different table(holdup table), delete all the rows in similarSequence table and then insert the rows from the holdup table. I hope this clears it |
![]() |
![]() |
![]() |
#7 | |
Junior Member
Location: Wageningen Join Date: Jan 2013
Posts: 6
|
![]() Quote:
In the meantime I had redone the whole orthoMCL pipeline and for some reason got no more duplicate errors ![]() |
|
![]() |
![]() |
![]() |
Thread Tools | |
|
|