Thread | Thread Starter | Forum | Replies | Last Post |
glimmer compile error | anyone1985 | Bioinformatics | 26 | 11-29-2012 01:54 PM |
Predict 5' and 3' UTR's using RNA-seq? | James | Bioinformatics | 2 | 08-24-2011 07:22 AM |
Any pipeline to find automatically ORF in consensus sequences? | Christopher Sauvage | Bioinformatics | 6 | 05-21-2010 06:09 AM |
Help with glimmer multi-extract | sbberes | Bioinformatics | 2 | 03-19-2010 02:35 PM |
a software to predict whether a sequence is circular | dina | Bioinformatics | 0 | 09-22-2009 11:41 AM |
#1 |
Member
Location: Shanghai, China Join Date: Mar 2009
Posts: 67
A genome of about 6 Mb has about 340 Solexa contigs. I'd like to predict genes on each contig with Glimmer. I don't know whether to use a reference genome as training data, or whether I can use long-orfs to get training data from each contig. I wonder if anyone has done the same job and how I should go about it.
#2 |
Member
Location: Raleigh, NC Join Date: Nov 2008
Posts: 51
Yes, it is possible to do it using the long-orfs program, but I often just go to NCBI, retrieve the genes of a closely related strain or species from their taxonomy browser, and use those for training. Be sure to use the -r flag when running build-icm. In my experience it helps a lot.
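For anyone following along, here is a rough sketch of how those two steps could be put together. It only builds the command lines and does not run anything. The file names and the `glimmer_commands` helper are made up for illustration, the `-o50 -g110 -t30` values are the ones used in Glimmer's bundled example scripts rather than anything tuned for this genome, and `-l` (treat sequences as linear) is my assumption of what suits contigs. It also assumes a Glimmer3 build that accepts a multi-FASTA file of contigs.

```python
import shlex


def glimmer_commands(train_fasta, contigs_fasta, tag):
    """Build (but do not run) the two Glimmer3 command lines.

    train_fasta   -- genes from a closely related strain (hypothetical file name)
    contigs_fasta -- the assembled contigs (hypothetical file name)
    tag           -- prefix for the ICM and the prediction output files
    """
    icm = tag + ".icm"

    # build-icm reads the training sequences on stdin;
    # -r builds the model from the reverse of the input strings.
    build = "build-icm -r {} < {}".format(shlex.quote(icm), shlex.quote(train_fasta))

    # -l: no wraparound (assumed appropriate for contigs).
    # -o50 -g110 -t30: max overlap, min gene length, score threshold,
    # copied from Glimmer's example scripts, not tuned values.
    predict = " ".join(
        ["glimmer3", "-l", "-o50", "-g110", "-t30",
         shlex.quote(contigs_fasta), shlex.quote(icm), shlex.quote(tag)]
    )
    return [build, predict]


if __name__ == "__main__":
    for cmd in glimmer_commands("related_genes.fasta", "contigs.fasta", "run1"):
        print(cmd)
```

Paste the printed lines into a shell once you have checked them against the options of your installed Glimmer version.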
#3 | |
Member
Location: Shanghai, China Join Date: Mar 2009
Posts: 67
Yes, Glimmer with the training data works perfectly. However, the contigs are only parts of the genome, so the training data may not be suitable for every contig. Or should I do the training separately for each contig?