SEQanswers

Go Back   SEQanswers > Bioinformatics > Bioinformatics



Similar Threads
Thread Thread Starter Forum Replies Last Post
ERANGE and other packages for RNAseq analysis warrenemmett RNA Sequencing 9 07-02-2013 12:58 PM
Software packages capable of aligning roughly 9000 bp josecolquitt Bioinformatics 4 05-18-2010 04:17 AM
DNAnexus free account: next-gen sequence analysis in the cloud DNAnexus Vendor Forum 0 04-27-2010 10:46 PM
Sequence Analysis Software Developer Cofactor Genomics Industry Jobs! 0 01-27-2010 09:02 AM
Companies offering next gen sequence analysis services gavin.oliver Bioinformatics 8 01-12-2010 04:27 AM

Closed Thread
 
Thread Tools
Old 07-10-2008, 12:43 AM   #41
zee
NGS specialist
 
Location: Malaysia

Join Date: Apr 2008
Posts: 249
Default

Looking good. This site is very useful. Will anybody in this community be getting together at the ISMB Toronto 2008 meeting?
zee is offline  
Old 07-15-2008, 12:24 AM   #42
jkbonfield
Senior Member
 
Location: Cambridge, UK

Join Date: Jul 2008
Posts: 143
Default

Quote:
Originally Posted by RudyS View Post
anybody have scoops on possible software upgrade (GAP5?) for Staden package?
I just noticed this forum - seems like a little gem :-)

Gap5 development is in progress, although it's slow as I keep getting distracted with SRF and ZTR trace formats. Currently there's not official public release of Gap5 itself, just of the text terminal viewer using some of the same code (see below).

So far the basic underlying storage and searching methods exist plus a basic contig editor. It's definitely fast and very efficient (both memory and cpu) compared to Gap4, but Gap4 has one key advantage - it's finished (as much as anything ever is)!

I should probably work on getting a publically useable release ready sometime, at least for testing purposes. The text-mode version which shares the same file format but is just a simple curses-based viewer can be downloaded from https://sourceforge.net/project/show...kage_id=256957 although it's a tad out of date.

James
jkbonfield is offline  
Old 07-15-2008, 05:07 PM   #43
sci_guy
Member
 
Location: Sydney, Australia

Join Date: Jan 2008
Posts: 83
Default

Updated table. Please inform me if I have missed something.
sci_guy is offline  
Old 07-15-2008, 06:13 PM   #44
sparks
Senior Member
 
Location: Kuala Lumpur, Malaysia

Join Date: Mar 2008
Posts: 126
Default

Quote:
Originally Posted by sci_guy View Post
Updated table. Please inform me if I have missed something.
Hi, I think novocraft should be in align/assemble.
sparks is offline  
Old 07-15-2008, 06:18 PM   #45
sci_guy
Member
 
Location: Sydney, Australia

Join Date: Jan 2008
Posts: 83
Default

Good spotting. Updated.
sci_guy is offline  
Old 07-16-2008, 07:11 AM   #46
bioinfosm
Senior Member
 
Location: USA

Join Date: Jan 2008
Posts: 482
Default

Quote:
Originally Posted by sci_guy View Post
Updated table. Please inform me if I have missed something.
THere is the NextGENe from Softgenetics that looks pretty good with the condensation of reads
bioinfosm is offline  
Old 07-16-2008, 02:35 PM   #47
sci_guy
Member
 
Location: Sydney, Australia

Join Date: Jan 2008
Posts: 83
Default

Thanks Bioinfosm. Now updated.
sci_guy is offline  
Old 07-17-2008, 10:38 AM   #48
mchaisso
Member
 
Location: Seattle, WA

Join Date: Apr 2008
Posts: 84
Default

Quote:
Originally Posted by zee View Post
Looking good. This site is very useful. Will anybody in this community be getting together at the ISMB Toronto 2008 meeting?
I'll give a short presentation on EULER-SR at the short read SIG.
mchaisso is offline  
Old 07-21-2008, 07:45 PM   #49
ScottC
Senior Member
 
Location: Monash University, Melbourne, Australia.

Join Date: Jan 2008
Posts: 246
Default

[Edit: Just noticed James' post above in response to the GAP5 query!]

Last edited by ScottC; 07-21-2008 at 08:02 PM.
ScottC is offline  
Old 08-06-2008, 11:18 AM   #50
nclement
Junior Member
 
Location: Utah

Join Date: Jun 2008
Posts: 3
Default

There's another program as well for mapping short reads called gnumap (http://dna.cs.byu.edu/gnumap/) made to increase the accuracy with duplicate matches. Open source, creates viewable output (with Affy's Integrated Genome Browser), and produces results very similar to novocraft's.
nclement is offline  
Old 08-07-2008, 12:00 PM   #51
lh3
Senior Member
 
Location: Boston

Join Date: Feb 2008
Posts: 693
Default

Another good program: ZOOM! Zillions of oligos mapped, which is online now at Bioinformatics. ZOOM resembles Eland a lot, but it further improves the spaced-seed method. I think SOLiD read mapper also uses quite a similar strategy of spaced seed, but it indexes genome, while ZOOM indexes reads like Eland.

Also, I believe ZOOM is carefully engineered. Outperforming eland with a similar algorithm is non-trivial, even given the advantages in the algorithm. I just wonder how ZOOM-C/I/P may perform. The authors did not give benchmark.

Last edited by lh3; 08-07-2008 at 12:04 PM.
lh3 is offline  
Old 08-07-2008, 12:07 PM   #52
apfejes
Senior Member
 
Location: Oakland, California

Join Date: Feb 2008
Posts: 236
Default

I've seen the manuscript of their ZOOM publication, which looks impressive, and followed up with someone at the company that produces it - I was told that the software wasn't yet available, and might not be for some period of time.

Unfortunately, I get the impression ZOOM might be vapourware for the forseable future - though if anyone knows more than I do, please feel free to correct me.
__________________
The more you know, the more you know you don't know. —Aristotle

Last edited by apfejes; 08-07-2008 at 12:11 PM.
apfejes is offline  
Old 08-07-2008, 12:41 PM   #53
lh3
Senior Member
 
Location: Boston

Join Date: Feb 2008
Posts: 693
Default

I do not know more that you, apfejes. I just wonder why they do not release the program when the benchmark is ready. This might be the best time for "short" read aligners. In a year time (probably half-a-year time), the length of Illumina reads will come to >70bp which will effectively cripple a lot of current programs, including maq unfortunately.
lh3 is offline  
Old 08-07-2008, 12:54 PM   #54
apfejes
Senior Member
 
Location: Oakland, California

Join Date: Feb 2008
Posts: 236
Default

Thanks - all I know is that when I spoke to my contact at Bioinformatics Solutions, they were using the core of the software to generate benchmarks, but were still working on the application itself. A lot of my questions weren't answered, unfortunately.

Either way, I agree, once we pass 64bp reads (I've heard we're going to start doing 72bp test runs next week), we're going to leave the realm of short read aligners and need to start dealing with medium length (100-1000bp) read aligners, anyhow. (I'm not sure what's in that space, though: blast, blat, exonerate?)

As an aside, I'll save the term "long read alignments" for when Pacific Biosciences releases their SMRT (single molecule - real time) sequencing machine. 5-25k reads are about as long as I expect are going to be necessary for any application, though at that point, you're probably better off doing assembly than alignments.
__________________
The more you know, the more you know you don't know. —Aristotle
apfejes is offline  
Old 08-08-2008, 09:53 AM   #55
zee
NGS specialist
 
Location: Malaysia

Join Date: Apr 2008
Posts: 249
Default

I will be looking out for some benchmark data for > 70bp Illumina runs to test future versions of novoalign/novopaired on. We had anticipated this would be coming and have some ideas on how to handle these.
If anybody could arrange some , public or under NDA, then we'd like that very much to get something out to the community.


Quote:
Originally Posted by lh3 View Post
I do not know more that you, apfejes. I just wonder why they do not release the program when the benchmark is ready. This might be the best time for "short" read aligners. In a year time (probably half-a-year time), the length of Illumina reads will come to >70bp which will effectively cripple a lot of current programs, including maq unfortunately.
zee is offline  
Old 08-08-2008, 10:13 AM   #56
apfejes
Senior Member
 
Location: Oakland, California

Join Date: Feb 2008
Posts: 236
Default

Hi zee,

I haven't seen the runs yet - but if you message me in a week or two, I might be able to let you know how things ran here.

Actually, make that early September - I'm on holiday for the last 2 weeks of august. (-:
__________________
The more you know, the more you know you don't know. —Aristotle
apfejes is offline  
Old 08-08-2008, 12:23 PM   #57
spirit
Member
 
Location: Canada

Join Date: Feb 2008
Posts: 11
Default

Hi, lh3 and apfejes. Thanks for you opinion on ZOOM . I am one of the developers of ZOOM. In fact, we will release the command-line version of ZOOM next week.

We finished the part of ZOOM dealing with Illumina/Solexa data in January of this year. Due to some personal reason, we came back to it until May. Then we found the release of ABI SOLiD data. We wanted to support color space data in ZOOM. After that, we focused on the GUI part, manual, website... We are trying to release an efficient, useful and easy-to-use tool. So, sorry for letting you wait. The good news is it will appear next week.

Welcome to try ZOOM out. We'd like to provide any help we could.

Quote:
Originally Posted by lh3 View Post
I do not know more that you, apfejes. I just wonder why they do not release the program when the benchmark is ready. This might be the best time for "short" read aligners. In a year time (probably half-a-year time), the length of Illumina reads will come to >70bp which will effectively cripple a lot of current programs, including maq unfortunately.
spirit is offline  
Old 08-08-2008, 12:30 PM   #58
apfejes
Senior Member
 
Location: Oakland, California

Join Date: Feb 2008
Posts: 236
Default

Thanks for the update, spirit.

Maybe you could give us a little bit of information on Zoom, as well, since things may have changed since last time I heard anything about it.

What are the longest and shortest reads it can handle effectively?
how does it compare to Eland or MAQ in reads aligned per minute?
How many mismatches does it handle?
Does it have a gapped mode?
What format is required for the reference genome?
What format are the alignments reported in?
Can you comment on the cost/licenses it will be provided under?
Can you give us the link to the download when it's ready?

I'm sure I'm missing other important information, but those are the first questions that occur to me.

Thanks!
__________________
The more you know, the more you know you don't know. —Aristotle
apfejes is offline  
Old 08-08-2008, 12:31 PM   #59
spirit
Member
 
Location: Canada

Join Date: Feb 2008
Posts: 11
Default

And may I ask a question? What is the pair-end data of Illumina/Solexa data like? ZOOM now accepts two types of pair-end reads input. One is two fasta input files recording reads from two ends separately. ZOOM will automatically find the reads paired according to their name before mapping. The other is one fasta input files with two reads of a pair appearing in odd line and even line respectively. Thanks.

Quote:
Originally Posted by lh3 View Post
Another good program: ZOOM! Zillions of oligos mapped, which is online now at Bioinformatics. ZOOM resembles Eland a lot, but it further improves the spaced-seed method. I think SOLiD read mapper also uses quite a similar strategy of spaced seed, but it indexes genome, while ZOOM indexes reads like Eland.

Also, I believe ZOOM is carefully engineered. Outperforming eland with a similar algorithm is non-trivial, even given the advantages in the algorithm. I just wonder how ZOOM-C/I/P may perform. The authors did not give benchmark.
spirit is offline  
Old 08-08-2008, 12:51 PM   #60
apfejes
Senior Member
 
Location: Oakland, California

Join Date: Feb 2008
Posts: 236
Default

I suppose the PET comes out in the format provided by whichever base caller you use. I imagine it wouldn't be too difficult to convert to the formats ZOOM requires.

I believe that people here are starting to look at alternate base calling programs, so maybe someone who knows more than I do can point you in the direction of the documentation for those applications.
__________________
The more you know, the more you know you don't know. —Aristotle
apfejes is offline  
Closed Thread

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off




All times are GMT -8. The time now is 12:55 AM.


Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2017, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO