Seqanswers Leaderboard Ad

**nilshomer** · 03-15-2011, 07:43 PM

I just found this like (Draft 1): http://korflab.ucdavis.edu/Datasets/...n_analysis.pdf.

**Graham Etherington** · 03-21-2011, 03:51 AM

The full results from the Assemblathon can be found at:

**jnfass** · 05-18-2011, 10:02 AM

Linked to Genome10K Project

Hi all,

This was actually a collaborative effort between David Haussler's group at UCSC, Ian Korf's lab here at UC Davis, and the UC Davis Genome Center's Bioinformatics Core. David Haussler initiated the collaboration to complement the recent Genome10K Project meeting this past March, and we discussed the results at the Genome Assembly Workshop attached to that meeting. There will be a paper discussing the results in great detail - it's in preparation now. Finally, the Assemblathon "competition" was meant to be the first of many; Assemblathon 2 is slated to start later in the summer and wrap in the fall sometime. As far as I understand, the Broad Institute and BGI are contributing novel sequence data from previously unsequenced organisms, to be used in Assemblathon 2.

**iankorf** · 05-18-2011, 10:17 AM

Assemblathon 2 data will be released June 1 (a fish, bird, and snake). Groups will then have until September 1 to assemble the genomes. The results will be announced at CSHL Genome Informatics in November. These are the plans, and I hope we don't fall behind schedule. Please check out the website and join the mailing list if you're interested.

**BaCh** · 05-19-2011, 12:46 AM

Originally posted by iankorf View Post

Assemblathon 2 data will be released June 1 (a fish, bird, and snake). Groups will then have until September 1 to assemble the genomes. The results will be announced at CSHL Genome Informatics in November. These are the plans, and I hope we don't fall behind schedule. Please check out the website and join the mailing list if you're interested.

How about adding some smaller genomes? Like one or two bacteria and one or two small eukaryotes (yeasts, fungi).

There is a definitive bias from the organizers of both the Assemblathon and dnGASP to "think big" whereas having a look at smaller things - which are supposedly easy - may also be very ... interesting.

B.

**iankorf** · 05-19-2011, 05:14 AM

The first (and second) Assemblathon were born out of the needs of the G10K project. We aren't thinking big as much as we are thinking vertebrate. But you're absolutely correct: there are small assembly problems that are also important. We'll get there soon.

**jkbonfield** · 05-19-2011, 05:47 AM

I'd also say depth is important.

Some assemblers basically take the approach of sheer depth alone is enough to ensure that any sequence with an error becomes irrelevant as there's probably another sequencing spanning the same region that is error free. This technique does indeed work, but it's very costly to implement. So some assemblies of lower depth sets would be nice too.

Then there are issues of library sizes, singular size or mix, etc. It's a large field to survey basically. Anyway more variety could be interesting. I suspect no one assembler will "win", but rather some will have their own particular niche.

**jnfass** · 05-19-2011, 10:03 AM

Library type / depth issues

Some of the parameters of the data (library insert sizes, depths) are determined more by the parties who are willing to donate novel data "to the cause," rather than pure ab initio considerations of what data people would like to see (based on their own focus, or what kind of data is usually available to them). This is a little unfortunate, as it constrains the input to what a sub-population of the larger assembling community would prefer.

In addition, we hesitate to include too many options / sub-problems in the competition, as this increases the workload of the evaluators (who may or may not be funded for their Assemblathon-related efforts).

But, as Ian said, we'll probably get there in future Assemblathons, because the issues you mention are definitely interesting to many people, and may also have relevance for the Genome10K Project (metagenomic assemblies of microbes and vertebrate host?, mitochondrial assemblies?).

~Joe

**BaCh** · 05-20-2011, 01:25 AM

Originally posted by jnfass View Post

... (metagenomic assemblies of microbes and vertebrate host?, mitochondrial assemblies?).

Oh. My. God. Noooo! No mitochondria or chloroplasts.

Include mitochondrial and chloroplast data only if you feel sadistic and want to see assembly programs (and then evaluators) sweat: host contamination which was not filtered away; very high, but uneven coverage (maybe due to GC content); genetic variations in sequenced samples (like ploidy, but worse); repeats; etc.pp

B.

PS: let's see whether reverse psychology works

PPS: I still think that small and "easy" well-known bacterial or fungal genomes should be part of any evaluation ... simply because it also gives the evaluators and then readers of the results a warm and fuzzy feeling on how well actually the evaluation process works. I'll wait for Assemblathon 3 then.

**kbradnam** · 05-20-2011, 08:51 AM

I'd add to Joe and Ian's comments by saying that it's great the genome assembly community has a thirst for tackling lots of different areas of genome assembly. We'd like to address all areas of sequence assembly, but we had to start somewhere. Indeed, part of the goal of Assemblathon 1 was just to see whether it was even possible to get a group of people to all work on the same problem at once.

Going forward, people should feel free to approach the Assemblathon organizers ideas and suggestions, though ideally we'd like to hear from people who have – or will have – short read data that can be used in future Assemblathons.

Finally, I'd ask that if people want to be kept in the loop on Assemblathon discussions then they should join the Assemblathon mailing list: http://assemblathon.org/pages/mailing-list

I also write the occasional short blog post on the Assemblathon website which can be subscribed to as an RSS feed, and there is also the Assemblathon twitter account.

**jstjohn** · 06-01-2011, 10:35 PM

assemblathon 2

Data is now posted for Assemblathon 2, the submission date is September 1st.

http://assemblathon.org/assemblathon-2-begins-today

**mjp** · 09-21-2011, 01:09 AM

Assemblathon 1: A competitive assessment of de novo short read assembly methods

I don't think I'm the first one to spot this in the press but thought it may be relevant to the thread.

Assemblathon 1: A competitive assessment of de novo short read assembly methods

http://genome.cshlp.org/content/early/2011/09/16/gr.126599.111.abstract

An international, peer-reviewed genome sciences journal featuring outstanding original research that offers novel insights into the biology of all organisms

**Mahtab** · 12-19-2011, 09:14 PM

Hi All

I'm trying to reproduce some of Assemblathon 1 results and so far the metrics (N50 , NG50) I'm getting for SOAPdenovo are far from what has been reported. UCDavis people told me they don't have the parameters that the assemblers were run with. I emailed BGI but did not get a reply back. Any suggestions on parameter setting( K-mer size, which libraries to use for contig, scaffold creation and....) for Assemblathon 1 data?
Thanks in advance.

Topics	Statistics	Last Post
Evaluating Genome Sequencing for ECMO Patients in the NICU by seqadmin Started by seqadmin, 12-17-2024, 10:28 AM	0 responses 33 views 0 likes	Last Post by seqadmin 12-17-2024, 10:28 AM
New Genetic Toolkit Refines Studies on Gene Function and Disease by seqadmin Started by seqadmin, 12-13-2024, 08:24 AM	0 responses 49 views 0 likes	Last Post by seqadmin 12-13-2024, 08:24 AM
Study Links Brain Mechanism to Emotional Responses in Animals and Humans by seqadmin Started by seqadmin, 12-12-2024, 07:41 AM	0 responses 34 views 0 likes	Last Post by seqadmin 12-12-2024, 07:41 AM
Study Identifies Ribosomal RNA Fingerprints as Early Cancer Biomarkers by seqadmin Started by seqadmin, 12-11-2024, 07:45 AM	0 responses 46 views 0 likes	Last Post by seqadmin 12-11-2024, 07:45 AM

Seqanswers Leaderboard Ad

Announcement

Assemblathon: Collaborative Assembler Comparison!

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News