Seqanswers Leaderboard Ad

**TonyBrooks** · 12-10-2013, 01:59 AM

Everything should be done automatically if you set it up on your sample sheet. The assembly is carried out in BaseSpace. Although, if you go into the Run Options screen on your MiSeq, you have the ability to replicate the analysis locally - which you might want to do if not using BaseSpace.
We've found the Velvet assembly on MiSeq to be rather hit and miss in terms of quality, ranging from acceptable to very poor. We got much better assemblies using an OLC assembler than we ever got with Velvet.

**mcnelson.phd** · 12-10-2013, 05:41 AM

I'll echo Tony's statements about assemblies coming straight off the MiSeq as being very hit or miss. For one genome we once got a 2.8Mbp contig that was nearly perfect out of a 3.5Mbp genome, but we've also gotten assemblies with N50s of 2Kbp and no contigs larger than 50Kbp. A large part of the problem is that the data doesn't appear to be pre-processed in any way to trim off low quality regions or look for PCR duplicates.

I'd suggest setting up your run to produce the assembly, but also do the work yourself to compare. Most likely you'll find that you can do a much better job and be glad you didn't just rely on the system to give you an assembly.

**TonyBrooks** · 12-10-2013, 06:08 AM

We once got an N50s less than the read-length (251PE).

<AssemblyStatistics>
<NumberOfContigs>59444</NumberOfContigs>
<MeanContigLength>56.10188</MeanContigLength>
<MedianContigLength>46</MedianContigLength>
<MinContigLength>31</MinContigLength>
<MaxContigLength>560</MaxContigLength>
<BaseCount>3334920</BaseCount>
<N50>62</N50>
</AssemblyStatistics>

No idea what was going on there. All quality stats suggested it was good sequencing (12m reads from v2 500 cycle with 93% >Q30). We assembled the data offline without problems. N50's went up to 150kb.

If you want to use velvet, Nick Loman has a good guide about how to pre-process

http://pathogenomics.bham.ac.uk/blog/2009/09/tips-for-de-novo-bacterial-genome-assembly/

**Etherella** · 12-10-2013, 11:20 PM

Thanks a lot! Well, anyaway Miseq stores unaligned fastaq data, so I will be able to have a look at the automatic assembly and then, if the quality is lacking try other software or run Velvet again but with pre-process.

**nucleus** · 12-13-2013, 08:58 AM

Ive had trouble with Velvet before. The issue was not the read quality but rather the sequencing that was too deep (>~50x Velvet falls apart). I am now almost exclusively using Spades (http://bioinf.spbau.ru/spades/) which does the read corrections and assembly on one go, and dosent mind very deep coverage. Spades also gives me better results than CLC.

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Yesterday, 11:49 AM	0 responses 15 views 0 likes	Last Post by seqadmin Yesterday, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

bacterial genome assembly on Miseq

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News