Seqanswers Leaderboard Ad

**flobpf** · 04-14-2011, 11:13 AM

Nice tool

Plantagora is a really useful tool for simulations. I want to use the scripts for denovo assembling a genome though.If I provide the assembly_run.sh script with my own sets of reads, would it assemble it into a regular assembly?

Also, does the Plantagora use Abyss for hybrid Illumina+454 assembly? How do I set it up for that?

Thanks
Flobpf

**azroger** · 04-18-2011, 10:27 AM

Hi-
Thanks for the comment. The assembly_run.sh (which I didn't write) was designed to work with the Plantagora datasets to run multiple assembly runs. It may be most useful to look at the script and (if you can -- I'm not exactly expert at this) either edit it to your needs, or try using it with some of your own datasets with your own inputs and settings. In the end, though, if you're not doing a lot of different runs, then you can take the commands as they are written in the script and put in your own settings as you want and run the assemblies directly. For example, for abyss, the command in there is time mpirun -np 4 abyss-pe $params name=$header. You can leave out the time command if you don't want to time it, and in some cases you may not want or need to use mpi for a parallel run (which in this example is set to run on 4 processors. You have to have openmpi installed to do it. I have been studying Abyss and it has a lot of subprograms that it uses, one of which is abyss-pe. Abyss has to be installed and abyss-pe in the path environment to run the command. Otherwise you can try running it with just --help and it will tell you about the options. The options set in the file as it is distributed on the website (I think) are -j2 n=2 k=$k, where k is the kmer size which is something you may want to try to optimize, because it can make a big difference. You may already know a lot of this, but some of it is not too obvious when you first look at Abyss.

The interesting thing about abyss-pe is that it is a makefile, and it can be edited and you can also run the commands it uses independently, because it really just runs through a series of commands that invoke some of the other subprograms that also have to be in the path environment for abyss-pe to run properly -- they are in a bunch of subfolders of the abyss install. I believe the default command series will be spit out if you give it the option --dry-run. You can break down the commands and even replace some of them with other aligners or mappers, like bowtie. I'm trying to figure out at this point how best to use this.

In any case, Plantagora uses Abyss for the hybrid Illumina+454 assemblies, and some of them produce scaffolds even over 100,000 bp, although the scaffoldN50's are considerably lower than this. Abyss is one of the few assemblers that can readily make use of the combined data. I have been told by another group that you can convert Illumina reads to .sff files and use them with Newbler. They had trouble running the combo so far, but that is because the memory usage is really heavy for this combination. I don't know how efficiently Newbler can use the smaller reads, either. It does not use a de Bruijn graph or kmers, like the small read assemblers generally do. But it may work fine under some conditions.

**flobpf** · 04-18-2011, 10:34 AM

Thanks Roger. Thats answers a lot of my questions.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 59 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 57 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 53 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 56 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Plantagora

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News