**GenoMax** · 02-29-2016, 08:17 AM

Try SPAdes first (some would recommend Velvet instead). If you have enough coverage and the data are good quality, start there. If you don't get good results, then you can start looking at other options.

If you have a good reference, you could just align the reads to it and see what that gets you.

I would also recommend that you look at Mauve for genome-wide comparisons.
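Whether there is "enough coverage" can be checked with simple arithmetic before any assembly is run. The sketch below is only an illustration: the read count and read length are made-up numbers, and the genome size is the ~4.2 Mb figure given later in this thread. What depth counts as sufficient depends on the assembler and the data, so no threshold is assumed here.

```python
def estimate_coverage(n_reads, read_length, genome_size):
    """Mean sequencing depth = total sequenced bases / genome length."""
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    return n_reads * read_length / genome_size


# Hypothetical run: 2.8 million reads of 150 bp against a ~4.2 Mb genome.
depth = estimate_coverage(n_reads=2_800_000, read_length=150, genome_size=4_200_000)
print(f"estimated mean coverage: {depth:.0f}x")
```

For paired-end data, count both mates as reads (or double the number of pairs).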

**jerrybug109** · 02-29-2016, 09:11 AM

Thanks!

Someone also suggested that I could de novo assemble all 40 strains and then map the resulting contigs onto a reference genome afterwards.

Is this sort of "hybrid" approach preferable to just doing an alignment? It seems like there are numerous approaches out there, but I'm not well versed enough to gauge the general consensus yet.

Our later downstream goal with the assembled genomes is to look for genetic variation between the strains.

Thanks!

**GenoMax** · 02-29-2016, 10:35 AM

Are your genomes true strains of the reference (i.e. you expect them to align with high % identity across the entire genome)?

In your case you can probably approach the ultimate goal of SNP calls from more than one direction, but if the answer to the question above is yes, then aligning to the reference followed by SNP calling may be the most straightforward way. If you start seeing gaps or unaligned reads, that could give you an idea of how much the strains vary from the reference. At that point individual assemblies can be tried for a more complete picture, followed by Mauve analysis.
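As a rough sketch of the align-then-call route described above, the helper below only builds the shell commands for one strain and runs nothing. The choice of BWA, samtools and bcftools is an assumption, since the thread does not name an aligner or variant caller, and the file names and the `reference_snp_commands` helper are hypothetical. `--ploidy 1` is included on the assumption that a haploid bacterial genome is being called.

```python
import shlex


def reference_snp_commands(reference, reads_1, reads_2, sample, threads=4):
    """Return shell commands: index, align + sort, index BAM, flagstat, call SNPs."""
    q = shlex.quote
    bam = f"{sample}.sorted.bam"
    vcf = f"{sample}.vcf.gz"
    return [
        # Build the BWA index for the reference (needed once per reference).
        f"bwa index {q(reference)}",
        # Align paired-end reads and write a coordinate-sorted BAM.
        f"bwa mem -t {threads} {q(reference)} {q(reads_1)} {q(reads_2)}"
        f" | samtools sort -o {q(bam)} -",
        f"samtools index {q(bam)}",
        # flagstat reports how many reads mapped; many unmapped reads
        # suggest the strain differs substantially from the reference.
        f"samtools flagstat {q(bam)}",
        # Pile up against the reference and call variant sites only.
        f"bcftools mpileup -f {q(reference)} {q(bam)}"
        f" | bcftools call -mv --ploidy 1 -Oz -o {q(vcf)}",
    ]


# Hypothetical file names for a single strain.
for cmd in reference_snp_commands("ref.fa", "s1_R1.fastq.gz", "s1_R2.fastq.gz", "s1"):
    print(cmd)
```

The printed lines can be pasted into a shell or written to a script. The mapped-read percentage from `samtools flagstat` is one way to see the "unaligned reads" signal mentioned above.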

**jerrybug109** · 02-29-2016, 10:42 AM

Our genomes are individual strains of Bacillus subtilis and the reference genome is also Bacillus subtilis, so I think the answer to your question is yes.

Thanks.

Since the genomes are only ~4.2 megabases long, it seems like it would take a fairly trivial amount of running time to assemble them using SPAdes or Velvet/VelvetOptimiser. Maybe we'll go ahead and just do the individual assemblies anyway. I'll try it out a couple of ways!
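Running one assembly per strain is easy to script. The following is a minimal sketch that assumes paired-end reads and builds one `spades.py` command per strain without executing anything. The strain names, file paths and the `spades_commands` helper are hypothetical; only the `-1`, `-2`, `-o` and `-t` options of SPAdes are used.

```python
def spades_commands(samples, out_root="assemblies", threads=4):
    """Build one spades.py argument list per strain.

    samples maps a strain name to a (forward_reads, reverse_reads) pair.
    """
    commands = []
    for strain, (reads_1, reads_2) in sorted(samples.items()):
        commands.append([
            "spades.py",
            "-1", reads_1,                  # forward reads
            "-2", reads_2,                  # reverse reads
            "-o", f"{out_root}/{strain}",   # one output directory per strain
            "-t", str(threads),
        ])
    return commands


# Hypothetical inputs for two of the 40 strains.
samples = {
    "strain02": ("strain02_R1.fastq.gz", "strain02_R2.fastq.gz"),
    "strain01": ("strain01_R1.fastq.gz", "strain01_R2.fastq.gz"),
}
for cmd in spades_commands(samples):
    print(" ".join(cmd))
```

Each list can be passed to `subprocess.run(cmd, check=True)` once SPAdes is installed and on the `PATH`.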


help needed: reference guided assembly of bacterial genomes?
