**westerman** · 02-20-2012, 06:58 AM

I've never done such a project, but it sounds like an interesting one.

I would map each sample's reads against the reference and eliminate the reads that map. Then use Velvet (or another de novo assembler) to assemble the remaining reads for each sample, and use Glimmer (or another gene finder) to detect the genes.

Your idea of a 'super assembly' is a good one; however, you might get better results by first eliminating the reads that already map to the reference.
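The "eliminate the mapped reads" step can be sketched in a few lines of Python. This is only a minimal illustration, assuming the mapper writes SAM output, where bit 0x4 of the FLAG column marks an unmapped read. The function names (`unmapped_reads`, `to_fastq`) and the toy records are hypothetical, not part of any tool mentioned in this thread.

```python
import io

UNMAPPED = 0x4  # SAM FLAG bit meaning "segment unmapped"


def unmapped_reads(sam_lines):
    """Yield (name, sequence, quality) for reads flagged as unmapped."""
    for line in sam_lines:
        if line.startswith("@"):  # skip SAM header lines
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:  # a SAM record has 11 mandatory columns
            continue
        if int(fields[1]) & UNMAPPED:
            yield fields[0], fields[9], fields[10]


def to_fastq(records):
    """Format (name, sequence, quality) tuples as FASTQ text."""
    return "".join(f"@{name}\n{seq}\n+\n{qual}\n" for name, seq, qual in records)


# Toy SAM input (made up for illustration): read1 maps, read2 does not.
toy_sam = io.StringIO(
    "@SQ\tSN:ref\tLN:100\n"
    "read1\t0\tref\t5\t60\t8M\t*\t0\t0\tACGTACGT\tIIIIIIII\n"
    "read2\t4\t*\t0\t0\t*\t*\t0\t0\tTTGACCAA\tIIIIIIII\n"
)

fastq_text = to_fastq(unmapped_reads(toy_sam))
print(fastq_text)
```

The resulting FASTQ would then be the input to the de novo assembler. For paired-end data you would probably also want to keep the mate of each unmapped read, which this sketch does not do.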

**rghan** · 02-20-2012, 08:01 AM

Have you read the paper below? The paper and its supplementary material describe an interesting pipeline that might be useful to you.

http://www.nature.com/nature/journal/v477/n7365/full/nature10414.html

This links to the software pipeline they employed. We're still trying to get it to work properly in-house, but we have a much larger genome than you do.

http://mus.well.ox.ac.uk/19genomes/IMR-DENOM/

**Zam** · 02-20-2012, 12:13 PM

An alternative approach is to assemble a "graph" of all of your samples simultaneously, and then look either at the accumulation of new variants, or at which contigs are shared by which strains. Or, alternatively, you could build an assembly of your first strain by standard means, then compare this with your joint assembly of all strains and pull out "novel" contigs that differ from your original assembly. All of these are supported by this software (disclosure: I am an author):

cortexassembler.sourceforge.net

You might take a look at this paper:

http://dx.doi.org/10.1038/ng.1028

which does something similar by assembling 164 human genomes and looking for novel sequence that differs from the human reference.
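To make the "pull out novel contigs" idea concrete, here is a toy Python sketch. It is not how Cortex works internally; it swaps in a simple k-mer set comparison: collect the k-mers of the first-strain assembly, then report contigs from the joint assembly whose k-mers are mostly absent from that set. The function names, the tiny `k`, the 0.5 threshold, and the sequences are all assumptions chosen for illustration.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of an uppercase A/C/G/T string."""
    return seq.translate(COMPLEMENT)[::-1]


def canonical_kmers(seq, k):
    """Set of canonical k-mers (the smaller of a k-mer and its reverse complement)."""
    seq = seq.upper()
    kmers = set()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if set(kmer) <= set("ACGT"):  # skip k-mers containing N or other symbols
            kmers.add(min(kmer, revcomp(kmer)))
    return kmers


def novel_contigs(reference_contigs, joint_contigs, k=5, min_novel_fraction=0.5):
    """Map contig name -> fraction of its k-mers not seen in the reference assembly."""
    ref_kmers = set()
    for seq in reference_contigs.values():
        ref_kmers |= canonical_kmers(seq, k)

    novel = {}
    for name, seq in joint_contigs.items():
        kmers = canonical_kmers(seq, k)
        if not kmers:  # contig shorter than k
            continue
        fraction = len(kmers - ref_kmers) / len(kmers)
        if fraction >= min_novel_fraction:
            novel[name] = fraction
    return novel


# Made-up sequences: "shared" is a substring of the reference, "new" is not.
first_strain = {"ref1": "ACGTACGTTGCAAGCTTAGC"}
joint = {"shared": "ACGTACGTTGCA", "new": "GGGGCCCCGGTTTTAAAACC"}

result = novel_contigs(first_strain, joint)
print(result)
```

On real data you would use a much larger `k` and a threshold tuned to your sequencing error rate and strain divergence; a dedicated tool or an alignment-based comparison is likely to be more robust than this set difference.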

**Zam** · 02-20-2012, 12:14 PM

Oops, signed off too quickly - hope that made sense - feel free to email me if not (zam AT well.ox.ac.uk)

**green tree** · 02-20-2012, 03:19 PM

Hi everyone, thanks for the responses! Zam, great link and interesting paper (I was actually just thinking about this in the human population).

Finding new regions of DNA in genome assemblies
