**GenoMax** · 03-15-2016, 02:52 PM

You may be able to do this by finding strain-specific k-mers: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4005670/
The BBMap suite has k-mer identification programs you can use, in addition to the programs in the paper above.
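
To make the idea of strain-specific k-mers concrete, here is a minimal Python sketch. It is not how BBMap or the programs in the linked paper work; it is a toy illustration that assumes you already have one sequence per strain in memory, and the names `kmers`, `strain_specific_kmers`, and the toy sequences are made up for this example. It ignores reverse complements and sequencing errors, which real tools have to handle.

```python
def kmers(seq, k):
    """Return the set of all k-mers (length-k substrings) in seq."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def strain_specific_kmers(sequences, k):
    """Map each strain name to the k-mers that occur in that strain only.

    sequences: dict of strain name -> sequence string (hypothetical input).
    """
    kmer_sets = {name: kmers(seq, k) for name, seq in sequences.items()}
    specific = {}
    for name, own in kmer_sets.items():
        # Union of the k-mers seen in every other strain.
        others = set().union(*(s for n, s in kmer_sets.items() if n != name))
        specific[name] = own - others
    return specific


# Toy sequences differing at a single base (made-up data).
toy = {"strain1": "ACGTACGTTA", "strain2": "ACGTACGTGA"}
result = strain_specific_kmers(toy, k=4)
```

With real data you would use a much larger k and count k-mers from error-corrected reads or assemblies rather than short strings.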

**Brian Bushnell** · 03-15-2016, 09:22 PM

You can substantially reduce the sequencing error rate by error-correcting the data, for example with Tadpole. 18x coverage is pretty low for good error correction or assembly, though. If the reads mostly overlap, you can also achieve some degree of error correction by merging them, e.g. with BBMerge.

I would probably assemble each strain (using adapter-trimmed, error-corrected, merged [if they mostly overlap] reads), and then do all 16 mappings of reads to assemblies to estimate SNP rates from pairwise error rates. For example, if strain 1 has a 0.1% substitution rate when mapped to its own assembly and strain 2 has a 0.7% substitution rate when mapped to strain 1's assembly, then there is probably a 0.6% SNP rate between strain 1 and strain 2 (0.7% minus the 0.1% background).
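
The subtraction described above can be sketched in a few lines of Python. This is only an illustration of the arithmetic, assuming you have already collected the substitution rate (in percent) for every reads-versus-assembly mapping; the function name `estimate_snp_rates` and the table layout are hypothetical, and every number other than the 0.1% and 0.7% from the example is made up.

```python
def estimate_snp_rates(sub_rate):
    """Estimate pairwise SNP rates from mapping substitution rates.

    sub_rate[reads][assembly] is the substitution rate (percent) observed
    when the reads of one strain are mapped to the assembly of another.
    The self-mapping rate of the assembly's own strain is treated as the
    background error rate and subtracted.
    """
    estimates = {}
    for reads, row in sub_rate.items():
        for assembly, rate in row.items():
            if reads == assembly:
                continue  # self-mapping only provides the background rate
            background = sub_rate[assembly][assembly]
            estimates[(reads, assembly)] = rate - background
    return estimates


# Rates in percent. The strain1 column matches the example in the post;
# the strain2 column is invented to complete the table.
rates = {
    "strain1": {"strain1": 0.1, "strain2": 0.8},
    "strain2": {"strain1": 0.7, "strain2": 0.2},
}
snp = estimate_snp_rates(rates)
print(round(snp[("strain2", "strain1")], 3))  # strain 2 reads vs strain 1 assembly
```

With four strains the same table would have 16 entries, and the two estimates for each pair (A reads on B's assembly, B reads on A's assembly) can be compared as a sanity check.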

Determine similarity from NGS data
