Simulating a genome with known SNPs

lovettse

Junior Member

Join Date: May 2014

Posts: 2
#1


07-10-2014, 09:59 AM

Does anyone know of a way to generate a simulated bacterial genome with known SNPs and indels relative to a given reference? I'd like to use these simulated genomes to benchmark various SNP-calling pipelines. It would, after all, be much easier to trust a particular SNP call if I know for a fact that the SNP is there. I'm using PacBio data, so the various tools for simulating short-read datasets seem designed to solve a different problem from the one I have.

If you think I'm going about this in entirely the wrong way, I'm willing to listen.
Tags: bacteria, pacbio, snps
vivek_

PhD Student

Join Date: Jul 2012

Posts: 164
#2

07-10-2014, 10:03 AM

If you have a VCF file of known variants and a reference FASTA, you can use GATK's FastaAlternateReferenceMaker.
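To make the idea concrete, here is a minimal Python sketch of what applying known variants to a reference amounts to. It is not GATK's implementation; the function name `apply_variants` and the `(pos, ref, alt)` tuple layout are made up for illustration, loosely following the VCF convention of 1-based positions with explicit REF and ALT alleles. It assumes a single sequence held in memory and non-overlapping variants.

```python
def apply_variants(reference, variants):
    """Return a copy of `reference` with the given variants applied.

    `variants` is an iterable of (pos, ref, alt) tuples, where `pos` is the
    1-based position of the first base of `ref` (VCF-style). This layout is
    an assumption of this sketch, not any tool's actual API.
    """
    seq = reference
    prev_start = len(reference)  # 0-based start of the last variant applied
    # Work from the right end so that indels do not shift the coordinates
    # of the variants that are still waiting to be applied.
    for pos, ref, alt in sorted(variants, key=lambda v: v[0], reverse=True):
        start = pos - 1
        end = start + len(ref)
        if start < 0 or end > prev_start:
            raise ValueError(f"variant at {pos} is out of range or overlaps another")
        if seq[start:end] != ref:
            raise ValueError(f"REF allele mismatch at position {pos}")
        seq = seq[:start] + alt + seq[end:]
        prev_start = start
    return seq


if __name__ == "__main__":
    reference = "ACGTACGTAC"
    known = [
        (2, "C", "T"),    # SNP
        (4, "TA", "T"),   # 1 bp deletion
        (8, "T", "TGG"),  # 2 bp insertion
    ]
    print(apply_variants(reference, known))
```

Because the variant list is chosen up front, it doubles as the truth set when you score a SNP-calling pipeline against reads simulated from the altered sequence.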