Seqanswers Leaderboard Ad

**Bukowski** · 05-24-2014, 05:10 AM

Well it's exactly like IonTorrent advise - you can't deduplicate amplicon data on any platform, neither can you do it with HaloPlex.

This means it is impossible to remove PCR artefacts from the data. This is why I much prefer hybridisation capture for these kind of studies, as deduplication is required to keep that source of false positives under control.

My best advice is if you're stuck with this system, run samples in duplicate - at least then if you have false positives from the amplification, they shouldn't be present in the other replicate.

**nbahlis** · 05-24-2014, 01:06 PM

I have the same problem. Is there a way to rum Mutect without removing duplicates (or presumed duplicates)?

**IonTom** · 07-07-2014, 09:22 AM

You can use the VariantTools Biocondctor package, this gives you the
number of unique in read positions for variant and reference. Using a GenomicRanges object generated from a vcf you can make it report specifically for the positions of interest. This can be done using the tally function.

**IonTom** · 07-07-2014, 09:33 AM

Here is the code:

library(gmapR)
library(VariantTools)
library(VariantAnnotation)
library(BiocParallel)

biocParam <- MulticoreParam(workers = ncores)

fastaFile <- rtracklayer::FastaFile(referencefile)
gmapGenome <- GmapGenome(fastaFile, create=TRUE,directory = referencefolder)

print(tallied)

vcf = readVcf(vcffile)
called <- as(unlist(vcf),"VRanges")

tally.param <- TallyVariantsParam(gmapGenome, high_base_quality = 0L,minimum_mapq = 10L,
which = unique(as(called,"GRanges")),ignore_duplicates = FALSE,read_pos_breaks = c(1,10,120,330),
variant_strand = 1)

tallied = tallyVariants(bam_file,tally.param,BPPARAM = biocParam)
matched = called %in% tallied
matched_tallied = tallied %in% called

cur_called = called[matched]
tallied = unique(tallied[matched_tallied])

sampleNames(tallied) = sampleNames(cur_called)

elementMetadata(tallied) = c(elementMetadata(cur_called),elementMetadata(tallied))

print(tallied)

**dakl** · 09-11-2014, 01:50 AM

Originally posted by nbahlis View Post

I have the same problem. Is there a way to rum Mutect without removing duplicates (or presumed duplicates)?

Yes, just skip the dedup step. It's just an upstream step to skip.

**quantrix** · 12-16-2016, 11:05 AM

Hi Fourie,
I was wondering what you ended up doing in terms of variant calling with the Ion CCP panel data? Did you just end up using the torrent variant caller? If so, how did you do the downstream analysis for filtering the variants etc? Did you use Ion Reporter?
I find it quite incredible that the Ion data seems to be quite incompatible with ANY of the variant callers like Mutect, Varscan to do paired tumor normal analysis.
Thanks for the favor of a reply.
Regards

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Yesterday, 11:49 AM	0 responses 15 views 0 likes	Last Post by seqadmin Yesterday, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

Ion Torrent Ampliseq: duplicate removal / coverage / variants?

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News