Seqanswers Leaderboard Ad

**m_two** · 01-02-2014, 09:37 AM

You may need to filter strand specific errors from your data. In addition, some indels can result in alignment anomalies that result in multiple SNPs appearing in close proximity near the ends of the aligned portion of the reads containing the indel.

Many tools have filters in place to address these artifacts. Varscan includes a filtering tool that you may be able to apply to your data. See http://tvap.genome.wustl.edu/tools/varscan/.

If you see somatic mutations with support on both strands in close proximity you may want to refer to this manuscript:

http://www.nature.com/ng/journal/v45/n9/full/ng.2702.html?WT.ec_id=NG-201309

An APOBEC cytidine deaminase mutagenesis pattern is widespread in human cancersNature Genetics 45, 970–976 (2013) doi:10.1038/ng.2702

"...throughout cancer genomes APOBEC-mediated mutagenesis is pervasive and correlates with APOBEC mRNA levels. Mutation clusters in whole-genome and exome data sets conformed to the stringent criteria indicative of an APOBEC mutation pattern. "

**lovenlong** · 01-02-2014, 05:47 PM

Hi, m_two
Thanks for your reply.

Actually, my samples were not real somatic tissues, but were pooled of 100 rice plant individuals following mutagenesis. The theoretical variant allele frequency would be very low (<1%), and we thought it would help to improve calling accuracy of these heterozygous SNVs by using the matched tumor-normal pairs.

As marked in the attached figure, the Tumor_1_bwa.bam and Tumor_2_bwa.bam was the pool of mutagenized generation 1 and 2 respectively,the corresponding wild-type generation 1 and 2 was Normal_1_bwa.bam and Normal_2_bwa.bam. My wild-type was a pure rice cultivar which had been self-crossing at least 15 generations.

Due to low effective coverage, I merged the two generations of pools respectively, and called "somatics" with muTect, Strelka and Varscan2. My analysis pipeline was as follows:
1.calling somatic: (MAPQ≥30, Base_Q≥30)
Clean reads > bwa mapping > merged_BAMs > calling somatic with different callers > filtering
Clean reads > Stampy mapping > merged_BAMs > calling somatic with different callers > filtering

2. Eliminate mapping errors by combining calls of bwa and stampy.
calls_of_bwa_callerA + calls_of_stampy_callerA > overlapped SNVs of callerA
calls_of_bwa_callerB + calls_of_stampy_callerB > overlapped SNVs of callerB
...

muTect_passed_overlapped = 519
Strelka_passed_overlapped =55
Varscan2_passed_overlapped = 60

3. Finding concordant overlapped_SNVs among mutliple callers
muTect_call = Varsc_call =29 SNVs, muTect_call = Strelka_call =43 SNVs, Varsc_call = Strelka_call =23, 3_callers_overlapped=20.

4. Implement hard filtration of SNVs
(1) No more than 1 ALT read or read pair has additional mismatch/gap;
(2) No more than 3 additional mismatches/gaps exist within 50 bp either side of ALT site;
(3) ALT reads maximum MAPQ > 40；
(4) When ALT reads ≥ 4, they should not emanate exclusively from one strand;
(5) At least 2 mismatches or gaps are not in the 10 bp beginning or end of ALT reads;
(6) Mismatches or gaps are not at the beginning or end of homopolymers or SSRs (n>4)

Finally, 35 SNVs called by at least 2 callers were selected for validation.
I'm not sure if my analysis workflow was correct.
I would be very grateful if any body could give me some suggestions.

Best regards.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 25 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 29 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 24 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 52 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Confused about adjacent somatic SNVs within a 100bp window

Comment

Comment

Latest Articles

ad_right_rmr

News