SEQanswers (
-   Bioinformatics (
-   -   Mutect2 tumor-only - missing CHROM, ID, QUAL and FILTER columns in output vcf (

alzbeta_h 03-05-2019 04:14 PM

Mutect2 tumor-only - missing CHROM, ID, QUAL and FILTER columns in output vcf

I am a medical student new to NGS. I'm currently trying to count the tumor mutational load from tumor samples (FFPE) to which I have no matched normal tissue. We are using a gene panel of less than 70 genes.

I have got the fastq files (2 files per sample, paired-end) and I tried to write a script in bash to generate recalibrated bam files ready for use with mutect2.

However, when I received the output vcf file from mutect2, there are some mutations but there is no information in CHROM, ID, FILTER and QUAL column.
I am attaching the commands I used. I suppose there might be a problem with alignment, but I failed to recognize where the problem is. As a complete beginner, I suppose maybe I do some kind of systemic error.
I would be grateful for any kind of hint. THanks.


bwa index -a bwtsw hs37d5.fa | samtools faidx hs37d5.fa | picard CreateSequenceDictionary R=hs37d5.fa O=hs37d5.dict

bwa mem -M  ~/projects/patologie/hs37d5/hs37d5.fa  ~/projects/patologie/01_raw_data/BRCA1_S1_L001_R1_001.fastq ~/projects/patologie/01_raw_data/BRCA1_S1_L001_R2_001.fastq > sample.sam

samtools view -bS sample.sam > sample.bam

picard SortSam I=sample.bam O=sorted_sample.bam SORT_ORDER=coordinate

picard MarkDuplicates I=sorted_sample.bam O=sample_marked.bam M=marked_metrics.txt ASSUME_SORT_ORDER=coordinate

picard AddOrReplaceReadGroups I=sample_marked.bam  O=sample_rg.bam RGID=1 RGLB=lib1 RGPL=illumina RGPU=unit1 RGSM=1

gatk IndexFeatureFile -F= ~/projects/patologie/reference/1000G_phase1.indels.b37.vcf

gatk IndexFeatureFile -F= ~/projects/patologie/reference/Mills_and_1000G_gold_standard.indels.b37.vcf

gatk IndexFeatureFile -F= ~/projects/patologie/reference/dbsnp_138.b37.vcf

gatk BaseRecalibrator -R ~/projects/patologie/hs37d5/hs37d5.fa  -I sample_rg.bam --known-sites ~/projects/patologie/reference/1000G_phase1.indels.b37.vcf --known-sites ~/projects/patologie/reference/dbsnp_138.b37.vcf --known-sites ~/projects/patologie/reference/Mills_and_1000G_gold_standard.indels.b37.vcf -O recal_table

gatk ApplyBQSR -bqsr recal_table -I sample_rg.bam -R ~/projects/patologie/hs37d5/hs37d5.fa -O sample_recal.bam

gatk IndexFeatureFile -F= ~/projects/patologie/reference/af-only-gnomad.raw.sites.b37.vcf

gatk Mutect2 -R ~/projects/patologie/hs37d5/hs37d5.fa -I ~/projects/patologie/analysis2/sample_recal.bam -tumor 1 -O sample_single.vcf --germline-resource ~/projects/patologie/reference/af-only-gnomad.raw.sites.b37.vcf

All times are GMT -8. The time now is 02:31 PM.

Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2019, vBulletin Solutions, Inc.