View Single Post
Old 03-05-2019, 04:14 PM   #1
alzbeta_h
Junior Member
 
Location: Prague

Join Date: Mar 2019
Posts: 1
Default Mutect2 tumor-only - missing CHROM, ID, QUAL and FILTER columns in output vcf

Hi,

I am a medical student new to NGS. I'm currently trying to count the tumor mutational load from tumor samples (FFPE) to which I have no matched normal tissue. We are using a gene panel of less than 70 genes.

I have got the fastq files (2 files per sample, paired-end) and I tried to write a script in bash to generate recalibrated bam files ready for use with mutect2.

However, when I received the output vcf file from mutect2, there are some mutations but there is no information in CHROM, ID, FILTER and QUAL column.
I am attaching the commands I used. I suppose there might be a problem with alignment, but I failed to recognize where the problem is. As a complete beginner, I suppose maybe I do some kind of systemic error.
I would be grateful for any kind of hint. THanks.


Code:
bwa index -a bwtsw hs37d5.fa | samtools faidx hs37d5.fa | picard CreateSequenceDictionary R=hs37d5.fa O=hs37d5.dict

bwa mem -M  ~/projects/patologie/hs37d5/hs37d5.fa  ~/projects/patologie/01_raw_data/BRCA1_S1_L001_R1_001.fastq ~/projects/patologie/01_raw_data/BRCA1_S1_L001_R2_001.fastq > sample.sam 

samtools view -bS sample.sam > sample.bam

picard SortSam I=sample.bam O=sorted_sample.bam SORT_ORDER=coordinate

picard MarkDuplicates I=sorted_sample.bam O=sample_marked.bam M=marked_metrics.txt ASSUME_SORT_ORDER=coordinate

picard AddOrReplaceReadGroups I=sample_marked.bam  O=sample_rg.bam RGID=1 RGLB=lib1 RGPL=illumina RGPU=unit1 RGSM=1

gatk IndexFeatureFile -F= ~/projects/patologie/reference/1000G_phase1.indels.b37.vcf 

gatk IndexFeatureFile -F= ~/projects/patologie/reference/Mills_and_1000G_gold_standard.indels.b37.vcf 

gatk IndexFeatureFile -F= ~/projects/patologie/reference/dbsnp_138.b37.vcf

gatk BaseRecalibrator -R ~/projects/patologie/hs37d5/hs37d5.fa  -I sample_rg.bam --known-sites ~/projects/patologie/reference/1000G_phase1.indels.b37.vcf --known-sites ~/projects/patologie/reference/dbsnp_138.b37.vcf --known-sites ~/projects/patologie/reference/Mills_and_1000G_gold_standard.indels.b37.vcf -O recal_table

gatk ApplyBQSR -bqsr recal_table -I sample_rg.bam -R ~/projects/patologie/hs37d5/hs37d5.fa -O sample_recal.bam

gatk IndexFeatureFile -F= ~/projects/patologie/reference/af-only-gnomad.raw.sites.b37.vcf

gatk Mutect2 -R ~/projects/patologie/hs37d5/hs37d5.fa -I ~/projects/patologie/analysis2/sample_recal.bam -tumor 1 -O sample_single.vcf --germline-resource ~/projects/patologie/reference/af-only-gnomad.raw.sites.b37.vcf
alzbeta_h is offline   Reply With Quote