
**swNGS** · 02-04-2012, 08:43 AM

Okay, that doesn't work!

GATK throws the following error, which I recollect getting in the past:

##### ERROR MESSAGE: Input files known and reference have incompatible contigs: No overlapping contigs found.
##### ERROR known contigs = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, X]
##### ERROR reference contigs = [chr1, chr2, chr3, chr4, chr5, chr6, chr7, chr8, chr9, chr10, chr11, chr12, chr13, chr14, chr15, chr16, chr17, chr18, chr19, chr20, chr21, chr22, chrX, chrY, chrM]

It's fairly obvious that the contigs are different, and I get something similar when I use the 1000 Genomes reference:

##### ERROR MESSAGE: Input files reads and reference have incompatible contigs: No overlapping contigs found.
##### ERROR reads contigs = [chr1, chr2, chr3, chr4, chr5, chr6, chr7, chr8, chr9, chr10, chr11, chr12, chr13, chr14, chr15, chr16, chr17, chr18, chr19, chr20, chr21, chr22, chrX, chrY, chrM]
##### ERROR reference contigs = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, X, Y, MT, GL000207.1, GL000226.1, GL000229.1, GL000231.1, GL000210.1, GL000239.1, GL000235.1, GL000201.1, GL000247.1, GL000245.1, GL000197.1, GL000203.1, GL000246.1, GL000249.1, GL000196.1,...etc

Does anyone have any idea how I can fix this?

Thanks

**aaronh** · 02-07-2012, 11:00 AM

Looks like the names of the chromosomes are not matching between the reference dictionary in the bam file, the reference fasta file and the known variants in the VCF file.
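To make the mismatch concrete, here is a minimal Python sketch that compares two contig lists, first by exact name and then after stripping the `chr` prefix. The helper names `normalize` and `compare_contigs` are made up for illustration and are not part of GATK; the lists are copied from the error message above.

```python
def normalize(name):
    """Reduce a contig name to a prefix-free form so both conventions compare equal.

    Assumption: 'chrM' and 'MT' are treated as the same label. This only
    compares names; it says nothing about whether the sequences match.
    """
    if name.startswith("chr"):
        name = name[3:]
    return "MT" if name == "M" else name


def compare_contigs(a, b):
    """Return (names shared exactly, names shared after normalization)."""
    exact = set(a) & set(b)
    normalized = {normalize(n) for n in a} & {normalize(n) for n in b}
    return exact, normalized


# Contig lists taken from the first error message in this thread.
known = [str(i) for i in range(1, 23)] + ["X"]
reference = ["chr" + str(i) for i in range(1, 23)] + ["chrX", "chrY", "chrM"]

exact, after_norm = compare_contigs(known, reference)
print(len(exact), "shared names as written")
print(len(after_norm), "shared names once the 'chr' prefix is ignored")
```

If the first count is zero and the second is not, the files most likely differ only in naming convention rather than in which chromosomes they cover.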

**swNGS** · 03-05-2012, 05:52 AM

Ah, that's helpful. I suspect it might be the known variants VCF file that's causing the problem. Would I have to edit the headers so that the contig names in both the major allele reference genome and the known variants VCF are the same?

Also, and hopefully not a stupid question, what if one of these files refers to a contig that is missing in the other? I am thinking specifically of GL000207.1, GL000226.1, etc.
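One possible way to do the renaming asked about here, sketched in Python under a few assumptions: the VCF uses names like `1` and `MT`, the reference uses `chr1` and `chrM`, and records on contigs absent from the reference can simply be dropped. The functions `to_chr_name` and `rename_vcf_lines` are hypothetical helpers, not an existing tool, and whether dropping those records is acceptable depends on the analysis.

```python
import re

# Matches a VCF header line such as: ##contig=<ID=1,length=249250621>
CONTIG_HEADER = re.compile(r"^(##contig=<ID=)([^,>]+)(.*)$")


def to_chr_name(name):
    """Convert '1' -> 'chr1' and 'MT' -> 'chrM'; leave 'chr*' names alone."""
    if name.startswith("chr"):
        return name
    return "chrM" if name == "MT" else "chr" + name


def rename_vcf_lines(lines, reference_contigs):
    """Rename contigs in VCF lines and drop anything not in the reference."""
    ref = set(reference_contigs)
    out = []
    for line in lines:
        line = line.rstrip("\n")
        header = CONTIG_HEADER.match(line)
        if header:
            new_name = to_chr_name(header.group(2))
            if new_name in ref:  # skip header entries for missing contigs
                out.append(header.group(1) + new_name + header.group(3))
            continue
        if line.startswith("#"):
            out.append(line)  # other header lines pass through unchanged
            continue
        fields = line.split("\t")
        fields[0] = to_chr_name(fields[0])  # CHROM is the first column
        if fields[0] in ref:
            out.append("\t".join(fields))
    return out


example = [
    "##fileformat=VCFv4.1",
    "##contig=<ID=1,length=249250621>",
    "##contig=<ID=GL000207.1,length=4262>",
    "#CHROM\tPOS\tID\tREF\tALT",
    "1\t100\t.\tA\tG",
    "GL000207.1\t5\t.\tC\tT",
]
for row in rename_vcf_lines(example, ["chr1", "chr2"]):
    print(row)
```

The example records are invented for illustration. Note that this only changes labels: it does not check that the renamed contigs have the same length or sequence as the reference.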

Thanks,

Chris

**moty** · 03-07-2012, 04:36 AM

Hi there,
Did you ever solve this? I am having the exact same problem.
I am using the hg19 reference fasta from the GATK website, and the BAM files were given to me as they are.

Edit: I have tried renaming the chromosome labels in the .dict and .fai files (# --> chr#) and ended up with a complaint that chrM is 2 bases shorter.
If you have a wiser solution I'd love to hear it
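A length complaint like this can be checked before running GATK by comparing the contig lengths in the reference `.fai` index with the `@SQ` lines of the BAM header. Below is a rough Python sketch; `parse_fai`, `parse_sq_lines` and `length_mismatches` are hypothetical helper names, the header text is assumed to have been dumped to plain text beforehand, and the sample values (including the `.fai` offset columns) are illustrative only.

```python
def parse_fai(text):
    """Read contig lengths from .fai text (column 1 = name, column 2 = length)."""
    lengths = {}
    for line in text.splitlines():
        if line.strip():
            fields = line.split("\t")
            lengths[fields[0]] = int(fields[1])
    return lengths


def parse_sq_lines(text):
    """Read contig lengths from SAM/BAM header text (@SQ lines, SN and LN tags)."""
    lengths = {}
    for line in text.splitlines():
        if line.startswith("@SQ"):
            tags = dict(f.split(":", 1) for f in line.split("\t")[1:])
            lengths[tags["SN"]] = int(tags["LN"])
    return lengths


def length_mismatches(a, b):
    """Contigs present in both inputs whose lengths disagree."""
    return {n: (a[n], b[n]) for n in a.keys() & b.keys() if a[n] != b[n]}


# Illustrative inputs: a renamed index on one side, a BAM header on the other.
fai_text = "chr1\t249250621\t6\t60\t61\nchrM\t16569\t253404910\t60\t61\n"
header_text = "@HD\tVN:1.0\n@SQ\tSN:chr1\tLN:249250621\n@SQ\tSN:chrM\tLN:16571\n"

print(length_mismatches(parse_fai(fai_text), parse_sq_lines(header_text)))
```

A mismatch on a single contig suggests that the reads were aligned to a different version of that sequence, in which case renaming labels alone will not make the files compatible.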

Moty



How to use Indel Realigner against BAM aligned to major allele human genome
