**Brian Bushnell** · 02-24-2015, 10:22 AM

Whether or not indels are found depends on the aligner (and, perhaps, ploidy). How did you align the reads, and how did you align the contigs?

**sfh838t** · 02-24-2015, 10:53 AM

All my alignments were done using BWA, and the same file of reads was used for both GATK and samtools.
I used the samtools mpileup/bcftools/vcfutils steps both times:
samtools + reads = only SNPs
samtools + contigs = only indels
GATK UnifiedGenotyper + reads = only SNPs
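
For comparing the three runs listed above, a quick tally of SNP versus indel records in each output VCF can help. The following is only a minimal sketch, not part of any of the tools mentioned: the function name `classify_vcf_lines` is hypothetical, and it assumes plain tab-delimited VCF text where column 4 is REF and column 5 is ALT. A record counts as a SNP when REF and ALT are both one base, and as an indel when their lengths differ.

```python
def classify_vcf_lines(lines):
    """Count SNP, indel and other ALT alleles in an iterable of VCF text lines.

    Hypothetical helper for illustration; assumes tab-delimited VCF records.
    """
    counts = {"snp": 0, "indel": 0, "other": 0}
    for line in lines:
        # Skip blank lines and header lines (## meta lines and the #CHROM line).
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        ref = fields[3]
        # ALT may hold several comma-separated alleles; count each one.
        for alt in fields[4].split(","):
            if alt in (".", "*") or alt.startswith("<"):
                counts["other"] += 1   # no call, spanning deletion, symbolic allele
            elif len(ref) == 1 and len(alt) == 1:
                counts["snp"] += 1
            elif len(ref) != len(alt):
                counts["indel"] += 1
            else:
                counts["other"] += 1   # same-length multi-base substitution
    return counts


if __name__ == "__main__":
    import sys
    # Usage (file name is a placeholder): python count_variants.py calls.vcf
    with open(sys.argv[1]) as handle:
        print(classify_vcf_lines(handle))
```

Running this on each of the three VCFs would show whether a caller reported no indels at all or only a few.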

**Brian Bushnell** · 02-24-2015, 10:56 AM

Do you see indels in the reads when you look at the mapped bam in IGV? And how long are these indels?

**sfh838t** · 02-24-2015, 11:03 AM

Yes, I can see both indels and SNPs in IGV. Most of the indels are 3-7 bp long, and the indels found are at different locations than the SNPs found.
Is it legitimate to use contigs in calling variants?

**sfh838t** · 02-24-2015, 11:07 AM

I looked at the contigs in IGV and see both SNPs and indels. I have not looked at the reads.

**sfh838t** · 02-24-2015, 11:18 AM

OK, I just went and looked at the reads and see the same insert that I identified in the contigs. This is important for the project, because this insert is present in one possible parent but not the other.

**Brian Bushnell** · 02-24-2015, 12:33 PM

Calling indels from the contigs is probably a valid approach as long as these are homozygous events (I'm not really sure how chloroplast genomes work) and the assembly is good. Possibly someone who knows more about GATK or mpileup can comment on why they seem to be missing the indels in the reads.

**sfh838t** · 02-24-2015, 12:43 PM

I did get the GATK answer and now have a file with both SNPs and indels together. When working with plants, all bets are off.

The samtools output might remain a mystery.
Probably a new question: does anyone know how to build a consensus from an alignment that DOES include indels?

**sarvidsson** · 02-25-2015, 12:44 AM

Did you try the FastaAlternateReferenceMaker in GATK? You need called variants in a VCF file, however, not just an alignment. Read the documentation carefully; there are some limitations.
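
To illustrate the general idea of building a consensus from called variants that include indels, here is a minimal sketch. It is not GATK's implementation and makes no claim about how FastaAlternateReferenceMaker behaves. The function name `apply_variants` is hypothetical, and the sketch assumes one haploid or homozygous allele per site, non-overlapping variants, and VCF-style 1-based positions with REF and ALT strings.

```python
def apply_variants(reference, variants):
    """Return the reference sequence with SNPs and indels substituted in.

    reference: str, the reference sequence for one contig.
    variants:  iterable of (pos, ref, alt) with 1-based pos, as in a VCF record.
    Hypothetical helper for illustration only.
    """
    pieces = []
    cursor = 0  # 0-based index of the next reference base not yet copied
    for pos, ref, alt in sorted(variants):
        start = pos - 1
        if start < cursor:
            raise ValueError(f"variant at {pos} overlaps a previous variant")
        if reference[start:start + len(ref)] != ref:
            raise ValueError(f"REF allele at {pos} does not match the reference")
        pieces.append(reference[cursor:start])  # unchanged stretch before the variant
        pieces.append(alt)                      # SNP, insertion or deletion allele
        cursor = start + len(ref)               # skip the replaced reference bases
    pieces.append(reference[cursor:])           # remainder after the last variant
    return "".join(pieces)


if __name__ == "__main__":
    ref_seq = "ACGTACGT"
    calls = [
        (2, "C", "T"),      # SNP
        (4, "T", "TGGG"),   # 3 bp insertion
        (6, "CG", "C"),     # 1 bp deletion
    ]
    print(apply_variants(ref_seq, calls))  # prints ATGTGGGACT
```

Checking each REF allele against the reference before substituting catches a VCF that was called against a different reference or uses a different coordinate convention.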

**sfh838t** · 03-02-2015, 12:54 PM

Thanks, I will try that.

variant calling in plant