Unconfigured Ad

**swbarnes2** · 04-03-2013, 08:36 AM

Align both samples to your best reference, then use samtools mpileup on both .bams together.

**Rainbird** · 04-03-2013, 06:35 PM

Thanks swbarnes2

Could you explain a little more why "use samtools mpileup on both .bams together" will work? In that case, we still need the reference genome from subspecies A, right?

Another thing is that: if there are more than 1 non-reference allels reported, the samtools only gives out the depth of the 1st non-reference allel (as listed in DP4). Also, although the 1/1 indicates homozygous alternate, I don't understand the meaning of the PL value which is "131,59,26,91,0,85" (as shown below). How can we get the depths and other information for the 2nd alternate ?

chr2 213263 . A C,T 72 . DP=14;VDB=0.0355;AF1=1;AC1=2;DP4=0,0,9,4;MQ=56;FQ=-60 GT:PL:GQ 1/1:131,59,26,91,0,85:63

**Rainbird** · 04-06-2013, 07:55 PM

Anyone can help?

**Khen** · 04-07-2013, 03:55 PM

Calling species B and C against the reference together just saves space. And yes, you will still need to use the reference genome. The output is slightly different however, so what you will get is an extra GT field:

Code:

chr2	213263	.	A	C,T	72	.	DP=14;VDB=0.0355;AF1=1;AC1=2;DP4=0,0,9,4;MQ=56;FQ=-60	GT:<Genotype of A>	GT:<Genotype of B>

I'm pretty sure that the PL field is reporting the quality score of all of the allelic possibilities, which is why you see six of them. You will have to consult the documentation for how to get multiple sample depth information.
Also, I find that the Broad Institute does a much better job documentation than does sourceforge or 1000genomes.org. Since samtools and gatk both use VCF as the standard output, you might want to start with the GATK documentation if not just switch to GATK altogether.

Hope this helps.

**Rainbird** · 04-11-2013, 05:24 PM

Thanks Khen.
If I undertand correctly, samtools is designed for diploid genome. If there are 2 alleles in your sample other than the allele in the reference genome (for example, the reference genome has a T, and you have a A and a G in your sample), samtool might not work well.

Is there any tool specifically designed to find allele frequency in your own samples regardless what is in the reference genome?

Topics	Statistics	Last Post
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 38 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism by SEQadmin2 Started by SEQadmin2, 06-09-2026, 11:58 AM	0 responses 102 views 0 reactions	Last Post by SEQadmin2 06-09-2026, 11:58 AM
A New Method Makes Hantavirus Genome Analysis Faster and More Accessible by SEQadmin2 Started by SEQadmin2, 06-05-2026, 10:09 AM	0 responses 123 views 0 reactions	Last Post by SEQadmin2 06-05-2026, 10:09 AM
A New Single-Cell Method Maps DNA-Protein Interactions by SEQadmin2 Started by SEQadmin2, 06-04-2026, 08:59 AM	0 responses 114 views 0 reactions	Last Post by SEQadmin2 06-04-2026, 08:59 AM

Unconfigured Ad

Snp calling between samples

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News