I am sure someone must have solved this before, and I wonder if there is not an obviously easy way to do this:
I have GAIIx-RAD data of bowtie-reference-mapped individuals. The resulting individual SNP data (produced with samtools in the form of pileups) is easily merged in a single table, but one major problem remains:
The samtools pileup only reports on SNPs which are different wrt the reference sequence, but not on those SNPs which happen to be identical to the reference (i.e. those which show differences in other individuals).
Is there any easy way of discriminating the absence of data from DNA-sequence identity of any given SNP with the reference sequence?
Any pointers most welcome!
I have GAIIx-RAD data of bowtie-reference-mapped individuals. The resulting individual SNP data (produced with samtools in the form of pileups) is easily merged in a single table, but one major problem remains:
The samtools pileup only reports on SNPs which are different wrt the reference sequence, but not on those SNPs which happen to be identical to the reference (i.e. those which show differences in other individuals).
Is there any easy way of discriminating the absence of data from DNA-sequence identity of any given SNP with the reference sequence?
Any pointers most welcome!
Comment