I have a VCFv4.2 (Sanger-imputation-service genotype data) file and I need to calculate the proportion of missing data per sample.
Each sample is encoded like so:
I am uncertain as to how 'missing' is identified - Any suggestions?
Each sample is encoded like so:
Code:
GT:ADS:DS:GP 0|0:0.25,0.15:0.4:0.6375,0.325,0.0375
Comment