Hi:
Why is it important to have a read group. Is it to keep track of
samples belonging to a family or a phenotype etc. OR is it useful to
optimize the variant calls among samples due to coverage issues.
What I mean:
For instance I have a family of 4 (Mother, father, 2 children) and 5
samples from this family (4 germ line from members and 1 tissue sample
from affected child).
Likewise I have 4 different families. Now using exome sequencing data,
I might not have equal coverage among a family and across families for
a locus. In a family that locus might be interesting , but due to
coverage in affected child, it is difficult to call that locus as a
significant call. In such case, If I specify read group, based on RG
info, will mpileup optimize the SNV at that locus and call it
significant?
Is this why we need a read group? Is mpileup works this way. Sorry for
asking a naive question.
thanks
Adrian
Why is it important to have a read group. Is it to keep track of
samples belonging to a family or a phenotype etc. OR is it useful to
optimize the variant calls among samples due to coverage issues.
What I mean:
For instance I have a family of 4 (Mother, father, 2 children) and 5
samples from this family (4 germ line from members and 1 tissue sample
from affected child).
Likewise I have 4 different families. Now using exome sequencing data,
I might not have equal coverage among a family and across families for
a locus. In a family that locus might be interesting , but due to
coverage in affected child, it is difficult to call that locus as a
significant call. In such case, If I specify read group, based on RG
info, will mpileup optimize the SNV at that locus and call it
significant?
Is this why we need a read group? Is mpileup works this way. Sorry for
asking a naive question.
thanks
Adrian
Comment