Seqanswers Leaderboard Ad

**KaiYe** · 02-24-2014, 08:29 AM

Originally posted by millerrruth View Post

Hi,
I am running Pindel on some Illumina data.
I am having trouble interpreting the final "per sample" columns of the output files.
The manual (http://gmt.genome.wustl.edu/pindel/0...er-manual.html) says that lines 32+ are "Per sample" and consist of the sample id, plus 4 values for each sample (ie. the sample name, followed by the total number of supporting reads whose anchors are upstream, the total number of unique supporting reads whose anchors are upstream, the total number of supporting reads whose anchors are downstream, and finally the total number of unique supporting reads whose anchors are downstream.)
However in my output I get 6 values following every sample name. (eg:
C9343 5 5 12 10 1 1)
Can you please explain what these extra 2 values are, and the order of the values.

Thanks,

Ruth

SampleID RefSupportingLeft RefSupportingRight AltSupportingLeft AltSupportingLeftUnique AltSupportingRight AltSupportingRightUnique

**millerrruth** · 02-24-2014, 11:20 AM

Hi,
Thanks.
So to clarify RefSupportingLeft and RefSupportingRight are the number of reads that support the reference sequence, and therefore not the variant?
Whereas Alt... are the number of reads that support the variant as documented in the rest of the row?
Ruth

**KaiYe** · 02-24-2014, 02:52 PM

Originally posted by millerrruth View Post

Hi,
Thanks.
So to clarify RefSupportingLeft and RefSupportingRight are the number of reads that support the reference sequence, and therefore not the variant?
Whereas Alt... are the number of reads that support the variant as documented in the rest of the row?
Ruth

Ref = reference allele
Alt = variant allele

take max(RefSupportingLeft, RefSupportingRight) and sum(AltSupportingLeft, AltSupportingRight) for your genotype. if you run pindel2vcf, this should be take cared already.

Topics	Statistics	Last Post
ASHG 2024 Highlights – Part Two by seqadmin Started by seqadmin, 11-08-2024, 11:09 AM	0 responses 244 views 0 likes	Last Post by seqadmin 11-08-2024, 11:09 AM
ASHG 2024 Highlights – Part One by seqadmin Started by seqadmin, 11-08-2024, 06:13 AM	0 responses 184 views 0 likes	Last Post by seqadmin 11-08-2024, 06:13 AM
Seq-Scope Expands Possibilities for High-Resolution Gene Expression Analysis by seqadmin Started by seqadmin, 11-01-2024, 06:09 AM	0 responses 83 views 0 likes	Last Post by seqadmin 11-01-2024, 06:09 AM
New Model Aims to Explain Polygenic Diseases by Connecting Genomic Mutations and Regulatory Networks by seqadmin Started by seqadmin, 10-30-2024, 05:31 AM	0 responses 28 views 0 likes	Last Post by seqadmin 10-30-2024, 05:31 AM

Seqanswers Leaderboard Ad

Announcement

Pindel output Row 32+ per sample details

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News