Seqanswers Leaderboard Ad

**KaiYe** · 02-24-2014, 08:29 AM

Originally posted by millerrruth View Post

Hi,
I am running Pindel on some Illumina data.
I am having trouble interpreting the final "per sample" columns of the output files.
The manual (http://gmt.genome.wustl.edu/pindel/0...er-manual.html) says that lines 32+ are "Per sample" and consist of the sample id, plus 4 values for each sample (ie. the sample name, followed by the total number of supporting reads whose anchors are upstream, the total number of unique supporting reads whose anchors are upstream, the total number of supporting reads whose anchors are downstream, and finally the total number of unique supporting reads whose anchors are downstream.)
However in my output I get 6 values following every sample name. (eg:
C9343 5 5 12 10 1 1)
Can you please explain what these extra 2 values are, and the order of the values.

Thanks,

Ruth

SampleID RefSupportingLeft RefSupportingRight AltSupportingLeft AltSupportingLeftUnique AltSupportingRight AltSupportingRightUnique

**millerrruth** · 02-24-2014, 11:20 AM

Hi,
Thanks.
So to clarify RefSupportingLeft and RefSupportingRight are the number of reads that support the reference sequence, and therefore not the variant?
Whereas Alt... are the number of reads that support the variant as documented in the rest of the row?
Ruth

**KaiYe** · 02-24-2014, 02:52 PM

Originally posted by millerrruth View Post

Hi,
Thanks.
So to clarify RefSupportingLeft and RefSupportingRight are the number of reads that support the reference sequence, and therefore not the variant?
Whereas Alt... are the number of reads that support the variant as documented in the rest of the row?
Ruth

Ref = reference allele
Alt = variant allele

take max(RefSupportingLeft, RefSupportingRight) and sum(AltSupportingLeft, AltSupportingRight) for your genotype. if you run pindel2vcf, this should be take cared already.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 17 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 22 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 46 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Pindel output Row 32+ per sample details

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News