Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • SD2010Bioinfo
    Junior Member
    • Sep 2012
    • 9

    VarScan.v2.3.2 output file format problem

    I'm using Varscan.v2.3.2 to do somatic variants calling in tumor-normal sample with command below. But the output snp and indel files have different columns between head line(19 columns) and content (23 columns). What do the extra 4 columns mean?

    (command:java -Xmx3g -jar VarScan.v2.3.2.jar somatic chr17_N.mpileup chr17_T.mpileup --min-coverage 10 --min-coverage-normal 10 --min-coverage-tumor 10 --min-var-freq 0.2 --min-freq-for-hom 0.75 --somatic-p-value 0.05 --output-snp chr17.snp --output-indel chr17.indel)

    chr17.snp
    chrom position ref var normal_reads1 normal_reads2 normal_var_freq normal_gt tumor_reads1 tumor_reads2 tumor_var_freq tumor_gt somatic_status variant_p_value somatic_p_value tumor_reads1_plus tumor_reads1_minus tumor_reads2_plus tumor_reads2_minus
    chr17 6115 G C 0 24 100% C 0 46 100% C Germline 1.0658598000218083E-41 1.0 0 0 4 42 0 0 11 13

    --------------------------------------------------------------------------
    chr17.indel
    chrom position ref var normal_reads1 normal_reads2 normal_var_freq normal_gt tumor_reads1 tumor_reads2 tumor_var_freq tumor_gt somatic_status variant_p_value somatic_p_value tumor_reads1_plus tumor_reads1_minus tumor_reads2_plus tumor_reads2_minus
    chr17 565264 A -T 9 7 43.75% */-T 9 14 60.87% */-T Germline 1.1401894182998399E-8 0.23331930373087006 0 9
    0 14 0 9 0 7


    Thank you!
  • fbsja
    Junior Member
    • Oct 2012
    • 3

    #2
    extra columns? 23 (from 19)

    i'm new to varScan,i have the same issue....

    Comment

    • Jane M
      Senior Member
      • Aug 2011
      • 239

      #3
      The last 4 columns are normal_reads1_plus normal_reads1_minus normal_reads2_plus normal_reads2_minus.

      Comment

      • SD2010Bioinfo
        Junior Member
        • Sep 2012
        • 9

        #4
        Originally posted by Jane M View Post
        The last 4 columns are normal_reads1_plus normal_reads1_minus normal_reads2_plus normal_reads2_minus.
        Thank you for your reply. But in fact, you may not get the point.
        The last 4 columns of the head is the content you mentioned. But the last 4 columns of variation informatin has extra 4 columns which has no corresponding head.

        Comment

        • Jane M
          Senior Member
          • Aug 2011
          • 239

          #5
          Originally posted by SD2010Bioinfo View Post
          Thank you for your reply. But in fact, you may not get the point.
          The last 4 columns of the head is the content you mentioned. But the last 4 columns of variation informatin has extra 4 columns which has no corresponding head.
          The last 4 columns of the head are tumor_reads1_plus tumor_reads1_minus tumor_reads2_plus tumor_reads2_minus, not normal_reads1_plus normal_reads1_minus normal_reads2_plus normal_reads2_minus.

          If I understand well your question, you wonder what are the values 0 0 11 13 in the .snp file. Am I right?
          In this case, these are normal_reads1_plus, normal_reads1_minus, normal_reads2_plus, normal_reads2_minus : 0+0=0 and 11+13=24 as in chr17 6115 G C 0 24.

          Comment

          • SD2010Bioinfo
            Junior Member
            • Sep 2012
            • 9

            #6
            Sorry I misunderstood you.
            I think your explanation is right to the question.
            Thank you very much!

            Comment

            • dkoboldt
              Member
              • Mar 2009
              • 62

              #7
              I'm hereby promoting Jane M to "VarScan veteran". Thanks for the answers!

              Comment

              • Jane M
                Senior Member
                • Aug 2011
                • 239

                #8
                Cool!
                It's nice to answer and not to ask questions sometimes!

                Comment

                Latest Articles

                Collapse

                • SEQadmin2
                  From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                  by SEQadmin2


                  Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                  The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                  ...
                  06-02-2026, 10:05 AM
                • SEQadmin2
                  Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                  by SEQadmin2


                  With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                  Introduction

                  Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                  05-22-2026, 06:42 AM
                • SEQadmin2
                  Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                  by SEQadmin2

                  Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                  Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                  05-06-2026, 09:04 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by SEQadmin2, Today, 08:59 AM
                0 responses
                8 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-02-2026, 12:03 PM
                0 responses
                21 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-02-2026, 11:40 AM
                0 responses
                17 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 05-28-2026, 11:40 AM
                0 responses
                30 views
                0 reactions
                Last Post SEQadmin2  
                Working...