Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • GATK "Code exception"

    I tried to run GATK after alignment with bowtie - file was SortSam'ed and ReorderSam'ed (both picard) prior to GATK.

    java -Xmx3g -jar GATK/GenomeAnalysisTK.jar -R ucsc.hg19.fasta -T UnifiedGenotyper -I file_in.bam --dbsnp dbsnp_132.hg19.vcf -o file_in_out.vcf -metrics snps.metrics -stand_call_conf 50.0 -stand_emit_conf 10.0 -dcov 1000 -A DepthOfCoverage -A AlleleBalance -L hg19_exons.bed


    Error:

    INFO 20:39:02,940 HelpFormatter - Date/Time: 2012/02/06 20:39:02
    INFO 20:39:02,940 HelpFormatter - ---------------------------------------------------------------------------------
    INFO 20:39:02,940 HelpFormatter - ---------------------------------------------------------------------------------
    INFO 20:39:02,956 RodBindingArgumentTypeDescriptor - Dynamically determined type of dbsnp_132.hg19.vcf to be VCF
    INFO 20:39:02,966 GenomeAnalysisEngine - Strictness is SILENT
    INFO 20:39:03,022 SAMDataSource$SAMReaders - Initializing SAMRecords in serial
    INFO 20:39:03,036 SAMDataSource$SAMReaders - Done initializing BAM readers: total time 0.01
    INFO 20:39:03,045 RMDTrackBuilder - Loading Tribble index from disk for file dbsnp_132.hg19.vcf
    INFO 20:39:05,881 GATKRunReport - Uploaded run statistics report to AWS S3
    ##### ERROR ------------------------------------------------------------------------------------------
    ##### ERROR stack trace
    java.lang.NullPointerException
    at org.broadinstitute.sting.gatk.datasources.reads.BAMScheduler.getNextOverlappingBAMScheduleEntry(BAMScheduler.java:259)
    at org.broadinstitute.sting.gatk.datasources.reads.BAMScheduler.advance(BAMScheduler.java:170)
    at org.broadinstitute.sting.gatk.datasources.reads.BAMScheduler.populateFilteredIntervalList(BAMScheduler.java:99)
    at org.broadinstitute.sting.gatk.datasources.reads.BAMScheduler.createOverIntervals(BAMScheduler.java:74)
    at org.broadinstitute.sting.gatk.datasources.reads.IntervalSharder.shardOverIntervals(IntervalSharder.java:58)
    at org.broadinstitute.sting.gatk.datasources.reads.SAMDataSource.createShardIteratorOverIntervals(SAMDataSource.java:1148)
    at org.broadinstitute.sting.gatk.GenomeAnalysisEngine.getShardStrategy(GenomeAnalysisEngine.java:452)
    at org.broadinstitute.sting.gatk.GenomeAnalysisEngine.execute(GenomeAnalysisEngine.java:230)
    at org.broadinstitute.sting.gatk.CommandLineExecutable.execute(CommandLineExecutable.java:122)
    at org.broadinstitute.sting.commandline.CommandLineProgram.start(CommandLineProgram.java:236)
    at org.broadinstitute.sting.commandline.CommandLineProgram.start(CommandLineProgram.java:146)
    at org.broadinstitute.sting.gatk.CommandLineGATK.main(CommandLineGATK.java:90)
    ##### ERROR ------------------------------------------------------------------------------------------
    ##### ERROR A GATK RUNTIME ERROR has occurred (version 1.4-24-g6ec686b):
    ##### ERROR
    ##### ERROR Please visit the wiki to see if this is a known problem
    ##### ERROR If not, please post the error, with stack trace, to the GATK forum
    ##### ERROR Visit our wiki for extensive documentation http://www.broadinstitute.org/gsa/wiki
    ##### ERROR Visit our forum to view answers to commonly asked questions http://getsatisfaction.com/gsa
    ##### ERROR
    ##### ERROR MESSAGE: Code exception (see stack trace for error itself)
    ##### ERROR --------------------------------------------------------------------



    Any ideas what could have caused that ?

  • #2
    Do what is says on the tin, go to http://getsatisfaction.com/gsa and check the forums - we've hit issues and found them already solved there, or acknowledged. If not the developers are very responsive.

    That looks like quite a recent version of GATK, we've learned here that slavishly following the GATK release cycle just leads to problems. Pick a version that works, stick to it, keep and eye on the changelog on github to see if any essential features have been added or bugs fixed.

    We had issues with the entire 1.3 release set, and once we established GenomeAnalysisTK-1.4-7-gc96fee4 fixed our problems - that's where we have left things. If you're using the GATK heavily you need to test each release thoroughly before introducing it into a production pipeline.

    Comment


    • #3
      Originally posted by Bukowski View Post
      Do what is says on the tin, go to http://getsatisfaction.com/gsa and check the forums - we've hit issues and found them already solved there, or acknowledged. If not the developers are very responsive.

      That looks like quite a recent version of GATK, we've learned here that slavishly following the GATK release cycle just leads to problems. Pick a version that works, stick to it, keep and eye on the changelog on github to see if any essential features have been added or bugs fixed.

      We had issues with the entire 1.3 release set, and once we established GenomeAnalysisTK-1.4-7-gc96fee4 fixed our problems - that's where we have left things. If you're using the GATK heavily you need to test each release thoroughly before introducing it into a production pipeline.

      I concur... I have encountered many problems by upgrading. I am on v1.2 for now. Will consider upgrading to 1.4 now that it seems more stable.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Strategies for Sequencing Challenging Samples
        by seqadmin


        Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
        03-22-2024, 06:39 AM
      • seqadmin
        Techniques and Challenges in Conservation Genomics
        by seqadmin



        The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

        Avian Conservation
        Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
        03-08-2024, 10:41 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, Yesterday, 06:37 PM
      0 responses
      12 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, Yesterday, 06:07 PM
      0 responses
      10 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 03-22-2024, 10:03 AM
      0 responses
      52 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 03-21-2024, 07:32 AM
      0 responses
      68 views
      0 likes
      Last Post seqadmin  
      Working...
      X