Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • splitting haplotyepecaller onto multiple threads

    Hi all,

    Quick question, I am doing some whole genome and am using the GATK haplotypecaller. To speed things up, I would like to break the genome into smaller chuncks and run each chromosome separately on its own thread (using -L options with a bed file for each chromosome). This will produce multiple gvcf files for 1 sample. If I use the genotypeGVCFs function, will it treat the gvcf's with the same sample name as a single file? Would the combineGVCFs do the same thing and that should be done first?

    Thanks

  • #2
    If you're just going to split things to run on individual threads, then why not just use the -nt option? What you suggest would only make sense if you're using a cluster and want to split individual samples across nodes.

    Comment


    • #3
      Sorry, I wasn't clear, but I am going to be splitting the samples across nodes on a cluster. also on the GATK documentation page, there is a caveat listed:

      Many users have reported issues running HaplotypeCaller with the -nct argument, so we recommend using Queue to parallelize HaplotypeCaller instead of multithreading
      I assume that means that the -nt options shouldn't be used, otherwise it would be a better way to do things.

      Comment


      • #4
        Ah, guess I've just never run into issues with it.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Strategies for Sequencing Challenging Samples
          by seqadmin


          Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
          03-22-2024, 06:39 AM
        • seqadmin
          Techniques and Challenges in Conservation Genomics
          by seqadmin



          The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

          Avian Conservation
          Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
          03-08-2024, 10:41 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Yesterday, 06:37 PM
        0 responses
        10 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, Yesterday, 06:07 PM
        0 responses
        9 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 03-22-2024, 10:03 AM
        0 responses
        51 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 03-21-2024, 07:32 AM
        0 responses
        67 views
        0 likes
        Last Post seqadmin  
        Working...
        X