Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Samtools problem on a "rocks" cluster

    I am having some problems getting samtools to run on a machine that is part of a rocks (v. 4.3) cluster (it happens to be the head-node which has 16GB RAM).

    Even if I use the pre-compiled binary available from the samtools site (or compile samtools locally from source) I am getting a segmentation fault running samtools ("import"). The job is *not* being submitted to cluster queue. I want to make sure samtools actually runs and completes a job on this OS.

    Has anyone else ran into this before?

    BTW: How may of you are using "rocks" based cluster(s) for processing/analyzing next gen sequence data?

    -- Hemant

  • #2
    I often use maq or bwa on a rocks cluster, but haven't used samtools yet. But if the job is not being submitted to the queue, then how do you know that samtools is segfaulting? Some examples of commands, output, your shell script you're submitting to the queue, etc, would be helpful ...

    Comment


    • #3
      Originally posted by jnfass View Post
      I often use maq or bwa on a rocks cluster, but haven't used samtools yet. But if the job is not being submitted to the queue, then how do you know that samtools is segfaulting? Some examples of commands, output, your shell script you're submitting to the queue, etc, would be helpful ...
      Let us set aside the fact that this is a cluster for a moment. I am trying to make sure the program can run successfully on the head node.

      I am logged into the head node and then run a basic --> samtools "import" database *.sam *.bam command. samtools runs for sometime (e.g. I get this: [sam_header_read2] 28203826 sequences loaded) and then the program seg faults without producing any additional error messages.

      Any ideas?

      Comment


      • #4
        sorry - seems like a straight samtools problem, unrelated to cluster usage (if you've used the command successfully on other data, you could verify that) ...

        I can't help, but I'd suggest you post the exact command, and the exact output, so others might be able to help you. Good luck!

        Comment


        • #5
          Originally posted by GenoMax View Post
          I am having some problems getting samtools to run on a machine that is part of a rocks (v. 4.3) cluster (it happens to be the head-node which has 16GB RAM).

          Even if I use the pre-compiled binary available from the samtools site (or compile samtools locally from source) I am getting a segmentation fault running samtools ("import"). The job is *not* being submitted to cluster queue. I want to make sure samtools actually runs and completes a job on this OS.

          Has anyone else ran into this before?

          BTW: How may of you are using "rocks" based cluster(s) for processing/analyzing next gen sequence data?

          -- Hemant
          Can you give me the command you are using? What version of rocks and zlib are you using? We have rocks 4 and 5 running with samtools.

          Comment


          • #6
            Originally posted by nilshomer View Post
            Can you give me the command you are using? What version of rocks and zlib are you using? We have rocks 4 and 5 running with samtools.
            This is the command line:

            samtools import ~/index/hg18.fa ~/single/single_end_mod.txt.part10.sam ~/single/single_end_mod.txt.part10.bam

            Rocks v.4.3 and the stock zlib that came with 4.3 (which is 1.2.1 I think).

            Comment


            • #7
              Originally posted by GenoMax View Post
              This is the command line:

              samtools import ~/index/hg18.fa ~/single/single_end_mod.txt.part10.sam ~/single/single_end_mod.txt.part10.bam

              Rocks v.4.3 and the stock zlib that came with 4.3 (which is 1.2.1 I think).
              Try running
              Code:
              samtools faidx ~/index/hg18.fa
              before the import. You need to index the reference.
              Then run
              Code:
              samtools import ~/index/hg18.fa.fai ~/single/single_end_mod.txt.part10.sam ~/single/single_end_mod.txt.part10.bam
              (notice the *fa.fai instead of *fa since the argument for import is a reference list, not the reference).

              Comment


              • #8
                samtools import <in.ref_list> <in.sam> <out.bam>
                Since 0.1.4, this command is an alias of:
                samtools view -bt <in.ref_list> -o <out.bam> <in.sam>

                so you can use "samtools view -bt" instead. Before "view" ,you should run "samtools faidx" first. I am using rocks 4.3 cluster, and have not met some problem with samtools. Good luck to you

                Comment


                • #9
                  Thanks "nilshomer" and "jfshao1984" for pointing out the solution.

                  Comment

                  Latest Articles

                  Collapse

                  • seqadmin
                    Current Approaches to Protein Sequencing
                    by seqadmin


                    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                    04-04-2024, 04:25 PM
                  • seqadmin
                    Strategies for Sequencing Challenging Samples
                    by seqadmin


                    Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                    03-22-2024, 06:39 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by seqadmin, 04-11-2024, 12:08 PM
                  0 responses
                  27 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 10:19 PM
                  0 responses
                  30 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 09:21 AM
                  0 responses
                  26 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-04-2024, 09:00 AM
                  0 responses
                  52 views
                  0 likes
                  Last Post seqadmin  
                  Working...
                  X