Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Segmentation fault (core dumped) in Samtools

    Hi, All,

    I want to map the RNA-seq BMA files back to the reference, and I used the samtools pileup command. It always shows the error "Segmentation fault (core dumped)".

    It should not be the problem in the reference genome, because I have tried "fold" command to format the reference genome, as well as the "faidx" in samtools. They did not work. Strangely, for different BAM files, the pileup sometimes shows the error at the very beginning, some times stopped at chr6, while some other times shows the errors at chr10 or later chromosomes.

    My command is "samtools view -u Inputfile (BAM) | samtools pileup -vcf hg19.fa - > output". However, when I tried Partek, it runs very well.

    Anyone has any idea what is the problem with these BAM files? What I should do to fix these bugs/modify the BAM files?

    Thanks a lot.

  • #2
    I'm not sure quite what you want to do here. If you have a .bam file, isn't it already mapped? Pileup won't map anything for you, and I'm not sure it takes .sam files, which is what you are giving it.

    Pileup is also deprecated. mpileup is the current version of that bit of code.

    Comment


    • #3
      Thanks for the reply.

      Actually, I want to call mutations from BAM files against reference genome, and pileup/mpileup can generate a list of positions that is different between samples and reference genome.

      No matter I used pileup or mpileup, the problem is always there.

      My command in pileup is "samtools view -u Inputfile (BAM) | samtools pileup -vcf hg19.fa - > output ", while the command for mpileup is "samtools mpileup -uf hg19.fa BAM | bcftools view -bvcg -> output".

      Thanks again.

      Comment


      • #4
        With pileup, is your BAM file sorted? If so, try outputing the samtools view command to a file to ensure that the problem isn't there. If that completes properly, run the samtools pileup in gdb to get an idea where the segfault is (you might also compile samtools with debugging). That way you can provide a proper bug report if this is an issue with samtools.

        Comment


        • #5
          Originally posted by hanleng View Post
          Thanks for the reply.

          Actually, I want to call mutations from BAM files against reference genome, and pileup/mpileup can generate a list of positions that is different between samples and reference genome.
          Sure, but that's not mapping.

          My command in pileup is "samtools view -u Inputfile (BAM) | samtools pileup -vcf hg19.fa - > output ", while the command for mpileup is "samtools mpileup -uf hg19.fa BAM | bcftools view -bvcg -> output".
          Forget pileup. First thing to check with mpileup is that it made the fasta index properly. You can make it yourself with samtools faidx. Does samtools faidx finish without an error? Compare chromosome names in your .sam with those in your fasta file. Are they the same?

          Comment


          • #6
            Thanks for the reply. The samtools view completes properly. What do you mean by "samtools pileup in gdb"?

            Comment


            • #7
              Originally posted by swbarnes2 View Post
              Sure, but that's not mapping.



              Forget pileup. First thing to check with mpileup is that it made the fasta index properly. You can make it yourself with samtools faidx. Does samtools faidx finish without an error? Compare chromosome names in your .sam with those in your fasta file. Are they the same?
              fadix works well, and chromosome name is the same between reference genomes and BAM files.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Essential Discoveries and Tools in Epitranscriptomics
                by seqadmin




                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                04-22-2024, 07:01 AM
              • seqadmin
                Current Approaches to Protein Sequencing
                by seqadmin


                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                04-04-2024, 04:25 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, Today, 08:47 AM
              0 responses
              9 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-11-2024, 12:08 PM
              0 responses
              60 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 10:19 PM
              0 responses
              57 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 09:21 AM
              0 responses
              53 views
              0 likes
              Last Post seqadmin  
              Working...
              X