Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • dovah
    Member
    • Jul 2014
    • 18

    extended cigar in bwa-mem

    Hi all,

    I am aligning pacbio reads to a reference genome using bwa mem. Do you know how to output an extended CIGAR in output sam file from bwa v0.7.13?

    My command is:
    Code:
    bwa mem -x pacbio refgenome.fasta reads.fastq > output.sam
    Thanks in advance
  • dpryan
    Devon Ryan
    • Jul 2011
    • 3478

    #2
    What exactly do you mean by "extended CIGAR"? bwa mem will output SAM files with an appropriate CIGAR string already.

    Comment

    • Brian Bushnell
      Super Moderator
      • Jan 2014
      • 2709

      #3
      He probably means cigar strings with X and = symbols instead of M, which are very handy.

      Comment

      • dovah
        Member
        • Jul 2014
        • 18

        #4
        Yes, this is what I mean by "extended cigar". By the way, I found a tool that can do that a posteriori, in case someone else is interested: SamFixCigar (http://github.com/lindenb/jvarkit )

        Comment

        • sklages
          Senior Member
          • May 2008
          • 628

          #5
          Are there any plans to extend bwa to write X/= instead of M? At least optional?

          Comment

          • Brian Bushnell
            Super Moderator
            • Jan 2014
            • 2709

            #6
            You can now use BBMap's reformat.sh to add those:

            reformat.sh in=mapped.sam out=extended.sam sam=1.4

            I have not tested it extensively on things like hard-clipping but it should generally work.

            Comment

            • sklages
              Senior Member
              • May 2008
              • 628

              #7
              Brian,

              thanks for the suggestion. I will try :-)

              Nevertheless I'd wish bwa would provide an option to either use M or X=.
              Just to avoid more I/O due to another conversion ...

              Sven

              Comment

              • sklages
                Senior Member
                • May 2008
                • 628

                #8
                After studying some docs I do see that with optional MD:Z I am even as flexible as with X=, with the advantage of not having too "complicated" CIGARs. :-)

                Comment

                Latest Articles

                Collapse

                • SEQadmin2
                  From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                  by SEQadmin2


                  Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                  The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                  ...
                  06-02-2026, 10:05 AM
                • SEQadmin2
                  Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                  by SEQadmin2


                  With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                  Introduction

                  Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                  05-22-2026, 06:42 AM
                • SEQadmin2
                  Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                  by SEQadmin2

                  Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                  Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                  05-06-2026, 09:04 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by SEQadmin2, Yesterday, 08:59 AM
                0 responses
                14 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-02-2026, 12:03 PM
                0 responses
                22 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-02-2026, 11:40 AM
                0 responses
                19 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 05-28-2026, 11:40 AM
                0 responses
                32 views
                0 reactions
                Last Post SEQadmin2  
                Working...