  • Some clarification about sam to bam conversion

    Hello,
    To get a sorted BAM file from an unsorted SAM file, I often see these commands:

    Code:
    samtools view -bS file.sam | samtools sort - > file_sorted.bam
    samtools view -bT reference.fa file.sam | samtools sort - > file_sorted.bam
    My questions are:
    1. When do I need to pass the reference file as an argument?
    2. Are there any disadvantages to doing this instead:
    Code:
    samtools sort -O BAM file.sam > file_sorted.bam
    fin swimmer

  • #2
    1. You only need to do that with CRAM files (and usually not even then)
    2. Samtools sort doesn't accept BAM input.
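One related case, not raised in the reply above: older samtools documentation describes supplying the reference when the SAM file has no @SQ header lines, since BAM needs the sequence names and lengths. A minimal sketch of how to check for that, where `has_sq_header` is a hypothetical helper name and not part of samtools:

```shell
# Succeed if the SAM file contains at least one @SQ header line.
# has_sq_header is a hypothetical helper, not a samtools command.
has_sq_header() {
    grep -q '^@SQ' "$1"
}
```

If `has_sq_header file.sam` fails, the reference (or its `.fai` index from `samtools faidx reference.fa`) is probably needed so that samtools can build the header. If it succeeds, the plain `samtools view -bS file.sam` form should be enough.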


    As an aside, you can use the following for a slight speed improvement:

    Code:
    samtools view -uS file.sam | samtools sort - > file_sorted.bam
    Note that the syntax for "samtools sort" has changed in more recent versions.
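To illustrate the syntax change, here is a sketch of the same pipeline written with the `-o` and `-T` options that appear in the samtools 1.3.1 usage message quoted later in this thread. It assumes samtools 1.3 or later is on PATH. The function names and the `.tmp` prefix convention are made up for illustration.

```shell
# Hypothetical helper: derive a temp-file prefix from the output name,
# for use with "samtools sort -T".
sort_prefix() {
    printf '%s.tmp\n' "${1%.bam}"
}

# Convert SAM to uncompressed BAM (-u) and sort it, naming the output
# file with -o. Sketch only; assumes samtools >= 1.3.
# Older 0.1.x releases instead took an output prefix as a second
# positional argument: samtools sort in.bam out_prefix
view_and_sort() {
    in_sam=$1
    out_bam=$2
    samtools view -uS "$in_sam" |
        samtools sort -T "$(sort_prefix "$out_bam")" -o "$out_bam" -
}
```

Called as `view_and_sort file.sam file_sorted.bam`. Naming the output with `-o` avoids relying on shell redirection, and an explicit `-T` prefix keeps temporary files next to the output.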



    • #3
      Hello
      Originally posted by dpryan:
      1. You only need to do that with CRAM files (and usually not even then)
      2. Samtools sort doesn't accept BAM input.
      Do you mean it doesn't accept SAM input?

      Code:
      samtools 1.3.1
      Using htslib 1.3.1
      Copyright (C) 2016 Genome Research Ltd.
      
      Usage: samtools sort [options...] [in.bam]
      Options:
        -l INT     Set compression level, from 0 (uncompressed) to 9 (best)
        -m INT     Set maximum memory per thread; suffix K/M/G recognized [768M]
        -n         Sort by read name
        -o FILE    Write final output to FILE rather than standard output
        -T PREFIX  Write temporary files to PREFIX.nnnn.bam
        -@, --threads INT
                   Set number of sorting and compression threads [1]
            --input-fmt-option OPT[=VAL]
                     Specify a single input file format option in the form
                     of OPTION or OPTION=VALUE
        -O, --output-fmt FORMAT[,OPT[=VAL]]...
                     Specify output format (SAM, BAM, CRAM)
            --output-fmt-option OPT[=VAL]
                     Specify a single output file format option in the form
                     of OPTION or OPTION=VALUE
            --reference FILE
                     Reference sequence FASTA FILE [null]
      The strange thing is that I haven't seen any problems so far when sorting the way I described above.

      fin swimmer



      • #4
        Oops, yes, I meant SAM input. Sorting SAM directly at least used to fail, but perhaps that has been silently changed.
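One way to confirm that direct sorting of SAM input worked is to look at the header of the result. The sketch below assumes samtools 1.3 or later on PATH and assumes that `samtools sort` records `SO:coordinate` in the @HD line of its output. Both function names are hypothetical.

```shell
# Sort a SAM file directly into BAM, as in the original question.
# Sketch only; assumes samtools >= 1.3. $1 = input SAM, $2 = output BAM.
sort_sam_directly() {
    samtools sort -O BAM -T "${2%.bam}.tmp" -o "$2" "$1"
}

# Read a SAM header on stdin; succeed if the @HD line declares
# coordinate sort order.
is_coordinate_sorted() {
    grep -q '^@HD.*SO:coordinate'
}
```

For example, run `sort_sam_directly file.sam file_sorted.bam` and then `samtools view -H file_sorted.bam | is_coordinate_sorted`. A non-zero exit status from the second command would suggest the sort did not behave as expected on that samtools version.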
