Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • samtools merge

    Hello all,

    I have >150 sorted alignment bam files (>50G) by applying bwa, do I have to merge them into one single file before any further analysis?

    If I just run SNP calling on each of the bam file and merge them together, how about the duplicates? It might be different with the result coming from a big single bam file.

    Thanks in advance for any help.

  • #2
    Originally posted by bair View Post
    Hello all,

    I have >150 sorted alignment bam files (>50G) by applying bwa, do I have to merge them into one single file before any further analysis?

    If I just run SNP calling on each of the bam file and merge them together, how about the duplicates? It might be different with the result coming from a big single bam file.

    Thanks in advance for any help.
    1. Merge all the BAMs that belong to the __same library__.
    2. Mark dups on those
    3. Remove dups
    4. Run your snp pipeline per each merged BAM.
    -drd

    Comment


    • #3
      And be sure to use Picard's duplicate marker. It is superior to the one in samtools (the samtools docs suggest this as well).

      Fear the duplicate...

      Comment


      • #4
        Right.. not only that. It is not recommended by the samtools authors.

        From the samtools page:

        "Samtools’ rmdup does not work for single-end data and does not remove duplicates across chromosomes. Picard is better."

        I think it would be useful to explain this in the samtools rmdup help.
        -drd

        Comment


        • #5
          Thank you guys.

          Why just merge BAM in the same library? Indel/Snps may have different depth in different library, if we do not merge them into one big BAM file, how to setup filter for snp calling?

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Recent Advances in Sequencing Analysis Tools
            by seqadmin


            The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
            05-06-2024, 07:48 AM
          • seqadmin
            Essential Discoveries and Tools in Epitranscriptomics
            by seqadmin




            The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
            04-22-2024, 07:01 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, Today, 06:35 AM
          0 responses
          12 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, Yesterday, 02:46 PM
          0 responses
          18 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 05-07-2024, 06:57 AM
          0 responses
          17 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 05-06-2024, 07:17 AM
          0 responses
          18 views
          0 likes
          Last Post seqadmin  
          Working...
          X