Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • GATK and SAM format

    I want to call SNP/indel using GATK tools.

    however, GATK occurs error because my input bam file is malformed

    because some of @HD, @SQ, @RG, @PG info are missing.

    I got the meaning of each component from 'the sam format specification(v1.4-r985)' pdf file from samtools,

    but i don't know exactly what should i type each information and wht's critical for GATK because there are many options to fill it.

    does anyone who knows it? is there any website to specify it automatically?

  • #2
    Hi!

    From which aligner do the bam files come from?

    I have never tried this but in samtools there is the option reheader, where you can replace (or add?) the header.
    You would have to prepare a file containing the header lines based on the information that is specific for your alignment = sam/bam file.
    You would have to include @HD (including the sort order of your alignments), @SQ (information on the reference sequence) and @PG (information on the program used for alignment).
    For the @RG lines, I think you could user the AddOrReplaceReadgroups command from picard.

    maybe that helps...
    cheers!
    Last edited by sdvie; 02-10-2012, 02:26 AM.

    Comment


    • #3
      I use both bwa and novoalign. i use data of illumina Hiseq, pair-end.

      Thank you for your reply, Sdvie.

      and if you don't mind, may i ask you a example sam file you had used without error?

      i do want the header section only, not aligned information.

      if you don't want to show others, please send me an email [email protected] ...

      Thank you

      Comment


      • #4
        I sent you an example of a header. You can see that the SQ line is multiplied because the reference sequence fasta and index has these subsections by chromosome etc. It is crucial that they coincide with the reference sequence you are using in all steps (alignment, GATK etc). Therefore, usually the aligner program itself adds these lines, I have never done this manually. bwa for sure does add the HD, SQ and PG headers, I have not worked with novoalign. As I said before, the read groups may be added either via a bwa argument (check the bwa documentation) during alignment or by picard afterwards.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Advancing Precision Medicine for Rare Diseases in Children
          by seqadmin




          Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
          12-16-2024, 07:57 AM
        • seqadmin
          Recent Advances in Sequencing Technologies
          by seqadmin



          Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

          Long-Read Sequencing
          Long-read sequencing has seen remarkable advancements,...
          12-02-2024, 01:49 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 12-17-2024, 10:28 AM
        0 responses
        26 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 12-13-2024, 08:24 AM
        0 responses
        42 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 12-12-2024, 07:41 AM
        0 responses
        28 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 12-11-2024, 07:45 AM
        0 responses
        42 views
        0 likes
        Last Post seqadmin  
        Working...
        X