Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • gene coder
    Member
    • Jul 2011
    • 18

    Reverse engineering BAM files: BAM -> FASTQ

    How can I possibly extract the reads from a BAM file and put them into a FASTQ file for simulation (maq simutrain, then maq simulate)?

    Should I just extract col. 1, 10 and 11 from a BAM file and put them in a text file along with a '+'? That is the output by read simulators.

    But does it not happen that DNA sequences and base quality sequences are reversed and/or transformed depending on the direction and strand of reads relative to reference genomes? I would have to fix that too in that case.
  • raonyguimaraes
    Member
    • Jun 2010
    • 38

    #2
    We make Stack Overflow and 170+ other community-powered Q&A sites.


    Comment

    • Richard Finney
      Senior Member
      • Feb 2009
      • 701

      #3
      Check out http://seqanswers.com/forums/showthread.php?t=16395 for bampe2fq.c and bamse2fq.c for fast implementations of bam to fastq programs. It handles your concerns about reverse complimenting the sequence and reversing the quality string.

      Comment

      • gene coder
        Member
        • Jul 2011
        • 18

        #4
        Thanks. I found that SamToFastq in Picard did the job on a chromosome of NA12878 from the 1000 Genomes Project.

        1. I separated SE reads and PE reads by library into separate BAM files using SamTools.
        2. For PE reads, I also had to get rid of read-pairs that were unmapped (bit 3) or whose mate was unmapped (bit 4) or that were not properly aligned (bit 2).
        3. I further separated the BAM files by read length.
        4. Each BAM file now contained SE or PE reads only of the same read length and the same library.
        5. Now I could run SamToFastq to convert a SAM file to a DAT file for use with MAQ simutrain.

        Those were the steps to do to get what I wanted.

        Comment

        Latest Articles

        Collapse

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by SEQadmin2, 06-09-2026, 11:58 AM
        0 responses
        25 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-05-2026, 10:09 AM
        0 responses
        31 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-04-2026, 08:59 AM
        0 responses
        39 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-02-2026, 12:03 PM
        0 responses
        62 views
        0 reactions
        Last Post SEQadmin2  
        Working...