Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • pileup2bam

    Anyone aware of a tool that can convert a pileup file to BAM files.

  • #2
    That doesn't make sense. BAM files (like SAM files) describe where individual reads are placed, whereas a pileup file is a coverage based summary of many reads.

    Comment


    • #3
      Yes, but theoretically you can re-create the BAM information from the pileup because all the information is stored in the pileup file. Or am I mistaken?

      Comment


      • #4
        Going from pileup to bam is a theoretical possibility, but I doubt anyone has taken the trouble to make it a reality. Even the best you could do would miss out on some information, like pairing. Why do you want to do this?

        Comment


        • #5
          The problem is I am using an RNA-Seq workflow that produces a pileup rather than a BAM, so I can't annotate it with GATK.

          Comment


          • #6
            What kind of workflow? If you are using Galaxy it probably is making a BAM file but hiding it by default - check the hidden entries.

            Comment


            • #7
              There must be an easier way to get what you want than resurrecting a bam from pileup. You could look at the files upstream of the pileup and see if they can be translated to bam format, or, on the other hand, go forward to create a vcf (or some other bed/gff file) from the pileup and annotate that.

              Comment


              • #8
                I find myself in this situation using legacy data (e.g., original sequencing files from an old project were lost but pileups are still available). I wanted to bump this thread to see if anyone has come up with a good solution to this problem- any utilities out there that can convert a pileup to a bam or something like it?

                Comment


                • #9
                  You need to be more specific about the pileup format that is used. A text-based format like the output of 'samtools mpileup' would be easier to convert to SAM than a graphical image, for example. But even given the huge amount of information that could be extracted from 'samtools mpileup', it doesn't store everything. For example, the sequence names are not preserved, and no pairing information is retained.

                  Any pileup output which is nothing more than coverage values without linking between different bases will not be sufficient for generating a source SAM file.

                  Comment


                  • #10
                    That's a good point- I'm talking about the text-based format, and the experiment was run single-end so pairing isn't important. It's true that I wouldn't have the original read names, but naively that doesn't seem very important- is there software that actually uses those, beyond for read pairing?

                    That said, I suppose that I hadn't considered whether the pileup format explicitly maintains the individual bases within reads. I had always assumed that the base calls at each position were ordered readwise, but it occurs to me that this may not be the case.

                    Anyway, it sounds like a tool to do this conversion probably doesn't exist. Thank you for your input.

                    Comment


                    • #11
                      mpileup does retain read order information in the per-base coverage, but I'm not aware of any other text-based formats that do... actually, I'm not aware of any other text-based formats that do anything more than coverage alone.

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Recent Innovations in Spatial Biology
                        by seqadmin


                        Spatial biology is an exciting field that encompasses a wide range of techniques and technologies aimed at mapping the organization and interactions of various biomolecules in their native environments. As this area of research progresses, new tools and methodologies are being introduced, accompanied by efforts to establish benchmarking standards and drive technological innovation.

                        3D Genomics
                        While spatial biology often involves studying proteins and RNAs in their...
                        01-01-2025, 07:30 PM
                      • seqadmin
                        Advancing Precision Medicine for Rare Diseases in Children
                        by seqadmin




                        Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
                        12-16-2024, 07:57 AM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, 01-09-2025, 04:04 PM
                      0 responses
                      433 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 01-09-2025, 09:42 AM
                      0 responses
                      441 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 01-08-2025, 03:17 PM
                      0 responses
                      457 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 01-03-2025, 11:18 AM
                      1 response
                      50 views
                      1 like
                      Last Post Tonia
                      by Tonia
                       
                      Working...
                      X