Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • help - convert .bed to .bam

    hi,

    i need to convert a .bed file to .bam. I've tried using BEDTools, but i've been running into a lot of issues with it.

    are there any other tools/methods out there to do this?

    thanks

  • #2
    What are the issues with bedtools? If you post a few lines of your bed file, maybe these issues could be sorted out.

    Comment


    • #3
      hi,

      thanks for your reply.

      so the current issue i'm having is:

      Error: BED entry without name found at line: 1. Exiting!

      the command I gave in UNIX is:

      bedToBam -i chr1.bed -g hg18.txt > chr1.bam

      chr1.bed (bed file for only chromosome1) looks like this:

      chr1 100000000 100000036
      chr1 100000040 100000076
      chr1 100000090 100000126
      chr1 100000136 100000172
      chr1 100000189 100000225
      chr1 100000235 100000271
      chr1 100000287 100000323
      chr1 100000336 100000372
      chr1 100000401 100000437
      chr1 100000452 100000488
      chr1 100000512 100000548..
      .....

      and hg18.txt looks like this:

      chr1 247249719
      chr10 135374737
      chr11 134452384
      chr12 132349534
      chr13 114142980
      chr14 106368585
      chr15 100338915

      thanks!

      Comment


      • #4
        The bed file seems very sparse to me. BedToBam seems to be looking for a name which should be in a fourth column. Haven't really tried BedToBam, but that is my guess.

        Comment


        • #5
          Yes, that's the problem. Try the following:

          Code:
          cat chr1.bed | awk '{x++; printf "%s\tread%d\n",$0,x}' | bedtools bedtobam -g hg18.txt -i - > chr1.bam
          I get the following output then, when converted to SAM (with samtools view):

          Code:
          read1   0       chr1    100000001       255     36M     *       0       0       *       *
          read2   0       chr1    100000041       255     36M     *       0       0       *       *
          read3   0       chr1    100000091       255     36M     *       0       0       *       *
          read4   0       chr1    100000137       255     36M     *       0       0       *       *
          read5   0       chr1    100000190       255     36M     *       0       0       *       *
          read6   0       chr1    100000236       255     36M     *       0       0       *       *
          read7   0       chr1    100000288       255     36M     *       0       0       *       *
          read8   0       chr1    100000337       255     36M     *       0       0       *       *
          read9   0       chr1    100000402       255     36M     *       0       0       *       *
          read10  0       chr1    100000453       255     36M     *       0       0       *       *
          Last edited by arvid; 05-23-2012, 12:15 AM. Reason: typo

          Comment


          • #6
            thanks a lot for helping out! really appreciate it.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM
            • seqadmin
              Strategies for Sequencing Challenging Samples
              by seqadmin


              Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
              03-22-2024, 06:39 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            18 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            22 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 09:21 AM
            0 responses
            16 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-04-2024, 09:00 AM
            0 responses
            47 views
            0 likes
            Last Post seqadmin  
            Working...
            X