Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • gff3-bed-bam for RNAseq

    Dear all,

    I have some splice junctions (in GFF3) that I ultimately want to convert to bam. I was thinking to convert first to bed myself and then to bam using bedToBam.
    To test the second step I grabbed a bed-example from teh UCSC browser (each line encodes TWO exons) and ran a little test on it, converting to bam and then back to bed. This results in information loss:

    nuvol:cufflinks htilgner$ cat test.bed
    chr22 1000 5000 cloneA 960 + 1000 5000 0 2 567,488, 0,3512
    chr22 2000 6000 cloneB 900 - 2000 6000 0 2 433,399, 0,3601
    nuvol:cufflinks htilgner$ bedToBam -i test.bed -g hg19.genome.file > test.bam
    nuvol:cufflinks htilgner$ bamToBed -i test.bam
    chr22 1000 5000 cloneA 255 +
    chr22 2000 6000 cloneB 255 -

    Do you know why ? More importantly, is there an appropriate way to encode a spliced read in bam format ? How would that work ? Alternatively, is there a way to convert gff3 to bam directly ?

    Thanks

  • #2
    Given that the bedToBam program is general purpose, I'd be pretty surprised if it knew enough about what the various meta-information columns meant in order to store them in a meaningful way in BAM format. Spliced read are generally denoted with Ns in the CIGAR string.

    Comment


    • #3
      Dear dpryan,

      thanks for your answer. Do you have a way to force the below 2 example read (that are spliced) into a bam file ?
      chr22 1000 5000 cloneA 960 + 1000 5000 0 2 567,488, 0,3512
      chr22 2000 6000 cloneB 900 - 2000 6000 0 2 433,399, 0,3601

      Best & thanks
      Last edited by Hagen; 08-20-2012, 08:09 AM.

      Comment


      • #4
        Not off-hand, I've never needed such a file. It'd be easy enough to just write a conversion program in C that would parse the BED file for the exonic regions, grab the relevant sequence from a fasta file, and write everything to a BAM file. I haven't a clue why you would want such a file, but that would work.

        Comment


        • #5
          OK, I get it now, I believe. My problem was that I hoped not to get into teh specific of sam/bam and simply convert ... will write the converter ...
          thanks for your help!

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Essential Discoveries and Tools in Epitranscriptomics
            by seqadmin




            The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
            04-22-2024, 07:01 AM
          • seqadmin
            Current Approaches to Protein Sequencing
            by seqadmin


            Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
            04-04-2024, 04:25 PM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, Today, 08:47 AM
          0 responses
          10 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-11-2024, 12:08 PM
          0 responses
          60 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 10:19 PM
          0 responses
          57 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 09:21 AM
          0 responses
          53 views
          0 likes
          Last Post seqadmin  
          Working...
          X