Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • intersect file bam-bed and count read

    Hi all!
    I have RNA-seq data and I used STAR in order to align the reads.
    I would like to use my bed file in order to have the only read that are mapped in the position reported in bed file and after I would like to know how many read are mapped in each interval.
    For the first part I tried to use "bedtools intersect":
    bedtools intersect -abam -a file.bam -b file.bed
    I saw that the number of the read reported in the output file were less than the input file but It doesn't seem right. In fact, for example, in my bed file for chromosome the first interval begins with the position 335 and in my new output file there are reads mapped in position 80.

    Thank you in advance and I am sorry for my probably bad English!
    Best

  • #2
    The option you should look at is coverageBed.

    As for the other observation if you are providing a set of BED intervals then only reads that fall in those intervals will be reported (so that number can be expected to be less than the input). It is possible that the read in question is mapped beginning at position 80 but depending on how long it is it must be extending into the first interval.

    Comment


    • #3
      Since you did the mapping with STAR, it is possible a read will be really long once mapped to the genome if it is at an exon-exon junction... You can use the parameter -split in your bedtools command to report only the region your read is really mapping...

      Comment


      • #4
        Originally posted by GenoMax View Post
        The option you should look at is coverageBed.

        As for the other observation if you are providing a set of BED intervals then only reads that fall in those intervals will be reported (so that number can be expected to be less than the input). It is possible that the read in question is mapped beginning at position 80 but depending on how long it is it must be extending into the first interval.
        Thank you for your reply. The read is about 20 nucleotides, so beedtools should discard this read...

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin


          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
          Yesterday, 07:01 AM
        • seqadmin
          Current Approaches to Protein Sequencing
          by seqadmin


          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
          04-04-2024, 04:25 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 04-11-2024, 12:08 PM
        0 responses
        39 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 10:19 PM
        0 responses
        41 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 09:21 AM
        0 responses
        35 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-04-2024, 09:00 AM
        0 responses
        55 views
        0 likes
        Last Post seqadmin  
        Working...
        X