Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • intersect file bam-bed and count read

    Hi all!
    I have RNA-seq data and I used STAR in order to align the reads.
    I would like to use my bed file in order to have the only read that are mapped in the position reported in bed file and after I would like to know how many read are mapped in each interval.
    For the first part I tried to use "bedtools intersect":
    bedtools intersect -abam -a file.bam -b file.bed
    I saw that the number of the read reported in the output file were less than the input file but It doesn't seem right. In fact, for example, in my bed file for chromosome the first interval begins with the position 335 and in my new output file there are reads mapped in position 80.

    Thank you in advance and I am sorry for my probably bad English!
    Best

  • #2
    The option you should look at is coverageBed.

    As for the other observation if you are providing a set of BED intervals then only reads that fall in those intervals will be reported (so that number can be expected to be less than the input). It is possible that the read in question is mapped beginning at position 80 but depending on how long it is it must be extending into the first interval.

    Comment


    • #3
      Since you did the mapping with STAR, it is possible a read will be really long once mapped to the genome if it is at an exon-exon junction... You can use the parameter -split in your bedtools command to report only the region your read is really mapping...

      Comment


      • #4
        Originally posted by GenoMax View Post
        The option you should look at is coverageBed.

        As for the other observation if you are providing a set of BED intervals then only reads that fall in those intervals will be reported (so that number can be expected to be less than the input). It is possible that the read in question is mapped beginning at position 80 but depending on how long it is it must be extending into the first interval.
        Thank you for your reply. The read is about 20 nucleotides, so beedtools should discard this read...

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Current Approaches to Protein Sequencing
          by seqadmin


          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
          04-04-2024, 04:25 PM
        • seqadmin
          Strategies for Sequencing Challenging Samples
          by seqadmin


          Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
          03-22-2024, 06:39 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 04-11-2024, 12:08 PM
        0 responses
        23 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 10:19 PM
        0 responses
        24 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 09:21 AM
        0 responses
        21 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-04-2024, 09:00 AM
        0 responses
        52 views
        0 likes
        Last Post seqadmin  
        Working...
        X