I have a bam file containing arabidopsis short RNA reads that I have mapped to the TAIR10 genome. Is there an easy way to get counts of reads that overlap each other and the corresponding region/sequence that they overlap. Essentially, I'm wanting to group all the reads in each bam file into groups that map to the same region of the genome to get a pseudo count (don't worry, I'm not using this for anything serious), I'm exploring at the moment.
I mapped them to the sRNA from mirBase and essentially nothing maps (0~20 reads). There's a reasonable number of reads mapping to the genome, so I want to find areas where there's significant numbers of those reads clustering.
I have a nagging feeling that this is a silly question with an easy answer, but the only way I can think of doing it at the moment is to align all of the mapped sequences with each other and count how many have decent overlaps. Which seems silly.
Cheers
Ben.
I mapped them to the sRNA from mirBase and essentially nothing maps (0~20 reads). There's a reasonable number of reads mapping to the genome, so I want to find areas where there's significant numbers of those reads clustering.
I have a nagging feeling that this is a silly question with an easy answer, but the only way I can think of doing it at the moment is to align all of the mapped sequences with each other and count how many have decent overlaps. Which seems silly.
Cheers
Ben.
Comment