Hello all,
I am relatively new to NGS and have just mapped a series of reads to a genome using Bowtie2. As I understand, the SAM/BAM output contains the information telling me where the reads map to the genome. I now want to:
a) transform this to a list of regions where reads map i.e., combine all the overlapping reads that map to one part of the genome into a starting position (bp), ending position (bp), and the number of reads that map to that region
b) cross-reference that list of regions against the genome annotation to identify whether that collection of reads maps to an exon, intron, etc. and, if an exon, extract the relevant gene and transcript id
I am sure there must be a tool for this - probably several! I am interested to know what people use? Any relative advantages/disadvantages, key tips or tricks, etc., would also be helpful.
Thanks for your help.
Regards,
Jonathan
I am relatively new to NGS and have just mapped a series of reads to a genome using Bowtie2. As I understand, the SAM/BAM output contains the information telling me where the reads map to the genome. I now want to:
a) transform this to a list of regions where reads map i.e., combine all the overlapping reads that map to one part of the genome into a starting position (bp), ending position (bp), and the number of reads that map to that region
b) cross-reference that list of regions against the genome annotation to identify whether that collection of reads maps to an exon, intron, etc. and, if an exon, extract the relevant gene and transcript id
I am sure there must be a tool for this - probably several! I am interested to know what people use? Any relative advantages/disadvantages, key tips or tricks, etc., would also be helpful.
Thanks for your help.
Regards,
Jonathan
Comment