Hi Everybody!
My problem is about mapping where only the location of 5'end of the read is meaningful, rest of the sequence is needed only to direct this end into the location - let's call this location a CAGE tag.
I did mapping using TopHat and while it is very easy to extract CAGE tags for +strand (it is read start position in SAM file), it is not at all the same story for minus strand. If this were bowtie mapping I would just add read length to the start position, but since this mapping allows for indels and exons it is always different number that would have to been added (it is possible to deduce it from CIGAR).
Do any of you know a straightforward method of obtaining this CAGE tags, or I have to figure out how to use CIGAR?
Best,
Lukasz
Edit: After more extensive search I found that in R package GenomicRanges there is function cigarToWidth() which is what I need! Anyway, maybe there are some even easier solutions.
My problem is about mapping where only the location of 5'end of the read is meaningful, rest of the sequence is needed only to direct this end into the location - let's call this location a CAGE tag.
I did mapping using TopHat and while it is very easy to extract CAGE tags for +strand (it is read start position in SAM file), it is not at all the same story for minus strand. If this were bowtie mapping I would just add read length to the start position, but since this mapping allows for indels and exons it is always different number that would have to been added (it is possible to deduce it from CIGAR).
Do any of you know a straightforward method of obtaining this CAGE tags, or I have to figure out how to use CIGAR?
Best,
Lukasz
Edit: After more extensive search I found that in R package GenomicRanges there is function cigarToWidth() which is what I need! Anyway, maybe there are some even easier solutions.
Comment