Hello everybody,
A research collaborator has asked us to classify promoters according to their CpG density. The data come from an RRBS (Reduced Representation Bisulfite Sequencing) analysis in mouse (using the mm9 assembly as reference), and I don't have a clue how to do this.
- Does anyone know of a tool, script or R package that can do this?
- Is there a repository where I can download a BED file with this kind of information already computed for organisms other than human?
We've been reading different papers that present such data, but I haven't been able to understand how they obtain the classification and then represent it graphically.
Any insight would be appreciated.
Thanks in advance.
JL
P.S. I've been given this set of conditions. I'm writing them here in case anybody (even the authors of a similar analysis) recognizes an analogous problem and can suggest a tool, script or R package that performs this calculation:
"To determine promoter classes, we measured the GC content and the CpG ratio of observed to expected values in sliding 500-bp windows with a 5-bp offset in regions -900 bp to +400 bp relative to the TSS. Promoter classes were defined as follows: LCPs contain no 500-bp window with a CpG ratio >0.45; HCPs contain at least one 500-bp window with a CpG ratio >0.65 and GC content >55%; ICPs do not meet the previous criteria."
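For reference, the rule quoted above could be sketched in plain Python roughly as follows. This is only a minimal sketch under some assumptions, not the method of any particular paper or package. It assumes the sequence from -900 to +400 around each TSS has already been extracted from mm9 and oriented on the gene's strand, and that the observed/expected CpG ratio is computed as (number of CpGs x window length) / (number of Cs x number of Gs), which is a commonly used definition but is not stated in the quote. The function names `window_stats` and `classify_promoter` are made up for illustration.

```python
def window_stats(window):
    """Return (GC fraction, CpG observed/expected ratio) for one window."""
    w = window.upper()
    n = len(w)
    c = w.count("C")
    g = w.count("G")
    cpg = w.count("CG")
    gc_fraction = (c + g) / n if n else 0.0
    # Assumed o/e definition: (#CpG * length) / (#C * #G).
    # Windows with no C or no G get a ratio of 0.
    oe = (cpg * n) / (c * g) if c and g else 0.0
    return gc_fraction, oe


def classify_promoter(seq, window=500, step=5):
    """Classify one promoter sequence as 'HCP', 'LCP' or 'ICP'.

    seq is assumed to be the strand-oriented sequence spanning
    -900 bp to +400 bp relative to the TSS.
    """
    stats = [
        window_stats(seq[i:i + window])
        for i in range(0, len(seq) - window + 1, step)
    ]
    if not stats:
        raise ValueError("sequence is shorter than one window")
    # HCP: at least one window with CpG ratio > 0.65 and GC content > 55%
    if any(oe > 0.65 and gc > 0.55 for gc, oe in stats):
        return "HCP"
    # LCP: no window with CpG ratio > 0.45
    if all(oe <= 0.45 for gc, oe in stats):
        return "LCP"
    # ICP: everything else
    return "ICP"
```

Calling `classify_promoter(seq)` on each promoter sequence gives one label per TSS, which can then be written out next to the promoter coordinates (for example as an extra BED column) and counted or plotted per class.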