Seqanswers Leaderboard Ad

**simonandrews** · 05-10-2011, 11:22 PM

Originally posted by asb2718 View Post

1. What is the ground truth for CpG islands? We have looked at several datasets but they seem to provide locations as detected by their software (example, EMBOSS by EBI). Clearly, these cannot be used as ground truth when we are developing newer methods. Could any of you shed light on this matter and suggest a good data set with an accompanying ground truth?

You should look at the work of Adrian Bird's group. They have generated a set of functional CpG islands which aren't based on sequence analysis. We've been using this set for much of our analysis and have found that many of the islands they detect, but which are missed by traditional algorithms are functionally interesting.

**kshankar** · 06-23-2012, 08:32 PM

Is there a way to get the CpG islands described by Illingworth et al, PLoS Biol. I did find this in Ensembl browser as a Misc track (CPG island clones), but cannot figure out a way to download the whole file, after trying all day. Is there was a simple way just to get a bed file for these CGIs? Any help would be great, thanks.

**PeteH** · 06-24-2012, 04:35 AM

You might also be interested in work done in Rafael Irizarry's lab. Their method is based on sequence analysis using a statistical procedure called a hidden Markov model to define CpG islands, rather than the heuristic definition given in the classic Gardiner-Garden and Frommer paper. The link includes references to the relevant papers as well as downloadable CpG island definitions for several species using their definition. There is also code for generating CpG islands for other organisms.
Pete

**simonandrews** · 06-24-2012, 11:14 PM

Originally posted by kshankar View Post

Is there a way to get the CpG islands described by Illingworth et al, PLoS Biol. I did find this in Ensembl browser as a Misc track (CPG island clones), but cannot figure out a way to download the whole file, after trying all day. Is there was a simple way just to get a bed file for these CGIs? Any help would be great, thanks.

We've certainly got a file with all of these in but it was a while back so I'd need to go back to see how we got them in the first place. I don't think we pulled them from Ensembl (we usually download these kinds of tracks through table browser at UCSC but I'm not sure if that was the case with this data). If all else fails I can stick our copy up on our website if you like?

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 30 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 32 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 28 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 53 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

CpG island detection

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News