Seqanswers Leaderboard Ad

**GenoMax** · 07-20-2016, 04:18 AM

Paul: Have you tried BioMart from Ensembl? You can find some help/video's on this page.

**SylvainL** · 07-21-2016, 06:49 AM

Using R...

Ref_annotations is your gff file you have to import using the function import.gff2 (with asRangedData=FALSE)
Ref_genome is your genome imported using read.DNAStringSet

The following code should give you the starting base of the first annotated exon of each gene

Code:

B <- Ref_annotations[which(seqnames(Ref_annotations) %in% names(Ref_genome))]
C <- B[which(strand(B) == "+")]
f <- as.factor(elementMetadata(C)$gene_name)
rg <- split(C,f)
rh <- unlist(range(rg))
end(rh) <- start(rh)
start(rh) <- start(rh)
names(rh) <- levels(f)
D <- rh
C <- B[which(strand(B) == "-")]
f <- as.factor(elementMetadata(C)$gene_name)
rg <- split(C,f)
rh <- unlist(range(rg))
start(rh) <- end(rh)
end(rh) <- end(rh)
names(rh) <- levels(f)
E <- rh
F <- sort(c(D, E))

Then you can export F as a bed file (function export.bed)

Hope it helps...

**pkstarstorm05** · 07-31-2016, 02:25 PM

Hi GenoMax and SylvainL,

Thanks so much for your suggestions and time! They were both very helpful.

For anyone later who comes across this post - I strongly urge you to familiarize yourself with biomaRt. Its a powerful tool for extracting all kinds of useful information.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 18 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 22 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 17 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 48 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Retrieving promoter sequences using gene symbol

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News