**dariober** · 06-13-2012, 05:52 AM

Hi,

If you can get gene length data, you can pass it as a vector to the bias.data argument of nullp. The required format is described in the goseq manual (http://www.bioconductor.org/packages.../doc/goseq.pdf):

5.1 Length data format
The length data must be formatted as a numeric vector, of the same length as the main named vector specifying gene names/DE genes. Each entry should give the length of the corresponding gene in bp. If length data is unavailable for some genes, that entry should be set to NA.
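
As a rough illustration of that format, here is a minimal base R sketch. The gene names and lengths are made up, and `known.lengths` is a hypothetical stand-in for whatever length table you have. The point is only that the length vector is built over all assayed genes, in the same order as the named gene vector, with NA where no length is known.

```r
# Hypothetical gene identifiers, invented for illustration only.
assayed.genes <- c("geneA", "geneB", "geneC", "geneD", "geneE")
de.genes <- c("geneB", "geneD")

# Named 0/1 vector over ALL assayed genes (1 = differentially expressed).
gene.vector <- as.integer(assayed.genes %in% de.genes)
names(gene.vector) <- assayed.genes

# Hypothetical lengths in bp, known for only some genes and in a different order.
known.lengths <- c(geneD = 2100, geneA = 1500, geneB = 980)

# Indexing by name reorders the lengths to match gene.vector and
# yields NA for genes without a known length (geneC and geneE here).
gene.length <- unname(known.lengths[names(gene.vector)])

# The check that nullp enforces: one length entry per entry of the gene vector.
stopifnot(length(gene.length) == length(gene.vector))
print(gene.length)

# With real data and goseq loaded, the call would then be:
# pwf <- nullp(gene.vector, bias.data = gene.length)
```

Note that the length vector has one entry per assayed gene, not one per DE gene.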

Good luck!
Dario

**SanderEST** · 06-13-2012, 06:16 AM

Thanks a lot! I should have read the manual more carefully. It actually resolves my current issues clearly, but thank you for pointing that out!

Sander

**jfrias** · 03-16-2013, 04:16 AM

Dear Dario,

I am having the same problems Sander had, and even when following the format suggested in the manual I cannot get rid of them. I really do not know what I am doing wrong.
I created a mock set of results to test the procedure. This is the code I am using:

> de.genes <- scan("de_genes.txt", what=character() )
Read 27 items
> assayed.genes <- scan("all_genes.txt", what=character() )
Read 37 items
> gene.length=scan("gene_lengths.txt", what=numeric() )
Read 27 items
> names(gene.vector) = assayed.genes
> pwf=nullp(gene.vector,bias.data=gene.length)
Error in nullp(gene.vector, bias.data = gene.length) :
bias.data vector must have the same length as DEgenes vector!

R is telling me the size of de.genes and gene.length is the same, but it still gives me the error message. I would really appreciate it if someone could help me with this problem.

Thanks

Jorge

**thanhhoang** · 10-24-2013, 09:26 AM

Hi guys,
I have a similar problem when working with goseq. The mm10 genome is supported, but not with my gene ID type (I am using geneSymbol).
I am trying to get the length information by following the goseq manual, but I still don't understand how. Could you please show me how to get the length information for the mm10 genome with geneSymbol as the gene ID?

> genes = as.integer(all.genes %in% F.genes)
> names(genes) = all.genes
> head(genes)
Cryba1 Cryba4 Cryga Crygb Crygc Crygd
1 1 1 1 1 1
> pwf=nullp(genes,"mm10", "geneSymbol")
Can't find mm10/geneSymbol length data in genLenDataBase... Trying to download from UCSC. This might take a couple of minutes.
Error in value[[3L]](cond) :
Length information for genome mm10 and gene ID geneSymbol is not available. You will have to specify bias.data manually.
Thank you so much


GOSeq analysis problem with geneLenDataBase
