![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
DESeq - Error: subscript out of bounds | crh | Bioinformatics | 6 | 07-28-2015 03:56 AM |
DESeq error | clemen | Bioinformatics | 22 | 03-27-2014 04:59 AM |
DESeq Plot Error | coralgirl | Bioinformatics | 2 | 11-29-2011 09:35 AM |
RNAseq analysis using DESeq | katussa10 | Bioinformatics | 9 | 08-29-2011 06:32 AM |
Error message in DESeq (estimateVaranceFunctions, preplot.locfit.raw) | aquila | RNA Sequencing | 4 | 02-02-2011 08:42 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Member
Location: Europe Join Date: Sep 2011
Posts: 16
|
![]()
Hello,
I have RNA-seq data, which I am trying to analyze with DESeq. My file (.csv) appears to be correct >head(myfile) " gene_id VZ_w13 VZ_w14a VZ_w14b VZ_w15a VZ_w15b VZ_w16a 1 ENSG00000253101 0 0 0 0 0 0 ..." However, when I try >cds <- newCountDataSet(myfile,conds) I get the following error message: "Error in newCountDataSet(myfile, conds) : The countData is not integer." The problem, as far as I can tell, is that my data are numerical, not integer, because when I run >str(myfile) "'data.frame': 53507 obs. of 14 variables: $ VZ_w13 : num 0 0 0 0 8 0 0 0 0 0 ..." Does anyone have a way to make my data integer, not numerical? As you can see, the data are in fact integers. Thanks, Stephan |
![]() |
![]() |
![]() |
#2 |
Member
Location: Germany Join Date: May 2010
Posts: 24
|
![]()
If that is really the problem then applying as.integer to your columns should work (note that: as.integer(12.9) == 12)
Cheers, Paul
__________________
"You are only young once, but you can stay immature indefinitely." |
![]() |
![]() |
![]() |
#3 |
Member
Location: Europe Join Date: Sep 2011
Posts: 16
|
![]()
Excuse my ignorance. How do I apply as.integer to my columns?
|
![]() |
![]() |
![]() |
#4 |
Member
Location: Germany Join Date: May 2010
Posts: 24
|
![]()
Hi,
here a toy example, which you can just transfer to your data-set: Code:
> data.frame(SomeValue=seq(0,2,0.4)) SomeValue 1 0.0 2 0.4 3 0.8 4 1.2 5 1.6 6 2.0 > the.data <- data.frame(SomeValue=seq(0,2,0.4)) > the.data$SomeValue [1] 0.0 0.4 0.8 1.2 1.6 2.0 > as.integer(the.data$SomeValue) [1] 0 0 0 1 1 2 > the.data$SomeValue <- as.integer(the.data$SomeValue) > the.data$SomeValue [1] 0 0 0 1 1 2 Code:
> the.data <- data.frame(SomeValue=seq(0,2,0.4), AnotherValue=seq(4,6,0.4)) > the.data SomeValue AnotherValue 1 0.0 4.0 2 0.4 4.4 3 0.8 4.8 4 1.2 5.2 5 1.6 5.6 6 2.0 6.0 > apply( the.data, 2, as.integer ) SomeValue AnotherValue [1,] 0 4 [2,] 0 4 [3,] 0 4 [4,] 1 5 [5,] 1 5 [6,] 2 6 Cheers, Paul
__________________
"You are only young once, but you can stay immature indefinitely." |
![]() |
![]() |
![]() |
#5 |
Senior Member
Location: Heidelberg Join Date: Aug 2010
Posts: 165
|
![]()
Hi Stephen, looks like the first column of your data are the geneIDs (not integers). Just remove this column and it should be ok. You could use the row.names option when reading the file.
|
![]() |
![]() |
![]() |
#6 |
Member
Location: Europe Join Date: Sep 2011
Posts: 16
|
![]()
Got it working now. Thanks for your help.
|
![]() |
![]() |
![]() |
Tags |
countdata, deseq, formatting |
Thread Tools | |
|
|