![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
DESeq: question about with replicates and without any replicates. | nb509 | RNA Sequencing | 2 | 10-25-2011 07:04 AM |
DESeq: question about baseMean | Azazel | Bioinformatics | 2 | 05-22-2011 06:40 PM |
DESeq: question about baseMean. Also, replicates. | Azazel | Bioinformatics | 5 | 05-18-2011 11:51 PM |
DESeq question | gfmgfm | Bioinformatics | 2 | 04-18-2011 04:15 AM |
Another DESeq question | shurjo | Bioinformatics | 2 | 05-15-2010 10:25 PM |
![]() |
|
Thread Tools |
![]() |
#21 |
Junior Member
Location: Birmingham, UK Join Date: Jul 2010
Posts: 5
|
![]()
Thank you Wolfgang Huber for you help, now the plot command is working but still I didnt get a proper plot.
Now I got a blank plot, I am not understanding y I got a blank plot instead of MvA plot IS it because of the presence of "-Inf" in log2FoldChange column? |
![]() |
![]() |
![]() |
#22 |
Junior Member
Location: Seattle Join Date: Jun 2010
Posts: 5
|
![]()
Hello Simon and Wolfgang -
Could any of you elaborate on the presence of variance outliers? I've done several DE tests and I've sometimes noticed values in resVarA and resVarB differ a lot for one specific gene (8x, 20x). The manual (pg 10) suggests to excercise caution during interpretation, but is there any rule-of-thumb to determine one of these values is large compared to the other? Thanks in advance, Daniel |
![]() |
![]() |
![]() |
#23 |
Member
Location: Qatar Join Date: Jun 2010
Posts: 15
|
![]()
Hi,
I have installed DESeq and all seems right: > biocLite("DESeq") Using R version 2.11.1, biocinstall version 2.6.9. Installing Bioconductor version 2.6 packages: [1] "DESeq" Please wait... Warning: unable to access index for repository http://brainarray.mbni.med.umich.edu...d/contrib/2.11 trying URL 'http://www.bioconductor.org/packages/2.6/bioc/bin/macosx/leopard/contrib/2.11/DESeq_1.0.6.tgz' Content type 'application/x-gzip' length 1533404 bytes (1.5 Mb) opened URL ================================================== downloaded 1.5 Mb but when I try to use > cds <- newCountDataSet( countsTable, conds ) Error: could not find function "newCountDataSet" what seems to be the problem? |
![]() |
![]() |
![]() |
#24 |
Senior Member
Location: Heidelberg, Germany Join Date: Aug 2009
Posts: 109
|
![]()
Afadda,
did you type library("DESeq") to load the library? Also, you're using an old version of R and - what's more of a problem - a rather old version of DESeq. I'd recommend to update. Wolfgang
__________________
Wolfgang Huber EMBL |
![]() |
![]() |
![]() |
#25 |
Member
Location: Qatar Join Date: Jun 2010
Posts: 15
|
![]()
ok. I'm ignorant in R...
I don't seem to be able to get the newCountDataSet to work. I get this error: Error in round(countData) : Non-numeric argument to mathematical function I checked, and non of the cells in the table contain alphabets. But, the class of one of the columns is "factor". I don't know why is it so and how to get around this. thanks, |
![]() |
![]() |
![]() |
#26 |
Member
Location: Qatar Join Date: Jun 2010
Posts: 15
|
![]()
how about this error
> cds= newCountDataSet( table, conds ) Error in if (any(round(countData) != countData)) stop("The countData is not integer.") : missing value where TRUE/FALSE needed here all the table columns are of class integer. if anyone would just give me a clue on what's wrong with my data |
![]() |
![]() |
![]() |
#27 |
Senior Member
Location: Heidelberg, Germany Join Date: Feb 2010
Posts: 994
|
![]()
Please send the output of the commands
Code:
head(table) Code:
str(table) (By the way, "table" is not a good name for a variable, as it shadows the function "table" that you may need later.) On the long run, it may also pay to learn a bit about R. Maybe read a few pages into the Introduction to R. |
![]() |
![]() |
![]() |
#28 |
Member
Location: Qatar Join Date: Jun 2010
Posts: 15
|
![]()
> head(table)
WT1 WT2 T Tb927.1.10 0 0 0 Tb927.1.30 0 0 0 Tb927.1.40 0 0 0 Tb927.1.110 2260 2914 4825 Tb927.1.270 214 278 359 Tb927.1.280 205 331 589 > str(table) 'data.frame': 7882 obs. of 3 variables: $ WT1: int 0 0 0 2260 214 205 628 26 379 142 ... $ WT2: int 0 0 0 2914 278 331 724 58 591 261 ... $ T : int 0 0 0 4825 359 589 1027 27 757 1380 ... |
![]() |
![]() |
![]() |
#29 |
Senior Member
Location: Heidelberg, Germany Join Date: Feb 2010
Posts: 994
|
![]()
Sorry, I didn't notice that you have missing values, according to your error message.
So, use the command Code:
which( is.na(table), arr.ind=TRUE ) Code:
table[ c(7, 12, 17), ] Of course, you could simply remove the offending rows with: Code:
cleanTable <- table[ -which( is.na(table), arr.ind=TRUE )[,"row"], ] |
![]() |
![]() |
![]() |
#30 |
Member
Location: Qatar Join Date: Jun 2010
Posts: 15
|
![]()
thanks Simon. it's working now!
|
![]() |
![]() |
![]() |
#31 |
Member
Location: malaysia Join Date: Mar 2009
Posts: 40
|
![]()
Hi Simon,
I'm new to DE analysis. Just tried DESeq for my dataset. FYI, the dataset don't have replicate. When running libsizes Code:
> libsizes <- c(C=6821199 , E=6037299) Code:
> sizeFactors (cds) C E 1 1 |
![]() |
![]() |
![]() |
#32 |
not just another member
Location: Belgium Join Date: Aug 2010
Posts: 264
|
![]()
Doing DE analysis without any replicate in any of the condition is a none sense. You can not have significant results without replicates.
For your question : I tried with artificial data and it works well. A <- data.frame(x=c(1000,10,10,10,0),y=c(100,2,2,2,1)) conds<-c("a","b") cds <- newCountDataSet(A,conds) cds <- estimateSizeFactors(cds) sizeFactors(cds) x y 2.2360680 0.4472136 |
![]() |
![]() |
![]() |
#33 |
Member
Location: malaysia Join Date: Mar 2009
Posts: 40
|
![]()
Hi NicoBxl
I've tried plying with the example dataset. I treated only two sample (T1b and T2) trying to mimic my dataset as there is no replicate. I still got the sizeFactors not to '1' (as shown below). So I believed non replicate dataset is not an issue here. Code:
> sizeFactors(cds) T1b T2 0.604218 1.655032 Still not sure what is the prob, why my sizeFactor = 1 on both sample. ![]() |
![]() |
![]() |
![]() |
#34 | |
Senior Member
Location: Heidelberg, Germany Join Date: Feb 2010
Posts: 994
|
![]()
Hi
The vignette is a bit badly worded here. Ignore the part about the 'libsizes', you do not need this, and the information is overwritten, once you call 'estimateLibrarySizes'. Have you done this and the got this here? Quote:
Code:
plot( counts(cds)[,1] + .01, counts(cds)[,2] + .01, log="xy" ) Simon |
|
![]() |
![]() |
![]() |
#35 |
Member
Location: malaysia Join Date: Mar 2009
Posts: 40
|
![]()
Hi Simon,
Thanks for the reply. Hope I got you right. Using my dataset I've follow exactly as instructed in the tutorial until I got Code:
> sizeFactors (cds) C E 1 1 I've create the plot. Attached is the plot |
![]() |
![]() |
![]() |
#36 | ||
Senior Member
Location: Heidelberg, Germany Join Date: Feb 2010
Posts: 994
|
![]() Quote:
Code:
cds <- newCountDataSet( countsTable, conds ) cds <- estimateSizeFactors( cds ) cds <- estimateVarianceFunctions( cds ) res <- nbinomTest( cds, "T", "N") Quote:
Simon |
||
![]() |
![]() |
![]() |
#37 |
Member
Location: malaysia Join Date: Mar 2009
Posts: 40
|
![]()
Hi Simon,
About the attachment, yup, tried to attach larger plot, but the maximum size attachment allowed is only 680x280. So the plot size has to be reduced. Sorry about that. Great. At least now I have some confident to work with the data. Thanks a lot for your explanation. ![]() |
![]() |
![]() |
![]() |
#38 |
Senior Member
Location: Heidelberg, Germany Join Date: Feb 2010
Posts: 994
|
![]()
Hi masterpiece
I've got another report from somebody who got exactly 1 as size factors, and so had a closer look. The problem is that the size factor estimation tends to round values when you have many genes with only very few counts. So, I've just improved the method slightly and the newest version of DESeq, version 1.3.3, now offers a new optional argument to 'estimateSizeFactors'. So, please try again, and now call estimateSizeFactors as follows: Code:
library( genefilter ) cds <- estimateSizeFactors( cds, locfunc=shorth ) Version 1.3.3 is available via SVN now, and for download in a day or so. Simon |
![]() |
![]() |
![]() |
#39 |
Member
Location: malaysia Join Date: Mar 2009
Posts: 40
|
![]()
Hi Simon,
It does make sense. Our data is from DGE sample and we dont do any count filtering on the data. I can say most of the sequence have very few count. I'll try redo the analysis. Thanks again |
![]() |
![]() |
![]() |
#40 |
Member
Location: Baton Rouge, Louisiana Join Date: Jun 2011
Posts: 11
|
![]()
Great program for processing data. I used it to find fold change and which genes had the greatest fold change between groups. I want to know if its possible to sort the data according to fold change and if I could view more than just the top six in the "head" display. Thank you much.
|
![]() |
![]() |
![]() |
Tags |
deseq, illumina, rna-seq |
Thread Tools | |
|
|