Seqanswers Leaderboard Ad

**Michael Love** · 07-16-2014, 06:35 AM

This is your countData, which has as many rows as genes:

Code:

mydata = read.table("matrix.txt", header=TRUE)
col1 <- mydata[,1]

It looks like this will be the colData (sample information table).

Code:

ExpDesign = data.frame(row.names=col1, condition=c("C", "C", "C", "A", "A", "A", "B", "B", "B")

...which has as many rows as samples.

So the error comes when you try to name the rows of your colData using the gene names in col1.

You will also get an error later when you try to run

Code:

assay( mydata )

because mydata is a data.frame. assay() is a function for getting a matrix from SummarizedExperiment objects. You can just use

Code:

as.matrix( mydata )

in order to supply a matrix to DESeqDataSet.

**rookie_genomics** · 03-19-2019, 08:51 AM

Hi,

I followed your advice and tried to import as a matrix. But when I try to set up col.data I still get an error

This is my code

deseq2_analysis2 <- read_excel("deseq2_analysis2.xlsx")
> View(deseq2_analysis2)
> analysis3 <- as.matrix(deseq2_analysis2)
> (condition <- factor(c(rep("group1", 4), rep("group2", 4), rep("group3", 4), rep("group4", 4))))
[1] group1 group1 group1 group1 group2 group2 group2 group2 group3 group3 group3 group3 group4
[14] group4 group4 group4
Levels: group1 group2 group3 group4
> (coldata <- data.frame(row.names=colnames(analysis3), condition))
Error in data.frame(row.names = colnames(analysis3), condition) :
row names supplied are of the wrong length

This is my result for head command

head(deseq2_analysis2)
# A tibble: 6 x 17
gene Sample1_group1 Sample2_group1 Sample3_group1 Sample4_group1 Sample1_group2 Sample2_group2
<chr> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
1 YAL0~ 0 0 0 0 2 0
2 YAL0~ 0 0 0 0 0 0
3 YAL0~ 243 242 109 130 271 233
4 YAL0~ 16 7 52 30 23 10
5 YAL0~ 23 21 21 33 11 28
6 YAL0~ 38 42 76 88 47 40
# ... with 10 more variables: Sample3_group2 <dbl>, Sample4_group2 <dbl>, Sample1_group3 <dbl>,
# Sample2_group3 <dbl>, Sample3_group3 <dbl>, Sample4_group3 <dbl>, Sample1_group4 <dbl>,
# Sample2_group4 <dbl>, Sample3_group4 <dbl>, Sample4_group4 <dbl>

What am I doing wrong here?

**rookie_genomics** · 03-19-2019, 09:05 AM

Originally posted by Michael Love View Post

This is your countData, which has as many rows as genes:

Code:

mydata = read.table("matrix.txt", header=TRUE)
col1 <- mydata[,1]

It looks like this will be the colData (sample information table).

Code:

ExpDesign = data.frame(row.names=col1, condition=c("C", "C", "C", "A", "A", "A", "B", "B", "B")

...which has as many rows as samples.

So the error comes when you try to name the rows of your colData using the gene names in col1.

You will also get an error later when you try to run

Code:

assay( mydata )

because mydata is a data.frame. assay() is a function for getting a matrix from SummarizedExperiment objects. You can just use

Code:

as.matrix( mydata )

in order to supply a matrix to DESeqDataSet.

Hi,

I followed your advice and tried to import as a matrix. But when I try to set up col.data I still get an error

This is my code

deseq2_analysis2 <- read_excel("deseq2_analysis2.xlsx")
> View(deseq2_analysis2)
> analysis3 <- as.matrix(deseq2_analysis2)
> (condition <- factor(c(rep("group1", 4), rep("group2", 4), rep("group3", 4), rep("group4", 4))))
[1] group1 group1 group1 group1 group2 group2 group2 group2 group3 group3 group3 group3 group4
[14] group4 group4 group4
Levels: group1 group2 group3 group4
> (coldata <- data.frame(row.names=colnames(analysis3), condition))
Error in data.frame(row.names = colnames(analysis3), condition) :
row names supplied are of the wrong length

This is my result for head command

head(deseq2_analysis2)
# A tibble: 6 x 17
gene Sample1_group1 Sample2_group1 Sample3_group1 Sample4_group1 Sample1_group2 Sample2_group2
<chr> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
1 YAL0~ 0 0 0 0 2 0
2 YAL0~ 0 0 0 0 0 0
3 YAL0~ 243 242 109 130 271 233
4 YAL0~ 16 7 52 30 23 10
5 YAL0~ 23 21 21 33 11 28
6 YAL0~ 38 42 76 88 47 40
# ... with 10 more variables: Sample3_group2 <dbl>, Sample4_group2 <dbl>, Sample1_group3 <dbl>,
# Sample2_group3 <dbl>, Sample3_group3 <dbl>, Sample4_group3 <dbl>, Sample1_group4 <dbl>,
# Sample2_group4 <dbl>, Sample3_group4 <dbl>, Sample4_group4 <dbl>

What am I doing wrong here?

Topics	Statistics	Last Post
Evaluating Genome Sequencing for ECMO Patients in the NICU by seqadmin Started by seqadmin, 12-17-2024, 10:28 AM	0 responses 22 views 0 likes	Last Post by seqadmin 12-17-2024, 10:28 AM
New Genetic Toolkit Refines Studies on Gene Function and Disease by seqadmin Started by seqadmin, 12-13-2024, 08:24 AM	0 responses 42 views 0 likes	Last Post by seqadmin 12-13-2024, 08:24 AM
Study Links Brain Mechanism to Emotional Responses in Animals and Humans by seqadmin Started by seqadmin, 12-12-2024, 07:41 AM	0 responses 28 views 0 likes	Last Post by seqadmin 12-12-2024, 07:41 AM
Study Identifies Ribosomal RNA Fingerprints as Early Cancer Biomarkers by seqadmin Started by seqadmin, 12-11-2024, 07:45 AM	0 responses 42 views 0 likes	Last Post by seqadmin 12-11-2024, 07:45 AM

Seqanswers Leaderboard Ad

Announcement

DESeq2 error in data.frame (multiple treatments and multiple replicates)

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News