**dpryan** · 11-28-2016, 01:07 PM

It looks like you have an extra space in front of all of your numbers and that's screwing everything up. Fix how the values are imported and ensure they're actually numbers and not strings.

**Michael.Ante** · 11-28-2016, 11:51 PM

I'm not very familiar with the StringTie pipeline, but I recommend avoiding Excel for most NGS-related analyses (see Zeeberg et al. 2004, "Mistaken Identifiers: Gene name errors can be introduced inadvertently when using Excel in bioinformatics").

Can you use the python script to get simple csv/tsv output?
[Update]
The prepDE.py script produces csv files. Import these directly into R; any selection and computation you've done in Excel can be done there as well.
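As a rough illustration of importing such a csv straight into R, here is a minimal sketch. The file contents, the sample names and the `gene_id` header are made-up stand-ins written to a temporary file, not the actual prepDE.py output from this thread.

```r
# Write a tiny stand-in for a count table (hypothetical values and names).
csv_path <- tempfile(fileext = ".csv")
writeLines(c(
  "gene_id,sample1,sample2",
  "geneA,10,25",
  "geneB,0,3",
  "geneC,142,98"
), csv_path)

# row.names = 1 moves the gene IDs out of the data and into the row names,
# so only the count columns end up in the matrix.
countdata <- as.matrix(read.csv(csv_path, row.names = 1))

# Check that the matrix holds numbers before passing it on to DESeq2.
is.numeric(countdata)
```

If `is.numeric()` returns FALSE here, some column still contains text, and looking at `head(countdata)` and `tail(countdata)` is a reasonable next step.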

**Schisto** · 11-29-2016, 02:18 AM

I have double-checked and there is no extra space in any of my cells. That is actually the reason I later saved this file as Excel.

The python script gives me the gene counts in csv format. I have of course tried that too, and it gives the same error.

Using the same file in edgeR for example works without issues.

**Michael.Ante** · 11-29-2016, 03:21 AM

As a first step, try:
countdata <- as.matrix(read_excel("DEseqcounts.xlsx"),header=TRUE, row.names=1)

Then check:
summary(is.numeric(countdata[,1]))

Maybe there are some empty lines at the end, which would cause R to read the columns as factors rather than numbers. You can check this with tail(countdata).

**Schisto** · 11-29-2016, 03:28 AM

The class of countdata[,1] is "character"

summary(is.numeric(countdata[,1]))
Mode FALSE NA's
logical 1 0

class(countdata[,1])
[1] "character"

That should be the issue I guess?

Thanks for your help!
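A character first column is what you would expect if the gene IDs are still part of the matrix, because a matrix can hold only one type. The sketch below reproduces this with a made-up table and shows one possible fix. It assumes the gene IDs sit in the first column; the names `raw`, `gene_id`, `sample1` and `sample2` are hypothetical.

```r
# Toy table standing in for the imported spreadsheet (made-up values).
raw <- data.frame(
  gene_id = c("geneA", "geneB", "geneC"),
  sample1 = c(10, 0, 142),
  sample2 = c(25, 3, 98),
  stringsAsFactors = FALSE
)

# Converting everything at once gives a character matrix: the text column
# forces the counts to be stored as text too (as.matrix() formats them,
# which can also pad them with leading spaces).
as_text <- as.matrix(raw)
class(as_text[, 2])

# Drop the ID column before converting and keep the IDs as row names.
countdata <- as.matrix(raw[, -1])
rownames(countdata) <- raw$gene_id
storage.mode(countdata) <- "integer"

is.numeric(countdata)
```

The same idea should carry over to a table read with read_excel(): convert it to a plain data frame, move the ID column into the row names, and only then call as.matrix().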

DEseq2 - some values in assay are negative
