**blancha** · 10-19-2015, 09:41 AM

Garbage in, garbage out.

There may be so many densely packed points on the plot that it appears entirely black.

On two occasions I have been asked, "Pray, Mr. Babbage, if you put into the machine wrong figures, will the right answers come out?" ... I am not able rightly to apprehend the kind of confusion of ideas that could provoke such a question.
— Charles Babbage, Passages from the Life of a Philosopher

**blancha** · 10-19-2015, 10:02 AM

As usual, @dpryan's answer in the other thread is better.

Still, the plot is so uniformly black that I'm not sure even plotting the density will work.
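
For readers who do hit genuine overplotting, here is a minimal sketch of one common workaround: drawing semi-transparent points so that dense regions look darker than sparse ones. The data are simulated (an assumption, not the poster's file), and this is a simpler stand-in for a true density plot, not the method suggested in the other thread.

```r
# Hypothetical data: 2000 simulated points, not the poster's real file.
set.seed(1)
x <- rnorm(2000)
y <- x + rnorm(2000)

# Semi-transparent black: overlapping points add up to darker regions.
point_col <- rgb(0, 0, 0, alpha = 0.1)

# Use a null graphics device when run non-interactively (e.g. with Rscript).
if (!interactive()) pdf(NULL)
plot(x, y, pch = 16, col = point_col, xlab = "x", ylab = "y")
if (!interactive()) dev.off()
```

If a plot of only a few thousand points still comes out solid black with transparent points, overplotting is probably not the cause.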

**Marcos Lancia** · 10-19-2015, 10:08 AM

I only plotted 2000 points, so I don't think it is a densely packed points issue. It must be something else that I don't understand. I'm just learning bioinformatics by making mistakes, so please be patient with me. Thanks.

**blancha** · 10-19-2015, 10:17 AM

The best troubleshooting step in programming is always to simplify your problem.
You should also examine your data before plotting it.

Plot fewer points and check whether you can then see them.
For example, plot just the first 10 points.
The plot should not be dark.
If it is, you're having issues with the rendering of your plot.

Code:

plot(d1[,7][1:10], d1[,18][1:10])

If you've concluded that you don't have any rendering issues, check the number of points you are plotting.

Code:

nrow(d1)

If the number of points is reasonable, check the first few points.

Code:

head(d1[,7])
head(d1[,18])

For a simple scatter plot, Excel will also work fine.
Obviously, R opens up a whole world of possibilities, but it definitely takes a bit of time to master.
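
The data checks above can be extended to the column types as well, since a column that looks numeric may have been read as something else. The following is a minimal sketch on a made-up data frame (the names `d`, `depth`, and `count` are hypothetical stand-ins for `d1` and its columns).

```r
# Hypothetical data frame standing in for d1; column names are made up.
d <- data.frame(
  depth = factor(c("0", "2,1", "3,5")),  # numbers stored as a factor
  count = c(10, 20, 30)                  # genuinely numeric column
)

# Compact overview: type and first values of each column.
str(d)

# Class of every column in one call.
col_classes <- sapply(d, class)
print(col_classes)

# as.numeric() on a factor returns the internal level codes, not the labels.
codes <- as.numeric(d$depth)
print(codes)
```

A column reported as "factor" or "character" is a hint that the values need cleaning before they can be plotted as numbers.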

**Marcos Lancia** · 10-20-2015, 04:08 AM

Hi! Thanks for your help again. I figured out that my data were "factors" instead of "numerics". So, I did a transformation:

d1 <- read.csv("all filter1.csv", stringsAsFactors = FALSE, header = TRUE)
trans <- as.numeric(d1[, 7])

But a warning message appears:

NAs introduced by coercion

It replaces every value other than 0 with NA, so my plot is a single point at (0, 0).

How can I solve that??

Thanks a lot!

**blancha** · 10-20-2015, 06:34 AM

Marcos Lancia wrote: "I figured out that my data were "factors" instead of "numerics"."

This is why I prefer fread from data.table.
It is much faster, and the default settings are nearly always correct.
Strings are never read as factors by fread.

Code:

# Install it once before loading it the first time: install.packages("data.table")
library(data.table)
d1 <- fread("all filter1.csv", data.table = FALSE)

Marcos Lancia wrote: "It replaces all my data different from 0 by NA."

Are you sure these values are not strings containing non-numeric characters?
Strings that cannot be parsed as numbers are converted to NA, as illustrated in the following example.

Code:

> as.numeric("1")
[1] 1
> as.numeric("a")
[1] NA
Warning message:
NAs introduced by coercion

Check your column first, before plotting it.

Code:

head(d1[,7])
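
When the column is long, head() may not show the problematic entries. A minimal sketch of how to list only the values that fail the conversion, assuming a character vector `x` with made-up contents standing in for `d1[,7]`:

```r
# Hypothetical character column; the values are made up for illustration.
x <- c("0", "2,1", "0", "3,5", "7")

# Convert quietly, then flag entries that became NA only through the conversion.
converted <- suppressWarnings(as.numeric(x))
failed <- is.na(converted) & !is.na(x)

# The distinct strings that as.numeric() could not parse.
bad_values <- unique(x[failed])
print(bad_values)
```

Looking at the failing strings usually makes the cause obvious, for example a stray unit, a thousands separator, or an unexpected decimal mark.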

**Marcos Lancia** · 10-20-2015, 09:00 AM

Hi again. You're right. My data are "characters" after reading the file with fread as you suggested. Numbers other than 0 are replaced with NAs.

**blancha** · 10-20-2015, 09:50 AM

Do your numbers have commas in them, by any chance?
It's the only problem I can think of.
If they do, just replace the commas with periods.

Code:

> as.numeric("2.1")
[1] 2.1
> as.numeric("2,1")
[1] NA
Warning message:
NAs introduced by coercion

Depending on your locale, it could be the opposite: the comma could be the decimal separator in your locale, whereas your column could use the period as the decimal separator.
Check in your column whether your decimal separator is a period or a comma.
Test in the R console whether as.numeric will work with the decimal separator you are using. Enter the number between quotes.
If you obtain NAs, you have identified your bug.
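
A minimal sketch of two ways to handle comma decimals, using made-up values. The first replaces the comma in strings that are already loaded. The second declares the decimal mark at read time through the `dec` argument of read.csv, and assumes a semicolon-separated file, which may not match the actual layout of the poster's CSV.

```r
# Hypothetical values with a comma as the decimal mark.
x <- c("0", "2,1", "3,5")

# Option 1: swap the comma for a period, then convert.
fixed_values <- as.numeric(sub(",", ".", x, fixed = TRUE))
print(fixed_values)

# Option 2: tell the reader which decimal mark the file uses.
# The inline text stands in for a semicolon-separated file (an assumption).
csv_text <- "id;value\n1;2,1\n2;3,5\n"
d <- read.csv(text = csv_text, sep = ";", dec = ",")
print(class(d$value))
```

Option 2 avoids touching the file itself, but it only helps when the field separator is not also a comma.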

**Marcos Lancia** · 10-20-2015, 10:39 AM

Done!

Yes! You're right again! I replaced the commas with dots and it worked!
Thank you very much, I think I'm loving you right now.


Dark plot ???
