Seqanswers Leaderboard Ad

**Simon Anders** · 03-20-2013, 09:57 AM

Originally posted by SEQnovice View Post

In this case, should I consider that Samples A,B,C,D are all biological replicates of Subgroup 2, and Samples X,Y,Z,F,W as biological replicates of Subgroup3?

Short answer: Yes.

Longer answer: I should probably write a more extensive answer on this as you are not the first one to ask with question, discussing why the term "biological replicate" is actually quite an abuse of terminology, that did manage to cause quite some confusion. I'll get to that.

**Simon Anders** · 03-20-2013, 10:01 AM

I should add: For a comparison of cancer types, three and four samples are usually way too few, and even more so, if you don't have matched healthy tissue samples from the same patients, so I wouldn't be too optimistic about your results.

Also, are you sure that you got _more_ hits with "blind" than with the standard work-flow? Should be the other way round.

**SEQnovice** · 03-20-2013, 10:18 AM

Hi Simon,
Thanks for the very speedy answer! The cancer types are slightly larger (9 vs 6), I was just putting out a generic question, but you are right that they are still quite few in number either way.

I guess the main confusion is that in this case, the pooled samples are not truly biological replicates. In my scenario I would have originally considered that biological replicates are if I had multiple cancer samples per patient for each subgroup, so Sample A1, A2, etc. ...I look forward to reading your explanation on this.

And yes, I did get more hits with blind than standard workflow, which was why I started questioning the issue of replicates. I did have a look at the variance between the gene counts for the samples of each of my subgroups, there doesn't seem to be a high degree of variation in these samples with the exception of 8-9 genes that are outliers per subgroup.
This may be explain why the number of differentially expressed is quite poor?

I will run it again just to be sure and let you know.
Thanks,
Deena

**Simon Anders** · 03-20-2013, 10:27 AM

"sharing-mode="fit-only"' is extremely sensitive to outliers, which are turned into false positives. This is why we recommend to avoid it (except for the blind mode where it is unavoidable). So all the extra hits are probably false positives.

This whole stuff with the sharing mode is a bit of a hack, and replacing this with something more well founded was one of the main motivations for developing DESeq2.

**SEQnovice** · 03-20-2013, 10:38 AM

Thanks, I will look into DESeq2.

But just for the sake of completing this exercise, I am assuming the following dispersion estimation is correct?

cds = estimateDispersions( cds, method="blind", sharingMode="maximum", fitType="parametric" )

Why wouldn't you use "pooled" or "per-condition" here for the method? Just thinking with regards to dealing with the outliers.

Thanks,
Deena

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 59 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 57 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 51 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 56 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Determining Replicates for DESeq?

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News