Seqanswers Leaderboard Ad

**mastal** · 02-06-2014, 07:03 AM

Did you run biological replicates of any of the samples, or did you just have one sample per condition?

You can't really do DE unless you have replicates, especially since they were run under different conditions.

**lincw** · 02-06-2014, 07:23 AM

I don't have biological replicates. So I can't say DE, but I should able to use the RPKM values to compare the expression level of each genes, right?

**bruce01** · 02-07-2014, 03:09 AM

Originally posted by lincw View Post

I don't have biological replicates. So I can't say DE, but I should able to use the RPKM values to compare the expression level of each genes, right?

Without even considering the real problem of not having replicates (ie biological variation) you will have a problem based on the differing amounts of sequence in this case. If you have 2x as much sequence in sample A vs sample B you cannot know if RPKM differs because of this, or because of abundance of transcript. You may be able to reduce the 150bp to 50bp (random sampling?) but I cannot see any reviewers accepting results from such a study because it is not possible to do requisite statistical analysis and so any 'result' is conjecture. You could check RPKM and then do qPCR on genes you found of interest?

**lincw** · 02-07-2014, 03:58 AM

Originally posted by bruce01 View Post

Without even considering the real problem of not having replicates (ie biological variation) you will have a problem based on the differing amounts of sequence in this case. If you have 2x as much sequence in sample A vs sample B you cannot know if RPKM differs because of this, or because of abundance of transcript. You may be able to reduce the 150bp to 50bp (random sampling?) but I cannot see any reviewers accepting results from such a study because it is not possible to do requisite statistical analysis and so any 'result' is conjecture. You could check RPKM and then do qPCR on genes you found of interest?

Thank you, I have more clear idea about this now.

**mbblack** · 02-07-2014, 06:37 AM

Originally posted by lincw View Post

The DE results made we confused. Which results should we trust? From the assembly with 50 bp reads or with 150 bp reads? If we using qPCR to qualify them, what will happen?

Does anyone can give me some advises?

Many thanks,

Chung-Wen

As far as picking genes for qPCR and what you will see, it is impossible to tell. Its already reasonably well known that DGE results correlate best with qPCR when the differentially expressed genes are selected based on the simultaneous application of both a statistical threshold and a fold change threshold.

That is, if the differentially expressed genes were both statistically significant and passed some minimum fold change cutoff (1.5 fold, 2.0 fold or whatever), then the qPCR genes will more often also be statistically significant and changing in the same direction (albeit the actual fold change may still not correlate terribly well, for all sorts of reasons).

In my personal experience, selecting differentially expressed genes solely by fold change generally gives poor or little correlation with qPCR results, at least for genes with moderate changes in expression (extremely high fold change usually correlates, but then again, those are often, at best, only the most trivially interesting genes).

Without biological replicates, you have zero statistics to base your selection on, so the best you can do is pick genes, run the qPCR, and see what you get. But do not be surprised if you get far less validation then you wished for.

**thomasblomquist** · 02-07-2014, 08:39 AM

The issue with not having biological/library replicates, aside, yes, you can trim the 3' 100 bases from your reads to achieve a pseudo-50 base read length.

I've done this for comparison purposes of the same library prep split to a 50 base read on Hiseq and a 150 base read chemistry on miseq.

The results were identical fitting a Poisson sampling curve distribution pattern for the sequencing sampling step. Thus normal sampling laws apply between the two platforms with slightly different colonization kits,etc.. However, this does not account for the cumulative sampling variance that is far greater doing biological replicates, which encompasses (and not limited to) differential extraction of RNA, differential efficiencies of RT to cDNA, differential ligation efficiences which interact with differential fragmentation phenomena between specimens, differential plateau rates of the limited PCR dscDNA creation steps, fractionation of the library with purification... ... Then, ontop of all that is the normal Poisson sampling that occurs on the flow cell of the prepped library. :-)

If these libraries, were prepped separately, I would be extremely cautious in comparing and drawing any costly conclusions. There are a number of articles delineating the issue with comparison between separate library preps, let alone the need for at least 2-3 biological preparations depending on the fold change you expect to see.

Be cautious in interpreting your data.

-Tom

Topics	Statistics	Last Post
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Today, 08:47 AM	0 responses 11 views 0 likes	Last Post by seqadmin Today, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 59 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 54 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM

Seqanswers Leaderboard Ad

Announcement

Differential expression results of different read length

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News