Seqanswers Leaderboard Ad

**bruce01** · 08-12-2013, 07:44 AM

So you have 3 groups of mice, each with three biological replicates? And the image shows the expression normalised to TBP. First I don't think 3 is a high enough number of replicates, I understand the reasons for this, I am not criticising.

From the image the counts do not look to be wildly different between replicates for the genes (maybe Gene3), is this the case? What sort of counts/FPKMs do you get for these genes (and for your house-keepers too?)

Also what are the basics of the RNAseq data: numbers of reads total, aligning, etc, just out of interest?

**ffinkernagel** · 08-12-2013, 07:46 AM

Barplots are nice, but they do hide the original intensities measured.

What were your genes ct and absolute tag counts (out of how many reads)?

**dpryan** · 08-12-2013, 07:53 AM

I always to qPCR follow-up, but my particular experiments pretty much require it. In my experience, if I do qPCR on the same samples on which I did RNAseq, then the results are very close. The differences come as I increase my N by adding additional samples. In effect, the results show me how well my original samples can be used to estimate actual biological variability. So, the more samples I use, generally the more reliable things turn out to be.

Also, keep in mind if you're using Taqman assays that while they may be billed as being 100% efficient, that's not always the case (this can lead to scale mismatches, though should result in some of the direction changes that you've seen).

**DonDolowy** · 08-12-2013, 08:00 AM

Samples were 8-plexed giving us 25-30 million mapped reads. This should be more than sufficient for gene expression analysis. We are not interested in splicing events etc, but use RNAseq as an alternative to microarray.

This is the info from Cuffdiff / CummeRbund.
CuffSet instance with:
3 samples
22986 genes
30963 isoforms
25081 TSS
24612 CDS
68958 promoters
75243 splicing
60939 relCDS

Attached are the Ct values, as well as the FPKM values for the different genes.

Attached Files

Ct_qPCR_FPKM.png (48.1 KB, 140 views)

**dpryan** · 08-12-2013, 08:02 AM

Out of curiousity, did you analyse the data with anything other than cufflinks (edgeR/DESeq/limma/etc.)? Depending on the version, cufflinks has had issues

**bruce01** · 08-12-2013, 08:13 AM

@dpryan: says he used HTSeq->DESeq in first post

@Don: The barplot does not show what is in the table, eg gene1 FPKM is ~20-30 but the plot shows normalised value (with TBP) of ~1, should be ~5 based on table. Are they FPKM in plot, or is it counts?

**dpryan** · 08-12-2013, 08:24 AM

Originally posted by bruce01 View Post

@dpryan: says he used HTSeq->DESeq in first post

Oops, missed that, thanks!

**Simon Anders** · 08-12-2013, 08:24 AM

Please tell us the raw count values, not the FPKM values. This is crucial to determine how much precision you can expected.

There is also something odd with your barchart: Gene 2 seems to go down significantly in the qPCR results, but in your table, it does not look like this: The first ct value for Gene 2 in Group #3 is even higher than all other values. How have you calculated the error bars?

To give a general answer to your question:

The precision of expression strength estimates from RNA-Seq data depends on the raw counts. The relative standard error can never be lower than 1/square_root(n), where n is the number of reads mapping to the gene in the sample under consideration. For genes with less than 100 counts, the error will always be at least 10%, which corresponds to a standard error of log2(1+0.1)=0.14 cycles.

**NextGenSeq** · 08-12-2013, 09:25 AM

We never use oligo-dT for RNA-Seq or any expression profiling for that matter. It gives 3' biased libraries (or expression arrays).

**DonDolowy** · 08-12-2013, 09:43 AM

Thanks for all your input so far.

Anders, you are absolutely right. I forgot to mention that the values are first normalized to TBP and then normalized to Ctrl group, which is the first black group on the bar plot. Thus the values around 1.

What I am comparing is wildtype (black) versus heterozygous mouse exhibiting two different phenotypes (grey and white).

Attached are the analysis done by DESeq. I have included the raw counts from the htseq count matrix, as well as the normalized counts from DESeq.

Regarding 3' bias. When looking at gene body coverage using RSeqC, the 3'bias is ranging from almost non-existing to very mild in the samples.

Attached Files

Topics	Statistics	Last Post
Evaluating Genome Sequencing for ECMO Patients in the NICU by seqadmin Started by seqadmin, 12-17-2024, 10:28 AM	0 responses 39 views 0 likes	Last Post by seqadmin 12-17-2024, 10:28 AM
New Genetic Toolkit Refines Studies on Gene Function and Disease by seqadmin Started by seqadmin, 12-13-2024, 08:24 AM	0 responses 52 views 0 likes	Last Post by seqadmin 12-13-2024, 08:24 AM
Study Links Brain Mechanism to Emotional Responses in Animals and Humans by seqadmin Started by seqadmin, 12-12-2024, 07:41 AM	0 responses 38 views 0 likes	Last Post by seqadmin 12-12-2024, 07:41 AM
Study Identifies Ribosomal RNA Fingerprints as Early Cancer Biomarkers by seqadmin Started by seqadmin, 12-11-2024, 07:45 AM	0 responses 46 views 0 likes	Last Post by seqadmin 12-11-2024, 07:45 AM

Seqanswers Leaderboard Ad

Announcement

RNA-seq analysis is not consistent with qPCR results

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News