SEQanswers

09-13-2015, 02:52 PM   #1
Jasseq
Junior Member
 
Location: Europe

Join Date: Sep 2015
Posts: 2
Sequence in (biological) duplicate or not?

Dear all,

I am wondering: how much (methodological) advantage is there in doing RNA-seq in duplicate (i.e. 2 biological replicates) compared to sequencing a single sample per condition?

I do not have the resources for 3 replicates (or more). Samples will be pooled anyhow (from 6 subjects), but I can pool them into 1 (= 6 subjects) or 2 (= 2 x 3 subjects) samples per condition.

Of course, the statistics will still be sub-optimal with only 2 measurements per condition compared to >=3, but on the other hand the reliability should be better than with 1. I am mainly interested in differential expression, btw.

Thanks!
09-13-2015, 11:04 PM   #2
morgane51
Junior Member
 
Location: zurich

Join Date: Sep 2015
Posts: 4

I was asking myself the same question, although in my case I could afford 3 replicates.
Keep in mind that duplicates mean twice as much work but not twice as much power. Also, with only 2 replicates you introduce noise that you cannot model or correct for.
If your duplicates agree, you gain power. If they don't, you cannot explain why.
Pooling the samples brings in biological noise that could make you miss some signals; on the other hand, with a single sample per condition you have no way to spot inconsistencies caused by noise.
I don't know if I am clear enough... But I think it is better to have duplicates, as long as you do not sacrifice too much coverage.
Still, the biggest improvement comes from going from 2 to 3 replicates, not from 1 to 2...
This is really well explained in the manuals for DESeq and edgeR. They advise against using a single replicate.
What is your design exactly?
Keep in mind that you will have wasted all the money if your design doesn't allow for a correct analysis...
Good luck!
Morgane
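
One rough way to see why a single replicate is so limiting is to count the degrees of freedom left over for estimating within-group variability. The sketch below assumes a plain one-mean-per-group model for a single gene; `residual_df` is a hypothetical helper, not part of DESeq or edgeR, and those tools additionally share information across genes, so this is only an illustration of the general point.

```python
def residual_df(n_groups: int, replicates_per_group: int) -> int:
    """Degrees of freedom left to estimate within-group variance
    after fitting one mean per group (simple per-gene model, an assumption)."""
    if n_groups < 1 or replicates_per_group < 1:
        raise ValueError("need at least one group and one replicate per group")
    return n_groups * (replicates_per_group - 1)


# Two conditions, 1 to 3 biological replicates each.
for reps in (1, 2, 3):
    print(f"{reps} replicate(s) per group -> {residual_df(2, reps)} residual df")
```

With one replicate per group nothing is left to estimate the noise from the data of that gene alone; every added replicate contributes one more degree of freedom per group.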
09-15-2015, 01:28 AM   #3
Jasseq
Junior Member
 
Location: Europe

Join Date: Sep 2015
Posts: 2

Thanks for your reply.
Specific design is: Two groups, 8 brain regions, 6 individual samples per brain region (mice).
Individual samples can be pooled per 3 (i.e. 2 pooled biological replicates per brain region, 32 pooled samples in total to sequence), or per 2 (48 pooled samples to sequence).
The fact that I want to analyse 8 brain regions is what makes the cost climb so steeply with the number of samples...
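
The library counts quoted above can be reproduced with a few lines of Python. This is only a sketch: it assumes the 6 individual samples are per group and per brain region (which is what the totals of 32 and 48 imply), and `pooled_libraries` is a hypothetical helper name.

```python
GROUPS = 2            # two experimental groups
REGIONS = 8           # eight brain regions
ANIMALS_PER_CELL = 6  # assumed: 6 animals per group per region


def pooled_libraries(pool_size: int) -> tuple[int, int]:
    """Return (pooled replicates per group and region, total libraries)."""
    if pool_size < 1 or ANIMALS_PER_CELL % pool_size != 0:
        raise ValueError("pool size must divide the number of animals evenly")
    replicates = ANIMALS_PER_CELL // pool_size
    return replicates, GROUPS * REGIONS * replicates


for pool_size in (6, 3, 2, 1):
    replicates, total = pooled_libraries(pool_size)
    print(f"pool of {pool_size}: {replicates} replicate(s) per cell, {total} libraries")
```

Pooling by 3 gives the 32 libraries mentioned in the post, and pooling by 2 gives 48.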
09-15-2015, 05:37 AM   #4
HESmith
Senior Member
 
Location: Bethesda MD

Join Date: Oct 2009
Posts: 505

Be aware that triplicates do not require 50% more sequence data than duplicates. Given the same total number of reads, more replicates provide more statistical power to detect differential gene expression (i.e., 3 reps @ 10M reads is better than 2 reps @ 15M). See this paper for details. So the only added expense for triplicates vs duplicates is the cost of library prep.
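
The cost argument can be checked with simple arithmetic. In the sketch below the prices are made-up placeholders, not real quotes, and `run_cost` is a hypothetical helper; the point is only that at a fixed total read count the two designs differ by exactly one library prep per condition.

```python
def run_cost(replicates: int, reads_per_replicate_m: int,
             price_per_million_reads: int, price_per_library: int) -> tuple[int, int]:
    """Return (total reads in millions, total cost) for one condition."""
    total_reads_m = replicates * reads_per_replicate_m
    cost = total_reads_m * price_per_million_reads + replicates * price_per_library
    return total_reads_m, cost


# Placeholder prices in arbitrary currency units (assumptions for illustration).
PRICE_PER_MILLION_READS = 10
PRICE_PER_LIBRARY = 100

triplicate = run_cost(3, 10, PRICE_PER_MILLION_READS, PRICE_PER_LIBRARY)
duplicate = run_cost(2, 15, PRICE_PER_MILLION_READS, PRICE_PER_LIBRARY)

print("3 reps @ 10M:", triplicate)
print("2 reps @ 15M:", duplicate)
print("extra cost of the third replicate:", triplicate[1] - duplicate[1])
```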

Last edited by HESmith; 09-15-2015 at 05:38 AM. Reason: typo
Tags
biological replicates, rnaseq, statistical design
