Coverage for duplicates

billstevens

Senior Member

Join Date: Mar 2012

Posts: 120
- Share
- Tweet
#1

Coverage for duplicates

04-05-2012, 08:52 AM

Hi guys,

So I'm doing a 100bp PE reads on the human genome with 3 different conditions. I ran one set on one lane, and I just finished looking at the data. The plan originally was that after I looked at the data, if it looked like there were good differences (there are), I would run two sets of samples on one more lane.

However, now that the moment of truth is here, I'm wondering if this is the right move. Should I instead run only one more set of data for one lane, and just use my results in duplicate?

Sorry, I'm a such a newbie, so I'm not sure what stats are pertinent so let me just give what I have. The RNA is very good, and in the set that I have results for, I got 90% read alignment. After I used cuffdiff, cummeRbund gave me 300 significantly differntially expressed genes. So 30 of the 300 had "values" under 1. However, of the 300, about 40 had over a two fold difference, and of these 40, 25 of them had a "value" under 1.

For microarrays, a two fold change is the minimum you can have to call it useful. If that's the rule on sequencing, then I worry that if I split up my lane in essentially half, I'll lose 25 of the 40 genes that were very significantly differentially expressed.

Any thoughts or suggestions?

Thanks so much!
Tags: None
billstevens

Senior Member

Join Date: Mar 2012

Posts: 120
- Share
- Tweet
#2

04-06-2012, 10:55 AM

So I had some more info if anyone was searching and came across this. First off, so the values are FPKM values (I had a deadline yesterday, and I was running around all crazy and didn't actually stop and THINK).

So to address this problem, see how many mappable reads you have. I had approximately 100 M per condition. If I use two sets of samples on lane, I'll end up with 50 M per condition. With an FPKM value of 1, I'd have 50 fragments. I spoke to an Illumina Tech who told me they hear in the 30-50 range as the minimum in which you can still use. Is this what everyone else has heard?

Someone else seemed to wonder this as well:

Min FPKM Value for Diff Exp - SEQanswers

http://seqanswers.com/forums/showthread.php?t=9776&highlight=minimum+FPKM

Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc

Last edited by billstevens; 04-06-2012, 11:22 AM.
Comment

Previous template Next

Essential Discoveries and Tools in Epitranscriptomics

by seqadmin

The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
- Channel: Articles
Yesterday, 07:01 AM
Current Approaches to Protein Sequencing

by seqadmin

Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
- Channel: Articles
04-04-2024, 04:25 PM

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 39 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 41 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 35 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 55 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Coverage for duplicates

Comment

Latest Articles

ad_right_rmr

News