Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Coverage for duplicates

    Hi guys,

    So I'm doing a 100bp PE reads on the human genome with 3 different conditions. I ran one set on one lane, and I just finished looking at the data. The plan originally was that after I looked at the data, if it looked like there were good differences (there are), I would run two sets of samples on one more lane.

    However, now that the moment of truth is here, I'm wondering if this is the right move. Should I instead run only one more set of data for one lane, and just use my results in duplicate?

    Sorry, I'm a such a newbie, so I'm not sure what stats are pertinent so let me just give what I have. The RNA is very good, and in the set that I have results for, I got 90% read alignment. After I used cuffdiff, cummeRbund gave me 300 significantly differntially expressed genes. So 30 of the 300 had "values" under 1. However, of the 300, about 40 had over a two fold difference, and of these 40, 25 of them had a "value" under 1.

    For microarrays, a two fold change is the minimum you can have to call it useful. If that's the rule on sequencing, then I worry that if I split up my lane in essentially half, I'll lose 25 of the 40 genes that were very significantly differentially expressed.

    Any thoughts or suggestions?

    Thanks so much!

  • #2
    So I had some more info if anyone was searching and came across this. First off, so the values are FPKM values (I had a deadline yesterday, and I was running around all crazy and didn't actually stop and THINK).

    So to address this problem, see how many mappable reads you have. I had approximately 100 M per condition. If I use two sets of samples on lane, I'll end up with 50 M per condition. With an FPKM value of 1, I'd have 50 fragments. I spoke to an Illumina Tech who told me they hear in the 30-50 range as the minimum in which you can still use. Is this what everyone else has heard?

    Someone else seemed to wonder this as well:
    Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc
    Last edited by billstevens; 04-06-2012, 11:22 AM.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Essential Discoveries and Tools in Epitranscriptomics
      by seqadmin


      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
      Yesterday, 07:01 AM
    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    39 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    41 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 09:21 AM
    0 responses
    35 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-04-2024, 09:00 AM
    0 responses
    55 views
    0 likes
    Last Post seqadmin  
    Working...
    X