Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • beeman
    Member
    • May 2012
    • 20

    FPKM values differ for sample sample between comparisons?

    Hi I have three biological samples (lets call them Sample_A-C)that I want to find DEGs between. Firstly, I'm performing pairwise comparisons between samples using cuffdiff using alignments generated using tophat.

    The problem is when I compare the fpkm value of genes in Sample_A (in the gene_exp.diff file), the values differ depending on whether the comparison of Sample_A was against Sample_B or Sample_C.

    How is this difference generated? I thought when running cuffdiff the fpkm value would be static and would originate from the mapping data in the bam/sam file used as input for cuffdiff.

    Here is just the header and the first gene in the gene_exp.diff file.
    Sample_A vs Sample_B
    test_id gene_id gene locus sample_1 sample_2 status value_1 value_2 log2(fold_change) test_stat p_value q_value significant
    ACYPI000001 ACYPI000001 acyp2eg0000191 chr1:2365250-2374181 q1 q2 OK 21.6621 25.2228 0.219556 0.389799 0.71015 0.991694 no
    Sample_A vs Sample_C
    test_id gene_id gene locus sample_1 sample_2 status value_1 value_2 log2(fold_change) test_stat p_value q_value significant
    ACYPI000001 ACYPI000001 acyp2eg0000191 chr1:2365250-2374181 q1 q2 OK 26.0189 587.227 4.49629 4.16814 0.0011 0.422675 no

    So as you can see, when compared against Sample_B, the fpkm value of ACYPI000001 in Sample_A is 21.6621 and in the second comparison it's 26.0189

    There are even more pronounced differences in these datasets and I just don't get why these differences are arising.

    Can someone please shed some light on what may be underlying this?

    Thanks

    EDIT: I'm using cufflinks version 2.1.1
    cmd# cuffdiff -p 5 -o sample_A_vs_sample_B annotations.gff sample_A_accepted_hits.bam sample_B_accepted_hits.bam
    Last edited by beeman; 04-16-2014, 07:06 PM. Reason: adding more info
  • yueluo
    Member
    • Aug 2013
    • 82

    #2
    You might want to read about 'Library Normalization Methods' on the cufflinks manual:



    FPKMs and fragment counts are scaled via the median of the geometric means of fragment counts across all libraries, as described in Anders and Huber (Genome Biology, 2010). This policy identical to the one used by DESeq. (default for Cuffdiff)

    Comment

    • beeman
      Member
      • May 2012
      • 20

      #3
      Great thanks for the info

      Comment

      • bhennuy
        Junior Member
        • May 2009
        • 6

        #4
        Hello,
        Did you try to compare the three conditions in the same process with cuffdiff ?
        As explained, Cuffdiff should support that with "cuffdiff sample1 sample2 sample3". The 3 conditions should be compared instead of doing several pairwise comparison. In my case, it doesn't work ! Do you know if some options should be used to do that.
        Thanks

        Comment

        Latest Articles

        Collapse

        • SEQadmin2
          Nine Things a Sample Prep Scientist Thinks About Before Sequencing
          by SEQadmin2


          I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

          Here are nine questions we think about, in roughly the order they matter, before...
          06-18-2026, 07:11 AM
        • SEQadmin2
          From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
          by SEQadmin2


          Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


          The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
          ...
          06-02-2026, 10:05 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by SEQadmin2, Today, 05:37 AM
        0 responses
        5 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-26-2026, 11:10 AM
        0 responses
        16 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-17-2026, 06:09 AM
        0 responses
        50 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-09-2026, 11:58 AM
        0 responses
        109 views
        0 reactions
        Last Post SEQadmin2  
        Working...