Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Cufflinks and cuffdiff FPKM values

    I have questions about the FPKM values in each program.
    As I understand, the cufflinks or cuffdiff are counting the reads based on the gene structures, when the known annotations are provided. And FPKM value is a kind of absolute value for each transcript. In that case, can we expect the same FPKM value for the same sample, from each program, cuffdiff and cufflinks? or Is my understating not right?

    I tried two programs with same mapping results and same a known annotation file, but the FPKM values are different each other (from transcripts.expr and isoforms.fpkm_tracking files) Is cuffdiff differently estimating the expression of the transcript (FPKM) because of other sample?

    Thanks for any comments in advance.

  • #2
    Originally posted by combiochem View Post
    I have questions about the FPKM values in each program.
    As I understand, the cufflinks or cuffdiff are counting the reads based on the gene structures, when the known annotations are provided. And FPKM value is a kind of absolute value for each transcript. In that case, can we expect the same FPKM value for the same sample, from each program, cuffdiff and cufflinks? or Is my understating not right?

    I tried two programs with same mapping results and same a known annotation file, but the FPKM values are different each other (from transcripts.expr and isoforms.fpkm_tracking files) Is cuffdiff differently estimating the expression of the transcript (FPKM) because of other sample?

    Thanks for any comments in advance.
    I think cuffdiff uses a likelihood function to estimate FPKM which is not an absolute value (Read this thread: http://seqanswers.com/forums/showthread.php?t=3961)
    Which other program did you compare it with? If this other program is not using the same methodology, you will not get the same answer.

    Comment


    • #3
      Originally posted by thinkRNA View Post
      I think cuffdiff uses a likelihood function to estimate FPKM which is not an absolute value (Read this thread: http://seqanswers.com/forums/showthread.php?t=3961)
      Which other program did you compare it with? If this other program is not using the same methodology, you will not get the same answer.
      No, I'm not comparing the FPKM values from "other" program. I've compared the FPKM between cufflinks output and cuffdiff output. As far as I know, in cufflinks, when the known annotations are given, the FPKM for each transcript can be obtained (in one sample). Also in cuffdiff tracking file, the FPKM in each sample (q0 and q1) can be obtained. I've used exactly same annotations and mapping results (which have used for cufflinks) for cuffdiff running (therefore the read counts for each transcript should be same), but the FPKM outputs are different from cufflinks. I'm wondering whether the estimation is taking care of both samples in cuffdiff (which is different from cufflinks?) rather than each sample (q0 or q1).

      Comment


      • #4
        i actually went through the cufflinks-cuffcompare-cuffdiff pipeline and I also have the same problems. cufflinks and cuffdiff report different FPKM values for a same transcript.

        Comment


        • #5
          EDIT
          My question was answered in another thread.
          Last edited by Thomas Doktor; 11-23-2010, 05:02 AM. Reason: Question already answered

          Comment


          • #6
            Where I can find it? Please post the link, thanks!

            Comment


            • #7
              Please download the latest Cufflinks from the link below. I think now cufflinks and cuffdiff return consistent values.

              Comment


              • #8
                same problem with the latest Cufflinks version

                I am using cufflinks version 1.0.2 and have the same problem.
                Any suggestions?

                Comment


                • #9
                  This might explain it:
                  Cuffdiff and Cufflinks now accept new options controlling whether all hits are counted towards the FPKM denominator, or only those compatible with some transcript in the reference annotation. Counting only compatible hits avoids certain types of bias that arise when one sample contains far more hits that aren't compatible with any transcript than the other sample does. For example, if one sample contains vastly more mapped ribosomal RNA hits, FPKM values will appear lower in that sample, potentially leading to false positive differential expression calls. Cuffdiff by default now uses only compatible hits. Cufflinks still uses total hits by default, as using compatible hit accounting requires a reference GTF.

                  Comment


                  • #10
                    Have anyone also seen that some isoforms exist in the files generated by Cufflinks but disappear in the files generated by Cuffcompare and Cuffdiff?

                    Comment


                    • #11
                      cufflinks or cuffcompare bugs?

                      1. New released 1.3.0, after Cuffcompare, FPKM column contains all 0, missing FPKM values even tracking files have them;

                      2. in all the versions of CuffDiff, if you compare different conditions against the same control samples, the FPKM in the same control samples in different comparing is different; for example,
                      CuffDiff I: condition 1 v.s. condition control;
                      CuffDiff II: condition 2 v.s condition control;

                      after CuffDiff, when FPKM numbers are tracked, the FPKM of Gene X in condition control in CuffDiff I is different from the FPKM of Gene X in condition control in CuffDiff II. GeneX roughly are 20-30% in total annotated genes and the rest are the same.
                      anybody has explanation or suggestions for this? Thanks!

                      Comment


                      • #12
                        Originally posted by lewewoo View Post
                        after CuffDiff, when FPKM numbers are tracked, the FPKM of Gene X in condition control in CuffDiff I is different from the FPKM of Gene X in condition control in CuffDiff II. GeneX roughly are 20-30% in total annotated genes and the rest are the same.
                        anybody has explanation or suggestions for this? Thanks!
                        Did you use the "-N" option? I found that FPKM are different when I used "-N" option. And FPKM is consistent between different comparison when I closed "-N" option.

                        Comment


                        • #13
                          hi tangx 2010,
                          i have used cuffdiff on the galaxy platform...what exaclty is the N option?

                          how can i select/deselect this option?

                          thanks,ib

                          Comment

                          Latest Articles

                          Collapse

                          • seqadmin
                            Strategies for Sequencing Challenging Samples
                            by seqadmin


                            Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                            03-22-2024, 06:39 AM
                          • seqadmin
                            Techniques and Challenges in Conservation Genomics
                            by seqadmin



                            The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                            Avian Conservation
                            Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                            03-08-2024, 10:41 AM

                          ad_right_rmr

                          Collapse

                          News

                          Collapse

                          Topics Statistics Last Post
                          Started by seqadmin, Yesterday, 06:37 PM
                          0 responses
                          10 views
                          0 likes
                          Last Post seqadmin  
                          Started by seqadmin, Yesterday, 06:07 PM
                          0 responses
                          9 views
                          0 likes
                          Last Post seqadmin  
                          Started by seqadmin, 03-22-2024, 10:03 AM
                          0 responses
                          49 views
                          0 likes
                          Last Post seqadmin  
                          Started by seqadmin, 03-21-2024, 07:32 AM
                          0 responses
                          67 views
                          0 likes
                          Last Post seqadmin  
                          Working...
                          X