Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Get the RPKM value of the genes analyzed using DESeq or edgeR

    Hi,

    I have done analyzation over RNA seq data using edgeR and DESeq to find DE genes (BAM files -> HTSeq -> edgeR and DEseq).
    For some comparisons I need to have the RPKM values related to each gene. What is the best way of getting it?

    Thank you in advance.

  • #2
    The version of edgeR on the Bioconductor developmental repository has a function rpkm().

    Comment


    • #3
      Originally posted by Gordon Smyth View Post
      The version of edgeR on the Bioconductor developmental repository has a function rpkm().
      Is it v3.08?

      Comment


      • #4
        Originally posted by ThePresident View Post
        Is it v3.08?
        No, the *development* repository:

        Differential expression analysis of RNA-seq and digital gene expression profiles with biological replication. Uses empirical Bayes estimation and exact tests based on the negative binomial distribution. Also useful for differential signal analysis with other types of genome-scale count data.

        Comment


        • #5
          Originally posted by Gordon Smyth View Post
          The version of edgeR on the Bioconductor developmental repository has a function rpkm().
          Thank you but can I ask how does this function calculate the gene length? Because my problem is that I do not know how to get the gene length to calculate the RPKM values. The gtf file I have used is the hg19 latest version from UCSC genome browser. I can do something like this: endposition - startpositoin+1. But there are different transcripts for each gene. which of these transcripts should be the as the source for the gene length? the average? the longest?

          Comment


          • #6
            Im wondering the exact same thing..

            Comment


            • #7
              An appropriate measure of gene length must be input to rpkm(). Computing gene length is a job for the read count software rather than for the differential expression software because the appropriate measure of gene length depends on the way the reads have been counted.

              I use subread and featureCounts:

              featureCounts is available under GNU General Public License as part of the Subread (http://subread.sourceforge.net) or Rsubread (http://www.bioconductor.org) software packages.


              to count reads. For most RNA-seq analyses, I count reads that overlap any exon for each gene, so the appropriate measure of gene length is the total exon length. Gene length is returned as part of the output from featureCounts.

              Comment


              • #8
                Thanks!

                Is there a difference in choosing CPM/logCPM/FPKM to represent gene expression level if I want to correlate the expression of one gene against i.e. body weight?

                Or the change in gene expression of one gene against change in body weight etc.

                Comment


                • #9
                  Originally posted by Gordon Smyth View Post
                  An appropriate measure of gene length must be input to rpkm(). Computing gene length is a job for the read count software rather than for the differential expression software because the appropriate measure of gene length depends on the way the reads have been counted.

                  I use subread and featureCounts:

                  featureCounts is available under GNU General Public License as part of the Subread (http://subread.sourceforge.net) or Rsubread (http://www.bioconductor.org) software packages.


                  to count reads. For most RNA-seq analyses, I count reads that overlap any exon for each gene, so the appropriate measure of gene length is the total exon length. Gene length is returned as part of the output from featureCounts.
                  I have used HTSeq for counting, I guess I can run featureCounts on one sample, just to get the gene lengths from my UCSC GTF file.

                  Then import the lengths to edgeR/DEseq fpkm()

                  Or am I missing some points here?

                  Comment

                  Latest Articles

                  Collapse

                  • seqadmin
                    Current Approaches to Protein Sequencing
                    by seqadmin


                    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                    04-04-2024, 04:25 PM
                  • seqadmin
                    Strategies for Sequencing Challenging Samples
                    by seqadmin


                    Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                    03-22-2024, 06:39 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by seqadmin, 04-11-2024, 12:08 PM
                  0 responses
                  30 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 10:19 PM
                  0 responses
                  32 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 09:21 AM
                  0 responses
                  28 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-04-2024, 09:00 AM
                  0 responses
                  53 views
                  0 likes
                  Last Post seqadmin  
                  Working...
                  X