Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • RNAseq data analysis

    I recently started some RNAsequencing of tumor samples. My main question is regarding the differential gene expression among samples and controls. I have successful utilized tophat, cufflinks, and cuffdiff. I am new to bioinformatics and RNAseq analysis. My experience utilizing Linux terminals for analysis is only recent. I have seen a few tutorials online for RNAseq, but some of them only take you through the cufflinks data retrieval. I am looking for what next....the steps and analysis after cuffdiff. Are there programs that exist or tutorials that exist for running a script in python or R for analysis of the data? How do others go about analyzing RNAseq data for differential expression among samples?

    Any advice on the downstream analysis of RNAseq data after the tuxedo suite would be greatly appreciated.

    Thanks!

  • #2
    cumbRbund is the next step in the tuxedo suite. After that it all depends on what you want to do.
    --------------
    Ethan

    Comment


    • #3
      Hi

      whilst you get much further with the command line, you can always try out our tool RobiNA http://mapman.gabipd.org/web/guest/robin which will give you a graphical user interface and run all necessary steps in the background to call differential expression. It also comes with a long manual.

      Apart from that you might check out the tutorials in the seqanswers for running your data through.

      Cheers,
      björn

      Comment


      • #4
        Originally posted by ETHANol View Post
        cumbRbund is the next step in the tuxedo suite. After that it all depends on what you want to do.
        yes, I have used cummeRbund to plot some of the graphs. Which look nice, but I am more curious in pulling out differential expressed genes. I would like to look at the data globally vs. picking and choosing genes. From my use of cummeRbund, I was able to obtain plots which show differences and similarities between samples however, I wasn't able to see where those differences were located (i.e., specific genes or novel transcripts) - just dots on a graph. Maybe I am missing the feature that allows one to export the transcripts or genes that are upregulated or downregulated x-fold from the control or other samples.

        Comment


        • #5
          Originally posted by usad View Post
          Hi

          whilst you get much further with the command line, you can always try out our tool RobiNA http://mapman.gabipd.org/web/guest/robin which will give you a graphical user interface and run all necessary steps in the background to call differential expression. It also comes with a long manual.

          Apart from that you might check out the tutorials in the seqanswers for running your data through.

          Cheers,
          björn
          Hi Bjorn,

          Thank you for the reply. And for the link on this tool. I already have the cuffdiff output and I would use command line but I am wondering where to begin. How do I retrieve all of the differential regulated genes or transcripts?

          -Tom

          Comment


          • #6
            This is a very good tutorial here, but it still leaves some questions unanswered.


            How do you determine what cuffdiff lists as noteworthy genes for upregulation or downregulated? I am only seeing the FPKM for each gene and the lo and high confidence. How can I see the genes that are predicted to be noteworthy as mentioned in this tutorial?

            This author took the log2 of all FPKM values? Isn't the FPKM a value of within samples expression compared to overall transcript numbers?

            I would also like to know what scripts or programs did he run in R?

            Any guidance or other tutorials that will lead me forward after obtaining cuffdiff file would be helpful. I have four samples and run cuffdiff with tophat bam files for each and the chicken annotation gtf.

            Thanks!

            Comment


            • #7
              There should be a file gene_exp.diff or something like that which has the results of the statistical test. Sort by qvalue and chose a cutoff you like. Is that what you are asking?

              You can then take your significantly expressed gene list and feed that into goseq if you want to do gene ontology analysis.
              --------------
              Ethan

              Comment


              • #8
                Originally posted by ETHANol View Post
                There should be a file gene_exp.diff or something like that which has the results of the statistical test. Sort by qvalue and chose a cutoff you like. Is that what you are asking?

                You can then take your significantly expressed gene list and feed that into goseq if you want to do gene ontology analysis.
                This is helpful. Thank you!

                Comment


                • #9
                  The tuxedo procotol article has a section on what you want: http://www.nature.com/nprot/journal/...tml#/procedure check steps 16-18, there is some simple R code that you could follow and expand.

                  Comment

                  Latest Articles

                  Collapse

                  • seqadmin
                    Current Approaches to Protein Sequencing
                    by seqadmin


                    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                    04-04-2024, 04:25 PM
                  • seqadmin
                    Strategies for Sequencing Challenging Samples
                    by seqadmin


                    Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                    03-22-2024, 06:39 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by seqadmin, 04-11-2024, 12:08 PM
                  0 responses
                  31 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 10:19 PM
                  0 responses
                  32 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 09:21 AM
                  0 responses
                  28 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-04-2024, 09:00 AM
                  0 responses
                  53 views
                  0 likes
                  Last Post seqadmin  
                  Working...
                  X