Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • What's the best RNA Seq Tuxedo suite tutorial/streamlined intro protocol?

    Hi all, new to this community. I've been attempting to learn the Tuxedo suite using their protocol "Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks." (http://www.ncbi.nlm.nih.gov/pubmed/22383036) but keep dead ending with errors. Is there a better tutorial, more up to date and streamlined tutorial out there? Thanks

  • #2
    What errors are you getting? I had followed the protocol with newer versions of tuxedo suite a few months ago (at least to cuffdiff part) and recall that it had worked.

    Comment


    • #3
      One of the GFF ID is duplicated, but even our resident lab coder couldn't find the problem. A lot of things aren't explained all that well in the protocol, hence my troubles. I'd really like to find an additional resource to aid my understanding.

      Code:
      cuffmerge -g genes.gtf -s genome.fa -p 8 assemblies.txt
      
      [Thu Jan 22 11:26:25 2015] Beginning transcriptome assembly merge
      -------------------------------------------
      
      [Thu Jan 22 11:26:25 2015] Preparing output location ./merged_asm/
      [Thu Jan 22 11:26:29 2015] Converting GTF files to SAM
      [11:26:30] Loading reference annotation.
      [11:26:31] Loading reference annotation.
      [11:26:33] Loading reference annotation.
      [11:26:34] Loading reference annotation.
      [11:26:36] Loading reference annotation.
      [11:26:38] Loading reference annotation.
      [Thu Jan 22 11:26:40 2015] Quantitating transcripts
      Warning: Your version of Cufflinks is not up-to-date. It is recommended that you upgrade to Cufflinks v2.2.1 to benefit from the most recent features and bug fixes (http://cufflinks.cbcb.umd.edu).
      Command line:
      cufflinks -o ./merged_asm/ -F 0.05 -g genes.gtf -q --overhang-tolerance 200 --library-type=transfrags -A 0.0 --min-frags-per-transfrag 0 --no-5-extend -p 8 ./merged_asm/tmp/mergeSam_tmp.6.EZGXyZ 
      [bam_header_read] EOF marker is absent. The input is probably truncated.
      [bam_header_read] invalid BAM binary header (this is not a BAM file).
      File ./merged_asm/tmp/mergeSam_tmp.6.EZGXyZ doesn't appear to be a valid BAM file, trying SAM...
      [11:26:40] Loading reference annotation.
      [11:26:52] Inspecting reads and determining fragment length distribution.
      Processed 11167 loci.                       
      > Map Properties:
      >	Normalized Map Mass: 68011.00
      >	Raw Map Mass: 68011.00
      >	Fragment Length Distribution: Truncated Gaussian (default)
      >	              Default Mean: 200
      >	           Default Std Dev: 80
      [11:26:54] Assembling transcripts and estimating abundances.
      Processed 11167 loci.                       
      [Thu Jan 22 11:27:27 2015] Comparing against reference file genes.gtf
      Warning: Your version of Cufflinks is not up-to-date. It is recommended that you upgrade to Cufflinks v2.2.1 to benefit from the most recent features and bug fixes (http://cufflinks.cbcb.umd.edu).
      Error: duplicate GFF ID 'FBtr0079998' encountered!
      	[FAILED]
      Error: could not execute cuffcompare

      Comment


      • #4
        Just want to confirm that you are using the test data (and not your own). I will take another look later this evening at this.

        Comment


        • #5
          Originally posted by GenoMax View Post
          Just want to confirm that you are using the test data (and not your own). I will take another look later this evening at this.
          Yes I am.I downloaded GSE32038_simulated_fastq_files.tar.gz from http://www.ncbi.nlm.nih.gov/geo/quer...i?acc=GSE32038 and alligned it against the igenome http://support.illumina.com/sequenci...e/igenome.html BDGP5

          Comment


          • #6
            I tried cuffmerge (using cufflinks v.2.2.1). No problems in getting the command to complete.

            Comment


            • #7
              Originally posted by GenoMax View Post
              I tried cuffmerge (using cufflinks v.2.2.1). No problems in getting the command to complete.
              Thanks, I'll update it. Is there an easy command for updating?

              Comment


              • #8
                Are you administering your own machine or do you need to get someone else to do it on a central server/cluster?

                Comment


                • #9
                  Originally posted by GenoMax View Post
                  Are you administering your own machine or do you need to get someone else to do it on a central server/cluster?
                  It's our lab's own mini supercomputer, so I do have root access to it

                  Comment


                  • #10
                    You will have to download and compile it: https://github.com/cole-trapnell-lab/cufflinks Make necessary changes to your path so you start using the new version (if you are keeping the old version around).

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Essential Discoveries and Tools in Epitranscriptomics
                      by seqadmin




                      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                      04-22-2024, 07:01 AM
                    • seqadmin
                      Current Approaches to Protein Sequencing
                      by seqadmin


                      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                      04-04-2024, 04:25 PM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, Yesterday, 11:49 AM
                    0 responses
                    15 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 04-24-2024, 08:47 AM
                    0 responses
                    16 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 04-11-2024, 12:08 PM
                    0 responses
                    61 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 04-10-2024, 10:19 PM
                    0 responses
                    60 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X