Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • cuffmerge help for novice please!

    Hi all,
    be gentle with me - its my first post! I have two conditions with 3 replicate files.
    Ive run tophat and cufflinks successfully on the RNAseq data.
    However when it comes to merging the gtf transcripts, despite following the nature protocol closely, cuffmerge is not running properly.

    Ive seen a few people post this question in different forums but I haven't seen anyone answer the query so I dont know what the solution is!
    Essentially after running the cuffmerge command on a compiled assemblies script file list, it looks like the operation is starting but after about 10 seconds I get returned to the command prompt. No error messages appear but the merged asm folder is empty and the program has clearly aborted. I just dont know why .

    Can anyone please help? Im running linux on an ubuntu parallels system with 12GB memory allocated. The bin folder contains symbolic links to the bowtie2 genome files and the gtf gene file. Thanks again if anyone can point me to things to check! Apologies in advance for any forum rules I have probably broken!

    $ cuffmerge -g genes.gtf -s genome.fa -p 4 assemblies.txt

    [Wed Oct 7 03:01:46 2015] Beginning transcriptome assembly merge
    -------------------------------------------

    [Wed Oct 7 03:01:46 2015] Preparing output location ./merged_asm/
    [Wed Oct 7 03:02:23 2015] Converting GTF files to SAM
    j12@ubuntu:~/bin$

  • #2
    What does your assemblies.txt file look like?

    Comment


    • #3
      Hi, file is as follows:

      /home/jcf1r12/rnaseq/JFcontrol1_clout/transcripts.gtf
      /home/jcf1r12/rnaseq/JFcontrol2_clout/transcripts.gtf
      /home/jcf1r12/rnaseq/JFcontrol3_clout/transcripts.gtf
      /home/jcf1r12/rnaseq/JFct1_clout/transcripts.gtf
      /home/jcf1r12/rnaseq/JFct2_clout/transcripts.gtf
      /home/jcf1r12/rnaseq/JFct3_clout/transcripts.gtf

      Comment


      • #4
        From cuffmerge help:

        Note that <seq_dir> must contain one fasta file per reference chromosome, and each file must be named after the chromosome, and have a .fa or .fasta extension.

        Comment


        • #5
          Thank you so much for suggestion.

          I have tried rerunning but with passing -s to the /Users/JF/Homo_sapiens/rnaseq/Ensembl/GRCh37/Sequence/Chromosomes directory which contains all individual named chromosome fa files but the same abort happens?

          Comment


          • #6
            Have you tried to remove/move the empty "merged_asm" folder from past runs and re-running?

            Comment


            • #7
              Originally posted by GenoMax View Post
              Have you tried to remove/move the empty "merged_asm" folder from past runs and re-running?
              Yes just tried that, sadly no different

              Comment


              • #8
                Are you using the latest cuff* programs? The transcript.gtf files in the cl_out directories are not empty, correct? Does the program run if you don't provide the reference GTF file? Since you are running in a VM also remove the -p option.

                Comment


                • #9
                  Originally posted by GenoMax View Post
                  Are you using the latest cuff* programs? The transcript.gtf files in the cl_out directories are not empty, correct? Does the program run if you don't provide the reference GTF file? Since you are running in a VM also remove the -p option.
                  Yes cufflinks is the latest version. The transcript.gtf files are all about 40MB. No the same thing happens without the gtf file (after it warns me that I havent included a reference file).

                  And similarly doesn't run when I remove the -p option either.

                  I wondered if it was something to do with samtools as when I check the version installed, it is an older version. Ive downloaded, unzipped and ran 'make' in the new version folder and then copied all the files to the bin directory but when I run dpkg it still states the old version of samtools.

                  Not sure if this is relevent but obviously Im struggling!

                  Comment


                  • #10
                    Did you compile cufflinks yourself or did you download pre-compiled binaries? Last I remember TopHat suite programs need the old version of Samtools (0.1.19).

                    Comment


                    • #11
                      Originally posted by GenoMax View Post
                      Did you compile cufflinks yourself or did you download pre-compiled binaries? Last I remember TopHat suite programs need the old version of Samtools (0.1.19).
                      The Nature protocol paper uses version 1.3 so this is what I have used. I downloaded the binary files and used following script:

                      tar zxvf /media/psf/Homo_sapiens/cufflinks-1.3.0.OSX_x86_64.tar.gz
                      cd cufflinks-1.3.0.OSX_x86_64/
                      cp * $HOME/bin
                      export PATH=$PATH:/home/JF/bin/cufflinks-1.3.0.OSX_x86_64

                      Comment


                      • #12
                        I wonder if that is part of the problem.

                        Can you download the latest cufflinks from here: http://cole-trapnell-lab.github.io/cufflinks/install/

                        What version of TopHat did you use?

                        Comment


                        • #13
                          I used tophat on an insitutional hub for speed. It runs version 2.0.14

                          I can try that but im pretty sure I have tried with a few cufflinks versions now. Do I just run a similar script as I posted above to install the new version of cufflinks or is there any other script I need to use to ensure I am running the up to date version? (or should i run cuffmerge from the cufflinks_2.2.1 folder?)

                          Comment


                          • #14
                            DOH!!!! Ok. . . who's spotted it.......

                            Comment


                            • #15
                              Make sure you run the cuffmerge from the cufflinks_2.2.1 folder (and the chromosome option). You may need to append your $PATH so that the 2.2.1 cuff* programs are found by the system.

                              Comment

                              Latest Articles

                              Collapse

                              • seqadmin
                                Essential Discoveries and Tools in Epitranscriptomics
                                by seqadmin




                                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                                04-22-2024, 07:01 AM
                              • seqadmin
                                Current Approaches to Protein Sequencing
                                by seqadmin


                                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                                04-04-2024, 04:25 PM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by seqadmin, Yesterday, 11:49 AM
                              0 responses
                              15 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-24-2024, 08:47 AM
                              0 responses
                              16 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-11-2024, 12:08 PM
                              0 responses
                              61 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-10-2024, 10:19 PM
                              0 responses
                              60 views
                              0 likes
                              Last Post seqadmin  
                              Working...
                              X