Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #16
    I did not pay close attention to your original question (i.e. you are getting the error while running tophat itself and not the actual script). My fault.

    Let us start over.

    Is caref_ncbiall available in the same directory that you are running tophat from? If not, can you provide the full path to that file when you execute tophat. The same should apply to cp04.fastq and caref_seq.gff. Also make certain that your gff file matches the format specification as noted here: http://cufflinks.cbcb.umd.edu/gff.html
    Last edited by GenoMax; 04-30-2014, 04:00 PM.

    Comment


    • #17
      Hi GenoMAx

      All the files are in the same directory. I also made sure that the names are same in gff and genome file.


      Thank you

      Varsha

      Comment


      • #18
        Hi

        I forgot to mention that the fastq file is also in same directory. The gff file and genome sequences were downloaded from NCBI (NCBI has separate fasta file for each chromosomes and all the unplaced scafolds are in one fasta file. I concatenated these files to make the genome file and renamed it as per the gff file. The same gff file is working with other genome sequence file which does not have unplaced sequences.


        Thank you

        Varsha

        Comment


        • #19
          TopHat is picky about the order of options on the command line. Can you try the following:

          Code:
          $ tophat -o cp04_thout5 -p 2 -G caref_seq.gff caref_ncbiall cp04.fastq
          Let me also verify that the basename for your genome index files is "caref_ncbiall", that is there are several files (that comprise of the index) that have that prefix?

          Comment


          • #20
            Originally posted by varshacp View Post
            Hi

            I forgot to mention that the fastq file is also in same directory. The gff file and genome sequences were downloaded from NCBI (NCBI has separate fasta file for each chromosomes and all the unplaced scafolds are in one fasta file. I concatenated these files to make the genome file and renamed it as per the gff file. The same gff file is working with other genome sequence file which does not have unplaced sequences.


            Thank you

            Varsha
            Does it mean that you have not created the "index" files for this combined fasta reference file? You will need to index the reference in order to use tophat. You can build the reference index using the directions here: http://tophat.cbcb.umd.edu/tutorial.shtml#ref

            Comment


            • #21
              HI GenoMAx

              The genome index was created using the same fasta file and is in the same directory

              Thankx

              Varsha

              Comment


              • #22
                Can you post a listing of the files in this directory?

                Also see my previous post about the order of options. If the genome index is correctly created then give that command line a try.

                Comment


                • #23
                  Originally posted by GenoMax View Post
                  TopHat is picky about the order of options on the command line. Can you try the following:

                  Code:
                  $ tophat -o cp04_thout5 -p 2 -G caref_seq.gff caref_ncbiall cp04.fastq
                  Let me also verify that the basename for your genome index files is "caref_ncbiall", that is there are several files (that comprise of the index) that have that prefix?

                  Hi

                  The basename is caref_ncbiall for the index files

                  Comment


                  • #24
                    Were you able to get tophat working?

                    Comment


                    • #25
                      Hi Genomax

                      The index is also build using the same genome sequence file in the same directory

                      Comment


                      • #26
                        Are things working now? Or are you still seeing an error?

                        Comment


                        • #27
                          HI GenoMax

                          I am still getting the same error.


                          Thank you

                          Comment


                          • #28
                            Varsha: Without seeing a listing of the files (related to this error, e.g. caref_ncbiall) in the directory you are running this from there is not much further help I can offer.

                            Comment


                            • #29
                              Hi GenoMax

                              The following is the list of file in the directory from which I am running the tophat command

                              caref_ncbiall.fa (genome sequence file)
                              caref_ncbiall.1.bt2 (bowtie index files)
                              caref_ncbiall.2.bt2
                              caref_ncbiall.3.bt2
                              caref_ncbiall.4.bt2
                              caref_ncbiall.rev.1.bt2
                              caref_ncbiall.rev.2.bt2
                              cp04.fastq (reads files)
                              caref_seq.gff (genome annotation file)

                              Thank you

                              Kind regards


                              Varsha

                              Comment


                              • #30
                                HI

                                I checked the log file and besides the run.log which I posted earlier I get the following error in the g2f.log file

                                terminate called after throwing an instance of 'std:ut_of_range'
                                what(): basic_string::substr


                                Help me to understand this

                                Thank you
                                Varsha

                                Comment

                                Latest Articles

                                Collapse

                                • seqadmin
                                  Essential Discoveries and Tools in Epitranscriptomics
                                  by seqadmin


                                  The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
                                  Yesterday, 07:01 AM
                                • seqadmin
                                  Current Approaches to Protein Sequencing
                                  by seqadmin


                                  Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                                  04-04-2024, 04:25 PM

                                ad_right_rmr

                                Collapse

                                News

                                Collapse

                                Topics Statistics Last Post
                                Started by seqadmin, 04-11-2024, 12:08 PM
                                0 responses
                                37 views
                                0 likes
                                Last Post seqadmin  
                                Started by seqadmin, 04-10-2024, 10:19 PM
                                0 responses
                                41 views
                                0 likes
                                Last Post seqadmin  
                                Started by seqadmin, 04-10-2024, 09:21 AM
                                0 responses
                                35 views
                                0 likes
                                Last Post seqadmin  
                                Started by seqadmin, 04-04-2024, 09:00 AM
                                0 responses
                                55 views
                                0 likes
                                Last Post seqadmin  
                                Working...
                                X