Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • GTF file for cuffdiff 0.9.1

    The GTF file I'm using to run cuffdiff has transcript IDs but no p_ids. Consequently, cuffdiff is unable to make the cds, promoters, splicing, and tss_groups files. Is there a database where I could get an improved GTF file? If not, what table schema in the UCSC genome browser have people used to construct their GTF files?

    Thanks!

  • #2
    Have you run cuffcompare on your samples first? Cuffcompare attaches p_ids and tss_ids to the combined GTF file that you can then use as input for Cuffdiff.

    Comment


    • #3
      Thanks! I tried using the .gtf file from cuffcompare as my reference gtf for cuffdiff as you suggested. This solved some problems, but created others. The isoforms, promoters, splicing, and tss files are now populated, but the cds files still aren't. The other thing that happened is that there were no recognizable gene names in any of the files created by cuffdiff with the cuffcompare .gtf file. Instead, the gene names were "XLOC..". I'm thinking there is a problem with my reference gtf file that I used in cuffcompare. Where can I find a better reference gtf for mm9?

      Comment


      • #4
        Hi,
        I'm facing the same issue with the mouse gtf file,
        It will be good that the gtf of major organism will be made available of the cufflinks page.
        Best,
        Ramzi
        Research Scientist - Bioinformatics
        Sidra Medical and Research Center

        Comment


        • #5
          from my understanding, to have p_id you need to run cuffcompare with the -s option. Also no gene names are showing up probably because the gtf that you are supplying it does not have a gene_name attribute in the 9th column, you should try the Ensembl GTF, that one has gene names http://uswest.ensembl.org/info/data/ftp/index.html

          Comment


          • #6
            fkuo: Thanks! I tried running cuffcompare with the -s option and was able to generate a p_id. Unfortunately, my troubles didn't stop there. My combined.gtf file contained tss and p ids that didn't really make much sense. This resulted in lots of NO TEST error messages when I ran cuffdiff. What did you use as your -r .gtf files? other .gtf? Also, did you use the -p option? if so, how do you specify the prefix?

            Comment


            • #7
              hi kalidaemon,

              for the -r, I used a combined reference gtf (UCSC, Ensembl, Refseq). For the -p option, you just used -p4 for 4 threads or --num-threads 4. hope this helps!

              Comment


              • #8
                no p_id attribute

                Originally posted by kalidaemon View Post
                fkuo: Thanks! I tried running cuffcompare with the -s option and was able to generate a p_id. Unfortunately, my troubles didn't stop there. My combined.gtf file contained tss and p ids that didn't really make much sense. This resulted in lots of NO TEST error messages when I ran cuffdiff. What did you use as your -r .gtf files? other .gtf? Also, did you use the -p option? if so, how do you specify the prefix?
                Hi kalidaemon,

                I see that you ran cuffcompare with the -s option and was able to generate a p_id. I tried this and still wasn't able to generate the attribute. Could you offer any tips?
                Many thanks!

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  Choosing Between NGS and qPCR
                  by seqadmin



                  Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...
                  10-18-2024, 07:11 AM
                • seqadmin
                  Non-Coding RNA Research and Technologies
                  by seqadmin




                  Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.

                  Nobel Prize for MicroRNA Discovery
                  This week,...
                  10-07-2024, 08:07 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, Yesterday, 06:09 AM
                0 responses
                10 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 10-30-2024, 05:31 AM
                0 responses
                12 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 10-24-2024, 06:58 AM
                0 responses
                21 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 10-23-2024, 08:43 AM
                0 responses
                52 views
                0 likes
                Last Post seqadmin  
                Working...
                X