Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • GTF file for cuffdiff 0.9.1

    The GTF file I'm using to run cuffdiff has transcript IDs but no p_ids. Consequently, cuffdiff is unable to make the cds, promoters, splicing, and tss_groups files. Is there a database where I could get an improved GTF file? If not, what table schema in the UCSC genome browser have people used to construct their GTF files?

    Thanks!

  • #2
    Have you run cuffcompare on your samples first? Cuffcompare attaches p_ids and tss_ids to the combined GTF file that you can then use as input for Cuffdiff.

    Comment


    • #3
      Thanks! I tried using the .gtf file from cuffcompare as my reference gtf for cuffdiff as you suggested. This solved some problems, but created others. The isoforms, promoters, splicing, and tss files are now populated, but the cds files still aren't. The other thing that happened is that there were no recognizable gene names in any of the files created by cuffdiff with the cuffcompare .gtf file. Instead, the gene names were "XLOC..". I'm thinking there is a problem with my reference gtf file that I used in cuffcompare. Where can I find a better reference gtf for mm9?

      Comment


      • #4
        Hi,
        I'm facing the same issue with the mouse gtf file,
        It will be good that the gtf of major organism will be made available of the cufflinks page.
        Best,
        Ramzi
        Research Scientist - Bioinformatics
        Sidra Medical and Research Center

        Comment


        • #5
          from my understanding, to have p_id you need to run cuffcompare with the -s option. Also no gene names are showing up probably because the gtf that you are supplying it does not have a gene_name attribute in the 9th column, you should try the Ensembl GTF, that one has gene names http://uswest.ensembl.org/info/data/ftp/index.html

          Comment


          • #6
            fkuo: Thanks! I tried running cuffcompare with the -s option and was able to generate a p_id. Unfortunately, my troubles didn't stop there. My combined.gtf file contained tss and p ids that didn't really make much sense. This resulted in lots of NO TEST error messages when I ran cuffdiff. What did you use as your -r .gtf files? other .gtf? Also, did you use the -p option? if so, how do you specify the prefix?

            Comment


            • #7
              hi kalidaemon,

              for the -r, I used a combined reference gtf (UCSC, Ensembl, Refseq). For the -p option, you just used -p4 for 4 threads or --num-threads 4. hope this helps!

              Comment


              • #8
                no p_id attribute

                Originally posted by kalidaemon View Post
                fkuo: Thanks! I tried running cuffcompare with the -s option and was able to generate a p_id. Unfortunately, my troubles didn't stop there. My combined.gtf file contained tss and p ids that didn't really make much sense. This resulted in lots of NO TEST error messages when I ran cuffdiff. What did you use as your -r .gtf files? other .gtf? Also, did you use the -p option? if so, how do you specify the prefix?
                Hi kalidaemon,

                I see that you ran cuffcompare with the -s option and was able to generate a p_id. I tried this and still wasn't able to generate the attribute. Could you offer any tips?
                Many thanks!

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  Advancing Precision Medicine for Rare Diseases in Children
                  by seqadmin




                  Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
                  12-16-2024, 07:57 AM
                • seqadmin
                  Recent Advances in Sequencing Technologies
                  by seqadmin



                  Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

                  Long-Read Sequencing
                  Long-read sequencing has seen remarkable advancements,...
                  12-02-2024, 01:49 PM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, 12-17-2024, 10:28 AM
                0 responses
                26 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 12-13-2024, 08:24 AM
                0 responses
                42 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 12-12-2024, 07:41 AM
                0 responses
                28 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 12-11-2024, 07:45 AM
                0 responses
                42 views
                0 likes
                Last Post seqadmin  
                Working...
                X