
  • Software for identification of Trinity/Cufflinks transcripts

    Are there any good available programs/scripts for analyzing assembled transcripts?

    I imagine something like a script that runs blastn, blastx and tblastx on each transcript and reports the best hit. That wouldn't be too hard to write, but I don't want to re-invent the wheel. I am also concerned that the highest-scoring hit reported by BLAST sometimes has a lot of gaps and is not the right result, while a shorter, lower-scoring hit is more likely correct; the only way I can think of to determine this accurately is manually.

    The genome I am interested in is poorly annotated, and particularly bad in my region of interest, so just using a reference gtf with my cufflinks transcripts would not be very helpful.
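One way to approximate the "best hit, but not a gappy one" idea is sketched below. It is only an illustration, assuming BLAST was run with tabular output (`-outfmt 6`, default column order). The function name `best_hits`, the threshold `max_gapopen_per_100bp`, and the example transcript and gene names are hypothetical, and the gap rate is a crude heuristic that does not replace manual inspection.

```python
import csv
import io

# Default column order of BLAST tabular output (-outfmt 6).
COLUMNS = ["qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
           "qstart", "qend", "sstart", "send", "evalue", "bitscore"]


def best_hits(handle, max_gapopen_per_100bp=1.0):
    """Pick one hit per query, preferring non-gappy hits over raw bit score.

    The threshold is an arbitrary assumption for illustration; tune it to
    your own data.
    """
    best = {}
    for row in csv.reader(handle, delimiter="\t"):
        if not row or row[0].startswith("#"):
            continue  # skip blank lines and comment lines
        hit = dict(zip(COLUMNS, row))
        hit["length"] = int(hit["length"])
        hit["gapopen"] = int(hit["gapopen"])
        hit["bitscore"] = float(hit["bitscore"])
        # Gap openings per 100 aligned positions.
        gap_rate = 100.0 * hit["gapopen"] / hit["length"]
        hit["gappy"] = gap_rate > max_gapopen_per_100bp
        # Rank non-gappy hits first, then by bit score.
        key = (not hit["gappy"], hit["bitscore"])
        current = best.get(hit["qseqid"])
        if current is None or key > (not current["gappy"], current["bitscore"]):
            best[hit["qseqid"]] = hit
    return best


# Made-up example rows: geneA scores higher but has many gap openings.
EXAMPLE = (
    "comp1_c0_seq1\tgeneA\t85.0\t2000\t250\t50\t1\t2000\t1\t2000\t0.0\t1500\n"
    "comp1_c0_seq1\tgeneB\t99.0\t800\t8\t0\t1\t800\t1\t800\t0.0\t1400\n"
    "comp2_c0_seq1\tgeneC\t90.0\t500\t30\t20\t1\t500\t1\t500\t1e-100\t400\n"
)

hits = best_hits(io.StringIO(EXAMPLE))
for query, hit in hits.items():
    flag = "check manually" if hit["gappy"] else "ok"
    print(query, hit["sseqid"], hit["bitscore"], flag)
```

A query whose only hits are gappy is still reported, but flagged, so that the manual checks are limited to the doubtful cases.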

  • #2
    Your subject title suggests you're working on Trinity transcripts, so why not try the workflows suggested on the Trinity website?



    In particular, it sounds like you might be interested in the Read Alignment / Abundance Estimation workflow:



    On the other hand, if you're working from cufflinks transcripts then you should probably use cufflinks for the initial analysis:

    Last edited by gringer; 02-29-2012, 12:08 AM. Reason: added cufflinks cuff link



    • #3
      I have already done all of that. I am talking about analysis downstream of cuffdiff for cufflinks/tophat and RSEM for Trinity.

      As I mentioned, my region of interest and my model organism as a whole have poor annotation and assembly, so many genes that I find to be differentially expressed are either unannotated or only annotated as hypothetical or xenoref genes. With a few hundred to a few thousand differentially expressed genes, it is not really feasible to manually examine each one. I have seen reports by other groups that provide an Excel sheet listing every Trinity transcript with its FPKM and log change, then (importantly) the blastn results for each gene, and the blastx/tblastx results for any gene that does not map or does not have annotation.

      It seems simple to write a script to do this, except for the problem I mentioned regarding gapped alignments, but rather than re-invent the wheel I was wondering if there was an available script/program.
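The kind of report described above could be assembled roughly as follows. This is a minimal sketch, not an existing tool: the column names (`transcript_id`, `fpkm`, `log2_fold_change`), the function names, and the example values are all hypothetical, and the best hits are assumed to have already been reduced to one subject name per transcript. The output is CSV, which Excel can open.

```python
import csv
import io


FIELDS = ["transcript_id", "fpkm", "log2_fold_change",
          "blastn_hit", "blastx_hit"]


def build_report(expression_rows, blastn_hits, blastx_hits):
    """Join expression values with best BLAST hits, one row per transcript.

    blastn_hits and blastx_hits map transcript IDs to a best-hit name.
    The protein-level hit is only reported when blastn found nothing.
    """
    report = []
    for row in expression_rows:
        tid = row["transcript_id"]
        n_hit = blastn_hits.get(tid)
        x_hit = None if n_hit else blastx_hits.get(tid)
        report.append({
            "transcript_id": tid,
            "fpkm": row["fpkm"],
            "log2_fold_change": row["log2_fold_change"],
            "blastn_hit": n_hit or "",
            "blastx_hit": x_hit or "",
        })
    return report


def write_csv(report, handle):
    """Write the report rows as CSV with a header line."""
    writer = csv.DictWriter(handle, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(report)


# Made-up example data standing in for RSEM/cuffdiff output and BLAST results.
expression = [
    {"transcript_id": "comp1_c0_seq1", "fpkm": 12.5, "log2_fold_change": 2.1},
    {"transcript_id": "comp2_c0_seq1", "fpkm": 3.0, "log2_fold_change": -1.4},
    {"transcript_id": "comp3_c0_seq1", "fpkm": 0.8, "log2_fold_change": 3.3},
]
blastn = {"comp1_c0_seq1": "geneB"}
blastx = {"comp1_c0_seq1": "proteinB", "comp2_c0_seq1": "proteinX"}

report = build_report(expression, blastn, blastx)
out = io.StringIO()
write_csv(report, out)
print(out.getvalue())
```

Transcripts with no hit in either search keep empty hit columns, which makes the candidates for manual follow-up easy to filter.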



      • #4
        If I have understood you correctly, you have a file of transcript sequences and you want to annotate them. For that, I would suggest Blast2GO, which is a good platform (http://www.blast2go.com/b2ghome).



        • #5
          This looks promising, I will try using it, thanks.

          Is there anything similar that can be installed locally and run over a command line instead of a GUI?



          • #6
            Blast2GO has a pipeline option which you can run locally (http://www.blast2go.com/b2glaunch/resources).

            Another interesting package is SeqGene, which performs many common tasks in next-generation sequencing analysis. It has a single-script pipeline for a complete analysis, which works well after a little configuration.
