Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Re-Annotate legacy gene predictions

    Hello everyone,

    I have a number of single-ended samples generated by Illumina-based Hsq2000. About 10 to 35 million 58-bp. A legacy annotation of an organism "genemark predictions". What are my option to redo the annotations using the short read files I have.

    Any tools, steps , suggestions, examples are really appreciated

    Thank you!

  • #2
    Either de-novo assemble all SE HiSeq reads and predict genes on the assembled contigs, or map those reads against the current annotation and identify where improvements can be made

    Denovo: Velvet, ALLPATHS, SOAPdenovo
    Mapping: BFAST, Bowtie(2), BWA
    Gene Prediction: FGENESH, GenMark, Genscan, Glimmer ....
    Last edited by jimmybee; 04-22-2013, 02:50 PM.

    Comment


    • #3
      Thanks Jimmy,
      I am not quite sure I understand exactly how to do your second suggestion. Because I have already done the alignment to the genome and that is what caused my question.

      Here is how this whole re-annotation idea came up:
      I aligned the SE reads (different time points of fruit development and different tissues) to the reference genome. Then viewed the (Genome + Old predicted genes + Alignment results "BAM") using GBrowse. At this point we noticed that the reads are not always aligning perfectly to a number of the genes.

      Comment


      • #4
        I would highly suggest a two pronged strategy for use inside maker.

        1) Use tophat and cufflinks RABT annotations to do transcriptome assembly on the genome.
        2) Use trinity to de novo assemble the reads into transcripts.

        Then reannotate your genome inside maker. You will be able to pass the legacy annotation through, along with refseq alignments from other species or a variety of other lines of evidence along with your de novo and reference based transcriptome assembly.

        Finally, update your maker annotations with PASA using your de novo assembled transcripts.

        Comment


        • #5
          Thanks a lot Wally
          Sounds interesting, and a lot of work. I have never used any of the tools you suggested and excited to do so. Do you know of any links or documents and that list these steps with more details (not the manual of each), as I am no expert and need as much data as possible about this pipeline.

          Have a good weekend

          Comment


          • #6
            Originally posted by Amative View Post
            Thanks a lot Wally
            Sounds interesting, and a lot of work. I have never used any of the tools you suggested and excited to do so. Do you know of any links or documents and that list these steps with more details (not the manual of each), as I am no expert and need as much data as possible about this pipeline.

            Have a good weekend
            Yeah, I did something similar for this paper: http://www.biomedcentral.com/1471-2164/14/49

            I did not feed legacy annotations to Maker though. Instead I merged Ensembl and NCBI annotations in EVM then fed the merged annotations to Maker.

            If I were to do it again though, I'd probably have just skipped EVM and fed both Ensembl and NCBI into Maker.

            Comment


            • #7
              Excellent, Thanks Wally.
              I will definitely take a look at it, and hopefully I can do something similar.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Strategies for Sequencing Challenging Samples
                by seqadmin


                Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                03-22-2024, 06:39 AM
              • seqadmin
                Techniques and Challenges in Conservation Genomics
                by seqadmin



                The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                Avian Conservation
                Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                03-08-2024, 10:41 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 03-27-2024, 06:37 PM
              0 responses
              12 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 03-27-2024, 06:07 PM
              0 responses
              11 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 03-22-2024, 10:03 AM
              0 responses
              53 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 03-21-2024, 07:32 AM
              0 responses
              69 views
              0 likes
              Last Post seqadmin  
              Working...
              X