Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Obtaining ORFs from DNA sequence

    Hi Everyone,

    I have small question in regards to obtaining ORFs in DNA sequences. I know that EMBOSS tools have a collection of awesome tools to do this specifically.

    I have used six-pack from EMBOSS. Sixpack is able to find the ORFs and then translate them across all the 6 frames. Strangely it gives me a smaller amount of ORFs compared to the rest.

    Also there is Transeq, but that translates the whole sequence, which should be better suited for transcript sequences as I am working with Eukaryotic organisms.

    Then there is getORF, which finds the ORFs and outputs them.

    What is different between getORF and Sixpack in the EMBOSS tool set?

    My question is what is the difference between ORFget and Sixpack?

    Also I found this awesome perl script: http://navjeet-ahalawat.blogspot.com...n-reading.html

    ORFget, Sixpack and the Perl Script give me three different results. Why is that? I was expecting them to be the same.

    Also I was wondering is there a method to obtain or extract all of the ORF nucleotide sequence in fasta format for each of ORFs with the proper FASTA header between the protein and nucleotide sequences.

    Many thanks,

    Zapages

  • #2
    Rather than a plain ORF finder you should be looking at a gene prediction program if you are working with an eukaryote genome (https://en.wikipedia.org/wiki/List_o...ction_software).

    Comment


    • #3
      Hi Genomax,

      Thank you Genomax. I'm definitely using gene prediction programs as well. I'm trying to build confidence on newly denovo/ NCBI assembled sequences for a specific group of gemes of interest.

      I'm really interested in the orfs and the protein sequences due to variance in protein, PSI, and nucleotide BLASTs.

      Comment


      • #4
        Hi Everyone,

        Sorry for bumping the thread. I am just wondering what is the difference between getorf and six-pack from the EMBOSS tool set.

        Why am I getting different results among the two?

        Also I could use EMBOSS backtrans to get the transcript sequence from the translated protein sequence.

        Many thanks in advance.

        -Zapages

        Comment


        • #5
          Originally posted by GenoMax View Post
          Rather than a plain ORF finder you should be looking at a gene prediction program if you are working with an eukaryote genome (https://en.wikipedia.org/wiki/List_o...ction_software).
          Hi GenoMax,

          I am new to eukaryotic genomes and trying to get an idea of the "best" workflow for translating DNA sequences into predicted proteins. I came across this post and your recommendation to not use a plain ORF finder, rather use a gene prediction program. Is this because of the introns/exons and large number of repeats?

          Thanks

          Comment


          • #6
            More or less. Since you need to account for possibility of splicing you can't just use start/stops. That said what are you trying to get the proteins from? Are you working with de novo DNA sequence or transcriptome sequence.

            Comment


            • #7
              Hi Genomax,

              Thanks. It is de novo DNA sequence. 2 x 150 read, assembled using Megahit.

              Arvind

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Recent Advances in Sequencing Analysis Tools
                by seqadmin


                The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
                Today, 07:48 AM
              • seqadmin
                Essential Discoveries and Tools in Epitranscriptomics
                by seqadmin




                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                04-22-2024, 07:01 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, Today, 07:17 AM
              0 responses
              11 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 05-02-2024, 08:06 AM
              0 responses
              19 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-30-2024, 12:17 PM
              0 responses
              20 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-29-2024, 10:49 AM
              0 responses
              28 views
              0 likes
              Last Post seqadmin  
              Working...
              X