Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Large RNA sequences ? Does it has any sense ?

    Hi!

    First, i'm a computer scientist recently exploring bioinformatics field, so please forgive me if i say something really stupid

    Basically i'm studying the possibility of implementing Nussinov-Jacobsen algorithm on GPU's, accelerating, if possible, time performance in orders of magnitude; but to accomplish that, the RNA sequence has to be very large. I was wondering if it has some sense since i've seen most RNA seqs are about 200 bases.

    Thanks!

  • #2
    How long do you mean by 'very large'? It depends on the sequencing technology used and the length ordered. Even 200 is somewhat in the 'long' range for NGS (I believe).

    Comment


    • #3
      What kind of RNA do you mean? There are entire RNA genomes of bacteria. And normal mRNAs are some 100s to 1000s nucleotides long.
      By quickly googling Nussinov-Jacobsen I learned that you can do RNA folding prediction with it. That only makes sense for small RNAs.

      Comment


      • #4
        454 has 400+ base reads & PacBio is promising reads that long or much longer.

        Folding of longer RNAs could be interesting, as secondary structure is sometimes involved in the stability, localization or utilization of an RNA.

        It's a niche, but that doesn't mean it isn't interesting.

        Comment


        • #5
          Many RNAs are long, but the current sequencing technologies fragment them prior to sequencing, since they perform better on shorter sequences. SOLiD works up to 50 bp, Illumina works up to about 100 bp, and 454 can get a few hundred bp. If you want something larger you'll have to piece together multiple reads into a longer consensus sequence.
          It seems to me, though, that if you need a longer consensus sequence you could just use the complement of the genomic sequence (which is the RNA sequence) for some interesting genes. If your goal is to demonstrate an algorithmic speedup using a GPU-based approach it seems that it wouldn't be important to have cutting-edge RNA data, but it would be better to use a well studied RNA (like ribosomal RNA or tRNA) for your comparison.

          Comment


          • #6
            Originally posted by mrawlins View Post
            Many RNAs are long, but the current sequencing technologies fragment them prior to sequencing, since they perform better on shorter sequences. SOLiD works up to 50 bp, Illumina works up to about 100 bp, and 454 can get a few hundred bp. If you want something larger you'll have to piece together multiple reads into a longer consensus sequence.
            It seems to me, though, that if you need a longer consensus sequence you could just use the complement of the genomic sequence (which is the RNA sequence) for some interesting genes. If your goal is to demonstrate an algorithmic speedup using a GPU-based approach it seems that it wouldn't be important to have cutting-edge RNA data, but it would be better to use a well studied RNA (like ribosomal RNA or tRNA) for your comparison.
            Ok.

            And how many nucleotides can have that ribosomal or tRNA ?

            Comment


            • #7
              In Shewanella the longest ribosomal sequence is about 2900 bases long. Human ribosome sequences may be a bit longer. The tRNAs (in Shewanella) are about 76 bases long. I seem to recall that tRNAs have some of the best documented secondary structure, though, so while they may not make a good test of your algorithm's speed, they might be a good test of accuracy.

              Comment


              • #8
                Originally posted by mrawlins View Post
                In Shewanella the longest ribosomal sequence is about 2900 bases long. Human ribosome sequences may be a bit longer. The tRNAs (in Shewanella) are about 76 bases long. I seem to recall that tRNAs have some of the best documented secondary structure, though, so while they may not make a good test of your algorithm's speed, they might be a good test of accuracy.
                Thanks!

                I wonder, what about the searching for structures on an entire genome ?

                Comment


                • #9
                  Originally posted by perencia View Post
                  I wonder, what about the searching for structures on an entire genome ?
                  Only some viruses have an RNA genome (most organisms use DNA), and their genomes tend not to be very big (not big enough to worry about GPU optimisations I would guess).

                  Comment


                  • #10
                    Have you had a chance to take a look at Rfam and Sean Eddy's Infernal?

                    Comment


                    • #11
                      Originally posted by Bruins View Post
                      Have you had a chance to take a look at Rfam and Sean Eddy's Infernal?
                      No, but i'll look for them now

                      I've searching a little more, and found that report



                      and that implementation



                      Former is a GPU implementation of the Unafold Algorithm
                      ( http://mfold.bioinfo.rpi.edu/ )

                      It seems that a GPU optimisation may take room on a multiple RNA structure prediction, as in the first report ( a set of 11 Picor-
                      naviral sequences (7124 to 8214 nucleotides)).
                      I'll post what i find

                      Anyway, i'd like to known which are the benefits from such procedures, where do they impact. Bioinformatics is a large field i guess .
                      Last edited by perencia; 07-29-2010, 07:12 AM.

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Strategies for Sequencing Challenging Samples
                        by seqadmin


                        Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                        03-22-2024, 06:39 AM
                      • seqadmin
                        Techniques and Challenges in Conservation Genomics
                        by seqadmin



                        The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                        Avian Conservation
                        Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                        03-08-2024, 10:41 AM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, Yesterday, 06:37 PM
                      0 responses
                      10 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, Yesterday, 06:07 PM
                      0 responses
                      9 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 03-22-2024, 10:03 AM
                      0 responses
                      51 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 03-21-2024, 07:32 AM
                      0 responses
                      67 views
                      0 likes
                      Last Post seqadmin  
                      Working...
                      X