
  • Assembly of Large Genomes using Cloud Computing by Contrail

    I have found new software for large-genome assembly (http://sourceforge.net/projects/contrail-bio/); the source code is not available yet. It relies on Hadoop to iteratively transform an on-disk representation of the assembly graph, which reduces the memory requirement and allows in-depth analysis even of large genomes. Contrail also uses a de Bruijn graph strategy for short-read assembly.
    For more, see the wiki: http://sourceforge.net/apps/mediawik...title=Contrail
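
As a rough illustration of the de Bruijn graph idea mentioned above, here is a minimal Python sketch written in a map/reduce style. It is not Contrail's code (which was not available when this was posted) and does not use Hadoop; the function names `map_read`, `reduce_edges`, and `build_graph`, the toy reads, and the choice of k are all assumptions made for the example. Each k-mer in a read becomes an edge from its (k-1)-base prefix to its (k-1)-base suffix, and the reduce step groups edges by source node.

```python
from collections import defaultdict


def map_read(read, k):
    """Map step: emit one (prefix, suffix) edge per k-mer in the read."""
    for i in range(len(read) - k + 1):
        kmer = read[i:i + k]
        # Nodes are (k-1)-mers; the k-mer itself is the edge between them.
        yield kmer[:-1], kmer[1:]


def reduce_edges(pairs):
    """Reduce step: group successor nodes by their source node."""
    graph = defaultdict(set)
    for src, dst in pairs:
        graph[src].add(dst)
    return {node: sorted(successors) for node, successors in graph.items()}


def build_graph(reads, k):
    """Run the map step over every read, then reduce all emitted edges."""
    pairs = (edge for read in reads for edge in map_read(read, k))
    return reduce_edges(pairs)


# Hypothetical toy reads, chosen only to show a branching node.
reads = ["ACGTAC", "GTACGG"]
graph = build_graph(reads, k=4)
for node in sorted(graph):
    print(node, "->", ", ".join(graph[node]))
```

In a real assembler the graph would be far too large for one in-memory dictionary, which is presumably why an on-disk, distributed representation is attractive; this sketch only shows the graph structure itself.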

  • #2
    Does anyone know when the first release of Contrail is scheduled?

    The quality of software coming from the Salzberg and Pop labs has been very high. Despite the inelegant name, I am really looking forward to seeing how Contrail compares with Velvet, ABySS, SOAPdenovo, etc.
    Last edited by Zigster; 01-11-2010, 10:34 AM.
    --
    Jeremy Leipzig
    Bioinformatics Programmer


    • #3
      I am also wondering whether this assembler is written entirely in Java. Isn't that a Hadoop requirement?


      • #4
        I am also looking forward to the first release of Contrail.


        • #5
          The source code has been released but it does not look like one could just run it. No documentation, no hints. Any word on when a usable version might be available?



          • #6
            Originally posted by jjv5:
            The source code has been released but it does not look like one could just run it. No documentation, no hints. Any word on when a usable version might be available?
            Meanwhile ->

            Michael Schatz (Cold Spring Harbor Laboratory)
            "Cloud Computing and the DNA Data Race: Theory and Practice."



            • #7
              Has anyone gotten this up and running? Does it read raw sequence files? So many questions!
              Petri Dish Talk



              • #8
                Hi all,

                I managed to get Contrail up and running.

                Here is how to run the program on the test case provided by Schatz:

                http://homolog.us



                • #9
                  Such an amazing post! This is an interesting thread. I wonder how effective this cloud computing is.
                  --
                  "Defect-free software does not exist."
                  ~ Wietse Venema ~
                  Hosting Dallas



                  • #10
                    Which one are you asking about: cloud computing in general, or Hadoop?

                    Cloud computing basically lets you rent computer time from a company that manages the hardware. To a user, it is no different from using a local supercomputing facility at a university or elsewhere.

                    Hadoop is a different paradigm and has been useful for large data sets. You can even set it up locally; you do not need cloud computing for it. I have a few posts on Hadoop for bioinformatics:

                    http://homolog.us
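
To make the "different paradigm" point concrete, here is a small Python sketch of the map, shuffle/sort, and reduce pattern, run entirely in one local process. It only mimics the tab-separated key/value lines that Hadoop Streaming passes between a mapper and a reducer; it does not call Hadoop, and the function names `mapper`, `reducer`, and `run_local`, the toy reads, and the k-mer counting task are assumptions chosen for illustration.

```python
from itertools import groupby


def mapper(lines, k):
    """Map step: emit a 'kmer<TAB>1' line for every k-mer in every read."""
    for line in lines:
        seq = line.strip().upper()
        for i in range(len(seq) - k + 1):
            yield f"{seq[i:i + k]}\t1"


def reducer(sorted_lines):
    """Reduce step: sum the counts of consecutive lines sharing a key.

    Assumes the input is already sorted by key, as it would be after the
    framework's shuffle/sort phase.
    """
    keyed = (line.split("\t") for line in sorted_lines)
    for kmer, group in groupby(keyed, key=lambda kv: kv[0]):
        yield kmer, sum(int(count) for _, count in group)


def run_local(lines, k):
    """Chain the steps in-process; sorted() stands in for shuffle/sort."""
    shuffled = sorted(mapper(lines, k))
    return dict(reducer(shuffled))


# Hypothetical toy reads; real input would be a large file of sequences.
counts = run_local(["ACGTAC", "GTACGG"], k=3)
for kmer in sorted(counts):
    print(kmer, counts[kmer])
```

Because the mapper and reducer only see independent records and sorted key groups, the same logic can in principle be spread over many machines, whether they are rented from a cloud provider or sitting in a local cluster.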

