Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Arriba: Fast and accurate gene fusion detection from RNA-Seq data

    Dear all,

    We developed an algorithm called "Arriba" to detect gene fusions from RNA-Seq data of tumor samples. It is based on the ultrafast STAR aligner (https://github.com/alexdobin/STAR) and the post-alignment runtime is typically just ~2 minutes. Hence, fusion detection comes at virtually no cost, since the alignment of FastQ reads is a task that needs to be done anyway in a typical RNA-Seq workflow.

    But Arriba is not only fast, it is also very accurate: It is currently the best-performing algorithm in the ongoing ICGC-TCGA DREAM SMC Challenge about gene fusion algorithms (final results pending):
    Synapse is a platform for supporting scientific collaborations centered around shared biomedical data sets. Our goal is to make biomedical research more transparent, more reproducible, and more ac...


    Some more highlights:
    - ability to detect intergenic and intronic breakpoints
    - ability to detect exon duplications/inversions
    - utilization of structural variants obtained from whole-genome sequencing
    - filtering of transcript variants observed in healthy tissue
    - comprehensive manual available at http://arriba.readthedocs.io/
    - simple installation routine; especially, if you already use STAR

    We would be glad, if you could give it a try, and are happy to receive feedback!
    Please visit the homepage to download the code or in case you need help:


    Best regards,
    Sebastian

  • #2
    Hi Sebastian,

    has, or will this method be published? Would be nice. Cheers,

    P

    Comment


    • #3
      Yes, the method will certainly be published. I have just started working on the manuscript. Stay tuned ...

      Comment


      • #4
        We are happy to announce that Arriba won first place in the DREAM SMC-RNA Challenge! The final results can be viewed here (requires a free Synapse account): https://www.synapse.org/#!Synapse:sy...89/wiki/588511 As a result, Arriba will be presented at the DREAM Challenge satellite workshop of the RECOMB conference in Washington, D.C. beginning of next month.

        In addition, since our first announcement on this forum a year ago, many improvements have been made to Arriba:

        - streamlined workflow, which makes Arriba even faster and easier to implement
        - installation via Docker, Singularity, and Bioconda
        - automatic generation of publication-quality figures
        - prediction of peptide sequences and retained protein domains
        - CRAM support

        Comment


        • #5
          Version 2 of our gene fusion detection algorithm Arriba is available. It comes with a number of new features and enhancements:

          - detect viral integration sites
          - detect fusions supported by multi-mapping reads (e.g., CIC-DUX4, NPM1-ALK)
          - detect internal tandem duplications (e.g., FLT3, BCOR, ERBB2)
          - support for mouse (mm10)
          - more comprehensive annotation
          - speed improvements
          - accuracy enhancements

          As usual, the code is available on GitHub: https://github.com/suhrig/arriba/releases

          Documentation and installation instructions are available on ReadTheDocs: https://arriba.readthedocs.io/en/latest/quickstart/

          Comment


          • #6
            Hi Sebastian,

            I have recently installed Arriba v 2.1.0 and struggling with the statistics about the number of supporting reads in my pdfs that are being generated. Could you please suggest me what to do in order to get the number of split reads in gene1 and no of split reads in gene 2?

            Thanks in advance!
            AK

            Comment


            • #7
              Hi Anju,

              As discussed via mail, since Arriba version 2, the split read counts are not reported separately for gene1 and gene2 anymore in the visualization PDF. This change was made to be compatible with STAR-Fusion output, which does not report the numbers separately for gene1 and gene2.

              If you have further questions, feel free to reach out to me via mail or the issue tracker on GitHub: https://github.com/suhrig/arriba/issues

              Kind regards,
              Sebastian

              Comment


              • #8
                Hi Sebastian,

                Thank you for your response. I wasn't sure whether you would reply to my email. Hencewhy I tried on seqanswers. I have send the job for running and it still in the queue. I will let you know when its done on the email.

                Thanks again!

                Best regards,
                Anju

                Comment


                • #9
                  We are proud to announce that our manuscript about Arriba has been published in this month's issue of the Genome Research journal. From now on, please cite the following article if you use Arriba for published research:

                  Sebastian Uhrig, Julia Ellermann, Tatjana Walther, Pauline Burkhardt, Martina Fröhlich, Barbara Hutter, Umut H. Toprak, Olaf Neumann, Albrecht Stenzinger, Claudia Scholl, Stefan Fröhling and Benedikt Brors: Accurate and efficient detection of gene fusions from RNA sequencing data. Genome Research. March 2021 31: 448-460; Published in Advance January 13, 2021. doi: 10.1101/gr.257246.119

                  Comment


                  • #10
                    After almost a year of further development of enhancements, new features, and bug fixes, the next version of our gene fusion detection tool Arriba is finally out (version 2.2.0). The code and user manual are available on Github: https://github.com/suhrig/arriba/

                    The most notable enhancements are:

                    improved detection of viruses and viral integration sites
                    improved detection of internal tandem duplications
                    support for mm39/GRCm39
                    utility scripts which facilitate common tasks related to fusion detection
                    polishing of fusion visualizations

                    More details can be found in the release notes: https://github.com/suhrig/arriba/releases

                    Comment


                    • #11
                      Hi uhrigs, thanks for sharing updates on you fusion detection tool here. I was wondering if there is any official nomenclature regarding gene fusions. I am only aware of this: https://www.nature.com/articles/s41375-021-01436-6 , but this is only on gene level like EML4::ALK. Intuitively it is also quite clear what EML4-exon13::ALK-exon20 means. But what would such a description technically mean? "BAG4-intron 1::FGFR1-intron 1" Would be really nice to agree on a nomenclature here and have a proper definition for it...

                      Comment


                      • #12
                        Hi JonasBehr

                        Standardization of the nomenclature is in preparation. The Variant Interpretation for Cancer Consortium (VICC) is working on it. You can find links to related resources and discussions here: https://cancervariants.org/projects/fusions/

                        Regards,
                        Sebastian

                        Comment

                        Latest Articles

                        Collapse

                        • seqadmin
                          Advancing Precision Medicine for Rare Diseases in Children
                          by seqadmin




                          Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
                          12-16-2024, 07:57 AM
                        • seqadmin
                          Recent Advances in Sequencing Technologies
                          by seqadmin



                          Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

                          Long-Read Sequencing
                          Long-read sequencing has seen remarkable advancements,...
                          12-02-2024, 01:49 PM

                        ad_right_rmr

                        Collapse

                        News

                        Collapse

                        Topics Statistics Last Post
                        Started by seqadmin, 12-17-2024, 10:28 AM
                        0 responses
                        33 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 12-13-2024, 08:24 AM
                        0 responses
                        49 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 12-12-2024, 07:41 AM
                        0 responses
                        34 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 12-11-2024, 07:45 AM
                        0 responses
                        46 views
                        0 likes
                        Last Post seqadmin  
                        Working...
                        X