Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • RSEM with HISAT2

    Hello guys,

    I have RNA-sequencing data of around 250 patients with leukaemia.
    I have built up a basic pipeline using HISAT2 as my aligner with satisfying results. I want to test differential transcript expression between my samples using RSEM, but I can't get it to work with HISAT2.

    I am running the tools on the GALAXY platform and using the instructions provided I used RSEM prepare reference to create the reference files to provide to the aligner. My outputs are the following:

    rsem ref name.log
    rsem ref name.grp
    rsem ref name.ti
    rsem ref name.chrlist
    rsem ref name.transcripts.fa
    rsem ref name.seq
    rsem ref name.idx.fa
    rsem_ref name.3.ebwt
    rsem_ref name.4.ebwt
    rsem_ref name.1.ebwt
    rsem_ref name.2.ebwt
    rsem_ref name.rev.1.ebwt
    rsem_ref name.rev.2.ebwt

    From what I gather from the RSEM Readme I now have to align my reads using rsem ref name.idx.fa as a reference file. Trying this I get the following error:

    (ERR): hisat2-align died with signal 11 (SEGV)
    [W::sam_read1] parse error at line 43388
    [main_samview] truncated file.

    Does anyone have experience using HISAT2 with the RSEM reference file?
    It seems to me, that the prepared reference only works with TopHat2 since it creates BowTie index files. I would appreciate any response!

    Thanks!
    Sbamo

  • #2
    You can't use those files with HISAT2. You'll need to reindex either name.idx.fa or name.seq (I assume it's a fasta file) and use the resulting .ht2 files.

    Comment


    • #3
      dpryan, thank you for your response! Since I am new to Bioinformatics is there any tool or method you can suggest, in order to convert those files to .ht2-files?

      Comment


      • #4
        There are no conversion programs. Just delete them and build the index with hisat2-build.

        Comment


        • #5
          Hi, thanks again for your reply.
          It was possible to reindex the "rsem ref name.idx.fa" file (trying to reindex "rsem ref name.seq" threw an error). Unfortunately when using it as a reference HISAT2 presented the following problem:

          [E::sam_parse1] CIGAR and query sequence are of different length
          [W::sam_read1] parse error at line 42905944
          [main_samview] truncated file.
          Error while flushing and closing output
          terminate called after throwing an instance of 'int'
          (ERR): hisat2-align died with signal 6 (ABRT)
          [bam_sort_core] merging from 19 files...

          I looked up this part: "Error while flushing and closing output
          terminate called after throwing an instance of 'int'"
          and it seems this is a common problem with the Bowtie/TopHat2 aligners. Sadly, none of the fixes suggested worked and there was no post about this issue in HISAT2.

          Please note that the GTF-file and Fasta-file used to create the RSEM reference have been tested and HISAT2 can succesfully run through using them prior to RSEM-conversion (They are Hg19 references).

          As always any help is appreciated!

          Comment


          • #6
            Can't RSEM just be fed a BAM file? That'd be easier. Anyway, I'd personally just use salmon or kallisto and be done in a fraction of the time

            Comment


            • #7
              RSEM cannot handle alignments with gaps in them; only matches / mis-matches. I would suggest that you use the alignment-based mode of salmon if you'd still like to use the HISAT alignments downstream. Otherwise, you could try using the quasi-mapping-based mode of salmon or sailfish. These programs can produce accurate quantification estimates very quickly without the need to first perform traditional alignment of the reads. Full disclosure: I am the main developer of both of these tools .

              Comment


              • #8
                Robp, thank you for your reply! I may try them out, since for my analysis gapped alignment is mandatory!

                Comment


                • #9
                  P.S. dpryan, RSEM can be fed an BAM-file directly, but I had no luck doing this either. Probably due to the fact that it was created using gapped alignment. Thanks for your quick replies!

                  Comment

                  Latest Articles

                  Collapse

                  • seqadmin
                    Strategies for Sequencing Challenging Samples
                    by seqadmin


                    Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                    03-22-2024, 06:39 AM
                  • seqadmin
                    Techniques and Challenges in Conservation Genomics
                    by seqadmin



                    The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                    Avian Conservation
                    Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                    03-08-2024, 10:41 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by seqadmin, Yesterday, 06:37 PM
                  0 responses
                  10 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, Yesterday, 06:07 PM
                  0 responses
                  9 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 03-22-2024, 10:03 AM
                  0 responses
                  49 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 03-21-2024, 07:32 AM
                  0 responses
                  67 views
                  0 likes
                  Last Post seqadmin  
                  Working...
                  X