Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Gene-level Quantification

    Currently I still use htseq-count to perform gene-level quantification (and RSEM for transcript-level by the way), but I found htseq-count being overly cumbersome to use, requiring re-sorting alignments and even the script per se takes extremely long to run.

    Right now I'm considering faster alternatives such as Kallisto or Salmon. So what are your experience
    • using it for gene-level differential expression analyses such as edgeR or deSeq2? I understand these programs generate transcript-level output, but they can easily be aggregated. I also wonder how STAR's gene-count output file (ReadsPerGene.out.tab) compare to these.
    • For Salmon users: I'd still have to generate an alignment anyway for sequence analyses. Have you noticed any difference between quasi-mapping and alignment-based modes?


    Thanks for your input in advance!

  • #2
    I can't comment on using Kallisto or Salmon, but STAR's gene counts seem to perfectly match the three count modes of htseq-count, so long as you used the same gtf when indexing the genome.

    Comment


    • #3
      FYI, featureCounts is a faster drop-in replacement for htseq-count, so you can also just use that instead.

      The counts from STAR or featureCounts are pretty similar to what you'll get from salmon. Below is a scatter plot that one of our post-docs made a while back. I asked him to look into the genes that were highly discordant between the two, but I don't recall what he found (or if he looked).



      Regarding feeding BAM files into Salmon, this seems to work nicely. Rob has usually suggested that this leads to a bit better results, so if you can spare the memory and a bit of time to run STAR first then that's the way to go.

      BTW, for getting counts for edgeR or DESeq2, have a look at the tximport package in Bioconductor.

      Comment


      • #4
        Originally posted by dpryan View Post
        FYI, featureCounts is a faster drop-in replacement for htseq-count, so you can also just use that instead.
        Well, as I mentioned I'd still need to do STAR alignments for variant calling, so if it gives a sufficiently accurate gene count, probably I'd go for it.

        But as transcript-level counting is concerned... It's a different story, as we all know.

        Comment


        • #5
          In terms of aligner choice for mapping RNA-seq data, you may take a look at this paper that just came out this week:

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Current Approaches to Protein Sequencing
            by seqadmin


            Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
            04-04-2024, 04:25 PM
          • seqadmin
            Strategies for Sequencing Challenging Samples
            by seqadmin


            Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
            03-22-2024, 06:39 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 04-11-2024, 12:08 PM
          0 responses
          25 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 10:19 PM
          0 responses
          29 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 09:21 AM
          0 responses
          24 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-04-2024, 09:00 AM
          0 responses
          52 views
          0 likes
          Last Post seqadmin  
          Working...
          X