  • Multi-mapped reads in RNAseq experiment

    Dear all,

    I am currently working with RNAseq data obtained from different strains of the Plasmodium parasite. We are mainly interested in differential expression between the strains. After quality trimming, I plan to align the reads to the Plasmodium genome with TopHat2 or HISAT2 (not yet decided; I am comparing both) and to use htseq-count for counting.

    Alignment works reasonably well, but a lot of reads have multiple alignments (> 40% with MAPQ < 10), and I am not sure I really understand how to handle them in the downstream analysis. From what I have read, I first understood that for differential expression analysis it would be better to keep multi-mapped reads for counting, but I found that htseq-count does not count multireads. Do I have to force htseq-count to count them (by removing the NH optional tag in the .bam file)? Is there any way to reduce the level of multi-mapped reads?

    Thank you very much for any help.
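
To make the question concrete, here is a minimal sketch of how reads could be tallied as unique or multi-mapped from SAM text, using the NH tag (number of reported alignments) and the unmapped bit (0x4) of the FLAG field. The function names and the example records are made up for illustration, and the sketch assumes the aligner writes NH; in practice one would more likely read the .bam with samtools or pysam.

```python
from collections import Counter


def nh_value(fields):
    """Return the NH:i value from a split SAM record, or None if absent."""
    for tag in fields[11:]:  # optional fields start at column 12
        if tag.startswith("NH:i:"):
            return int(tag[len("NH:i:"):])
    return None


def tally_reads(sam_lines):
    """Classify each read name once as unique, multi, unmapped or no_NH_tag."""
    status = {}
    for line in sam_lines:
        if not line.strip() or line.startswith("@"):
            continue  # skip blank lines and header lines
        fields = line.rstrip("\n").split("\t")
        name, flag = fields[0], int(fields[1])
        if flag & 0x4:  # 0x4 = read unmapped
            status[name] = "unmapped"
            continue
        nh = nh_value(fields)
        if nh is None:
            status[name] = "no_NH_tag"
        elif nh > 1:
            status[name] = "multi"
        else:
            status[name] = "unique"
    return Counter(status.values())


# Hypothetical records: r1 maps once, r2 maps to two places, r3 is unmapped.
EXAMPLE_SAM = [
    "@HD\tVN:1.6",
    "r1\t0\tchr1\t100\t50\t4M\t*\t0\t0\tACGT\tIIII\tNH:i:1",
    "r2\t0\tchr1\t200\t1\t4M\t*\t0\t0\tACGT\tIIII\tNH:i:2",
    "r2\t256\tchr2\t300\t1\t4M\t*\t0\t0\tACGT\tIIII\tNH:i:2",
    "r3\t4\t*\t0\t0\t*\t*\t0\t0\tACGT\tIIII",
]

summary = tally_reads(EXAMPLE_SAM)
print(dict(summary))
```

Reads are keyed by name, so the two records of r2 count as one multi-mapped read rather than two.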

  • #2
    Were these libraries ribo-depleted? If not, the multi-mapping reads could be from rRNA.

    • #3
      Thank you for your answer.

      Libraries were subjected to polyA selection, so there should not be so much remaining rRNA inside (normally).

      • #4
        Originally posted by nmerienn:
        Thank you for your answer.

        Libraries were subjected to polyA selection, so there should not be so much remaining rRNA inside (normally).

        One would think so, but that may not always be the case. You should check what the multi-mapping reads are. Ideally, aligning them against the rDNA repeat from your species would give you an exact idea of how much rRNA remained.
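
As a rough sketch of that check: once the reads have also been aligned against an rRNA-only reference, the share of multi-mapped reads that hit rRNA can be computed by comparing read names. The function names, read names and records below are hypothetical; the only SAM detail relied on is that FLAG bit 0x4 marks an unmapped read.

```python
def mapped_read_names(sam_lines):
    """Return the names of reads with at least one mapped record."""
    names = set()
    for line in sam_lines:
        if not line.strip() or line.startswith("@"):
            continue  # skip blank lines and header lines
        fields = line.rstrip("\n").split("\t")
        flag = int(fields[1])
        if not (flag & 0x4):  # 0x4 = read unmapped
            names.add(fields[0])
    return names


def rrna_fraction(multi_names, rrna_sam_lines):
    """Fraction of multi-mapped reads that also align to the rRNA reference."""
    multi_names = set(multi_names)
    if not multi_names:
        return 0.0
    rrna_hits = mapped_read_names(rrna_sam_lines)
    return len(multi_names & rrna_hits) / len(multi_names)


# Hypothetical input: four reads that were multi-mapped on the genome ...
multi_mapped = {"r2", "r5", "r7", "r9"}

# ... and a made-up alignment of the same library against rRNA sequences.
rrna_sam = [
    "r1\t0\trRNA_28S\t10\t60\t4M\t*\t0\t0\tACGT\tIIII",
    "r2\t0\trRNA_28S\t20\t60\t4M\t*\t0\t0\tACGT\tIIII",
    "r5\t16\trRNA_18S\t30\t60\t4M\t*\t0\t0\tACGT\tIIII",
    "r7\t4\t*\t0\t0\t*\t*\t0\t0\tACGT\tIIII",
]

fraction = rrna_fraction(multi_mapped, rrna_sam)
print(f"{fraction:.0%} of multi-mapped reads align to rRNA")
```

A high fraction would point to leftover rRNA; a low one would suggest the multi-mapping comes from somewhere else in the genome.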

        • #5
          Thank you for your advice. I will align them to rRNA sequences to check whether this is leftover rRNA.

          But in general, I think there are always some multi-mapped reads in RNAseq data. What is the consensus on counting them: is it better to keep them or to remove them before counting?

          Thanks

          • #6
            See this article for some pointers.

            You could be strict and drop them altogether, let the aligner randomly pick one location (out of the many where the read aligns well), or allow the reads to map to every location where they map equally well. Each of these options has consequences that you would need to weigh. There are newer methods, such as Salmon, that consider the read distribution when assigning reads, which may be important if you are looking at transcripts.
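
To illustrate how the first three options change the counts, here is a toy sketch that assumes each read has already been reduced to the list of genes its alignments overlap. The function names, gene names and data are made up, and this is not how any particular aligner or counting tool implements these strategies. Salmon's model-based assignment is not reproduced here.

```python
import random
from collections import Counter


def count_drop_multi(alignments):
    """Strict: count only reads with exactly one alignment."""
    counts = Counter()
    for genes in alignments.values():
        if len(genes) == 1:
            counts[genes[0]] += 1
    return counts


def count_random_one(alignments, seed=0):
    """Pick one alignment at random for every read (seeded for repeatability)."""
    rng = random.Random(seed)
    counts = Counter()
    for genes in alignments.values():
        if genes:
            counts[rng.choice(genes)] += 1
    return counts


def count_all_locations(alignments):
    """Count a read once at every location it aligns to."""
    counts = Counter()
    for genes in alignments.values():
        for gene in genes:
            counts[gene] += 1
    return counts


# Hypothetical data: r1 and r2 are unique, r3 and r4 hit two paralogous genes.
example = {
    "r1": ["geneA"],
    "r2": ["geneA"],
    "r3": ["geneB", "geneC"],
    "r4": ["geneB", "geneC"],
}

print("drop multi  :", dict(count_drop_multi(example)))
print("random one  :", dict(count_random_one(example)))
print("all location:", dict(count_all_locations(example)))
```

Dropping multireads leaves geneB and geneC with no counts at all, random assignment keeps the total at one count per read, and counting every location inflates the total above the number of reads.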
