Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Multi-mapped reads in RNAseq experiment

    Dear all,

    I am currently working with RNAseq data obtained from different strains of Plasmodium parasite. We are mainly interested in differential expression between the strains. After quality trimming of the reads, I planned to align reads on Plasmodium genome with TopHat2 or HISAT2 (still not yet decided, I am comparing both), and to use HTseq-count for counting.

    Alignment works quite correctly, but I have a lot of reads with multiple alignments (> 40% with MAPQ < 10) and I am not sure to really understand how to handle them for the downstream analysis. Based on what I read, I first understood that for differential expression analysis, it would be better to keep multi-mapped reads for counting, but I found that multireads are not counted with HTseq-count. Do I have to force HTseq-count to count them (remove NH optional flag in .bam file)? Is there any way to reduce the levels of multi-mapped reads?

    Thank you very much for any help.

  • #2
    Were these libraries ribo-depleted? If not the multi-mapping reads could be from rRNA.

    Comment


    • #3
      Thank you for your answer.

      Libraries were subjected to polyA selection, so there should not be so much remaining rRNA inside (normally).

      Comment


      • #4
        Originally posted by nmerienn View Post
        Thank you for your answer.

        Libraries were subjected to polyA selection, so there should not be so much remaining rRNA inside (normally).
        One would think so but that may always not be the case. You should check what the multi-mapping reads are? Ideally aligning against rDNA repeat (from your species) would give you an exact idea of how much rRNA had remained.

        Comment


        • #5
          Thank you for your advice. I will align them on rRNA sequences to check if this is remaining rRNA.

          But in general, I think there is always some multi-mapped reads in RNAseq data. What is the consensus concerning them for counting, is it better to keep them or remove before counting?

          Thanks

          Comment


          • #6
            See this article for some pointers.

            You could be strict and drop them altogether, allow the aligner to randomly pick one spot (out of many where read aligns well) or allow the reads to map in every location that they map equally well to. All those options have some consequences which you would need to weigh. There are newer methods like Salmon which consider read distribution to assign the reads, which may be important if you are looking for transcripts.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM
            • seqadmin
              Strategies for Sequencing Challenging Samples
              by seqadmin


              Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
              03-22-2024, 06:39 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            18 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            22 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 09:21 AM
            0 responses
            16 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-04-2024, 09:00 AM
            0 responses
            46 views
            0 likes
            Last Post seqadmin  
            Working...
            X