Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • nmerienn
    Member
    • Sep 2014
    • 12

    Multi-mapped reads in RNAseq experiment

    Dear all,

    I am currently working with RNAseq data obtained from different strains of Plasmodium parasite. We are mainly interested in differential expression between the strains. After quality trimming of the reads, I planned to align reads on Plasmodium genome with TopHat2 or HISAT2 (still not yet decided, I am comparing both), and to use HTseq-count for counting.

    Alignment works quite correctly, but I have a lot of reads with multiple alignments (> 40% with MAPQ < 10) and I am not sure to really understand how to handle them for the downstream analysis. Based on what I read, I first understood that for differential expression analysis, it would be better to keep multi-mapped reads for counting, but I found that multireads are not counted with HTseq-count. Do I have to force HTseq-count to count them (remove NH optional flag in .bam file)? Is there any way to reduce the levels of multi-mapped reads?

    Thank you very much for any help.
  • GenoMax
    Senior Member
    • Feb 2008
    • 7142

    #2
    Were these libraries ribo-depleted? If not the multi-mapping reads could be from rRNA.

    Comment

    • nmerienn
      Member
      • Sep 2014
      • 12

      #3
      Thank you for your answer.

      Libraries were subjected to polyA selection, so there should not be so much remaining rRNA inside (normally).

      Comment

      • GenoMax
        Senior Member
        • Feb 2008
        • 7142

        #4
        Originally posted by nmerienn View Post
        Thank you for your answer.

        Libraries were subjected to polyA selection, so there should not be so much remaining rRNA inside (normally).
        One would think so but that may always not be the case. You should check what the multi-mapping reads are? Ideally aligning against rDNA repeat (from your species) would give you an exact idea of how much rRNA had remained.

        Comment

        • nmerienn
          Member
          • Sep 2014
          • 12

          #5
          Thank you for your advice. I will align them on rRNA sequences to check if this is remaining rRNA.

          But in general, I think there is always some multi-mapped reads in RNAseq data. What is the consensus concerning them for counting, is it better to keep them or remove before counting?

          Thanks

          Comment

          • GenoMax
            Senior Member
            • Feb 2008
            • 7142

            #6
            See this article for some pointers.

            You could be strict and drop them altogether, allow the aligner to randomly pick one spot (out of many where read aligns well) or allow the reads to map in every location that they map equally well to. All those options have some consequences which you would need to weigh. There are newer methods like Salmon which consider read distribution to assign the reads, which may be important if you are looking for transcripts.

            Comment

            Latest Articles

            Collapse

            • GATTACAT
              Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
              by GATTACAT
              Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
              07-01-2026, 11:43 AM
            • SEQadmin2
              Nine Things a Sample Prep Scientist Thinks About Before Sequencing
              by SEQadmin2


              I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

              Here are nine questions we think about, in roughly the order they matter, before...
              06-18-2026, 07:11 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by SEQadmin2, Yesterday, 11:08 AM
            0 responses
            7 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-30-2026, 05:37 AM
            0 responses
            11 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-26-2026, 11:10 AM
            0 responses
            19 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-17-2026, 06:09 AM
            0 responses
            53 views
            0 reactions
            Last Post SEQadmin2  
            Working...