Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Multi-mapped reads in RNAseq experiment

    Dear all,

    I am currently working with RNAseq data obtained from different strains of Plasmodium parasite. We are mainly interested in differential expression between the strains. After quality trimming of the reads, I planned to align reads on Plasmodium genome with TopHat2 or HISAT2 (still not yet decided, I am comparing both), and to use HTseq-count for counting.

    Alignment works quite correctly, but I have a lot of reads with multiple alignments (> 40% with MAPQ < 10) and I am not sure to really understand how to handle them for the downstream analysis. Based on what I read, I first understood that for differential expression analysis, it would be better to keep multi-mapped reads for counting, but I found that multireads are not counted with HTseq-count. Do I have to force HTseq-count to count them (remove NH optional flag in .bam file)? Is there any way to reduce the levels of multi-mapped reads?

    Thank you very much for any help.

  • #2
    Were these libraries ribo-depleted? If not the multi-mapping reads could be from rRNA.

    Comment


    • #3
      Thank you for your answer.

      Libraries were subjected to polyA selection, so there should not be so much remaining rRNA inside (normally).

      Comment


      • #4
        Originally posted by nmerienn View Post
        Thank you for your answer.

        Libraries were subjected to polyA selection, so there should not be so much remaining rRNA inside (normally).
        One would think so but that may always not be the case. You should check what the multi-mapping reads are? Ideally aligning against rDNA repeat (from your species) would give you an exact idea of how much rRNA had remained.

        Comment


        • #5
          Thank you for your advice. I will align them on rRNA sequences to check if this is remaining rRNA.

          But in general, I think there is always some multi-mapped reads in RNAseq data. What is the consensus concerning them for counting, is it better to keep them or remove before counting?

          Thanks

          Comment


          • #6
            See this article for some pointers.

            You could be strict and drop them altogether, allow the aligner to randomly pick one spot (out of many where read aligns well) or allow the reads to map in every location that they map equally well to. All those options have some consequences which you would need to weigh. There are newer methods like Salmon which consider read distribution to assign the reads, which may be important if you are looking for transcripts.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Essential Discoveries and Tools in Epitranscriptomics
              by seqadmin


              The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
              Today, 07:01 AM
            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            37 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            41 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 09:21 AM
            0 responses
            35 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-04-2024, 09:00 AM
            0 responses
            54 views
            0 likes
            Last Post seqadmin  
            Working...
            X