Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Mnase cause high multimapper fraction?

    Hello Community.

    I am working on histone modifications in zebrafish.
    I recently ran several samples for a histone methylation ChIP after MNase digestion.

    Now I run into the problem of having 60-80% of Multimappers in my mappable reads.
    Does anyone have the same problems or knows why that could happen?

    Some more specifications:
    I have "produced" this problem in 10 independent samples, plus in 4 Input samples (meaning that it is not the IP that enriches for Multimappers).

    Digestion by MNase was performed to have 80% Mono-and Dinucleosomes, but no over digestion (fragments smaller than 150bp)

    FastQC gives OK quality. There are some ambiguities with Kmer content, but other than that quality seems ok.

    I sequenced around 50Mio reads, from which 70-80% were mappable. However, from these, 60-80% were Multimappers. When analysing the Multimappers, they are distributed throughout the genome, and all "classes" of multiplicity is present.

    From that I reason that already the MNase digestion prefers repetitive sequences, but I have no idea why and how to prevent this.

    Does anyone has Input on this?

  • #2
    Did you try to report only uniquely mappable reads?

    Comment


    • #3
      Yes sure.
      IT makes 10-20% of the whole genome. However, in next experiments I do not want to waste 80% of my data on multimappers, if avoidable.

      As the problem appears also in the Input, I suspect the MNase digestion or clean-up to cause the problem. This is why I ask for help

      Comment


      • #4
        What is your definition of multimappers?

        Comment


        • #5
          That one single read maps to more than one position in the genome with the same confidence.

          Is there any other definition?

          Comment


          • #6
            I analysed some public MNase data recently for a large complex genome >2gb and had this issue too with 50bp reads. There appears to be a very high background in MNase data.

            Longer read lengths might partially mitigate the issue. However, if MNase indicates you are primarily sequencing complex repeats this problem will always occur if read lengths are shorter than the compound repeats. Interestingly, those using the data are very happy with it, especially the patterns around gene boundaries appear as expected.

            To practically deal with it I have produced two tracks, one with all reads and one filtered by samtools to only include Mapping Quality 20+ reads.

            Comment


            • #7
              Thank you all for your replies!

              Would you think that using paired-end seq might help?
              First of all it is likely that one of the two reads is not multi-mapper, and thus would give more confidence for the second read.
              Second, since one knows the possible distance between the reads, one can further increase mapping confidence.

              IS that an option?

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Techniques and Challenges in Conservation Genomics
                by seqadmin



                The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                Avian Conservation
                Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                03-08-2024, 10:41 AM
              • seqadmin
                The Impact of AI in Genomic Medicine
                by seqadmin



                Artificial intelligence (AI) has evolved from a futuristic vision to a mainstream technology, highlighted by the introduction of tools like OpenAI's ChatGPT and Google's Gemini. In recent years, AI has become increasingly integrated into the field of genomics. This integration has enabled new scientific discoveries while simultaneously raising important ethical questions1. Interviews with two researchers at the center of this intersection provide insightful perspectives into...
                02-26-2024, 02:07 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 03-14-2024, 06:13 AM
              0 responses
              33 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 03-08-2024, 08:03 AM
              0 responses
              72 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 03-07-2024, 08:13 AM
              0 responses
              81 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 03-06-2024, 09:51 AM
              0 responses
              68 views
              0 likes
              Last Post seqadmin  
              Working...
              X