
  • bowtie2 parameters for chip-seq

    I'm working with the new bowtie2. Has anybody here used it to align ChIP-seq reads? Please comment on your choice of parameters.

    It is common to keep only reads that align uniquely. With the old bowtie, --best and -m 1 did the trick. I'm not sure about the bowtie2 equivalent, so I will try -M 1.

  • #2
    Are there any suggestions for this question?



    • #3
      The Bowtie2 manual says:
      =======
      Mapping quality: higher = more unique

      Accurate mapping qualities are useful for downstream tools like variant callers. For instance, a variant caller might choose to ignore evidence from alignments with mapping quality less than, say, 10. A mapping quality of 10 or less indicates that there is at least a 1 in 10 chance that the read truly originated elsewhere.
      =======

      Could we choose a mapping quality threshold for ChIP-seq? For example, could all reads with a mapping quality higher than 30 be considered uniquely mapping reads?

      Thanks
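
For intuition about candidate thresholds, a cutoff can be translated into an error probability. This is a rough sketch that assumes MAPQ follows the Phred-scaled definition in the SAM specification (MAPQ = -10 * log10 of the probability that the mapping position is wrong). The function names are illustrative, and aligners such as bowtie2 only approximate this scale.

```python
import math


def mapq_to_error_probability(mapq: int) -> float:
    """Convert a Phred-scaled MAPQ into the implied probability of a wrong position."""
    return 10 ** (-mapq / 10)


def error_probability_to_mapq(p_wrong: float) -> int:
    """Convert a probability of a wrong position into a rounded Phred-scaled MAPQ."""
    return round(-10 * math.log10(p_wrong))


# Thresholds mentioned in this thread: 10, 15 and 30.
for q in (10, 15, 30):
    print(f"MAPQ {q}: about {mapq_to_error_probability(q):.4f} chance of a wrong position")
```

Under that assumption, MAPQ 10 corresponds to roughly a 1 in 10 chance of a wrong position and MAPQ 30 to roughly 1 in 1000, which is why 30 is a much stricter filter than 10.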



      • #4
        I think I found the answer in this paper.

        The answer is YES.

        "...Reads were filtered by removing those with a BWA alignment quality score less than 15..."

        Differential oestrogen receptor binding is associated with clinical outcome in breast cancer
        Nature, Vol. advance online publication (4 January 2012) doi:10.1038/nature10730




        • #5
          What does a mapping quality of 0 mean then? That the read may have originated anywhere in the genome?
          And if I understand correctly, reads with low MAPQ should be filtered out before calling peaks, right?
          Luca



          • #6
            In bowtie2, a MAPQ of 0 means one of the following:
            1. The reported alignment and the next best alignment are equally good, but neither is an exact match (if they are exact matches, the MAPQ is set to 1).
            2. The absolute difference in alignment score between the best and second best alignment is >= 10% (and <30%) of the maximum possible difference in alignment scores and the best alignment's score is itself <67% of the maximum difference in alignment scores.

            This is for end-to-end alignments. For local alignments, only #1 will produce this (#2 would produce MAPQs of 9, 12, 14, or 17, depending).

            Yes, this is highly confusing and no, it's not documented (unless you consider source code to be documentation).
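
The two end-to-end conditions above can be written out as a small check. This is only a transcription of the conditions as stated in this post, not code taken from bowtie2. The function name, the argument names, and the reading of "the best alignment's score" as the score measured above the minimum valid score are all assumptions.

```python
def is_mapq_zero_end_to_end(best, second_best, perfect, minimum):
    """Return True if the two conditions stated in the post would give MAPQ 0.

    best, second_best: alignment scores of the best and next best alignments.
    perfect: score of an exact match (0 in end-to-end mode).
    minimum: lowest alignment score still accepted as valid.
    """
    max_diff = perfect - minimum      # "maximum possible difference in alignment scores"
    gap = abs(best - second_best)     # difference between best and second best
    best_over_min = best - minimum    # assumption: best score measured from the minimum

    # Condition 1: equally good alignments that are not exact matches.
    if gap == 0 and best != perfect:
        return True

    # Condition 2: gap is >= 10% and < 30% of max_diff, and the best score
    # is below 67% of max_diff.
    if 0.10 * max_diff <= gap < 0.30 * max_diff and best_over_min < 0.67 * max_diff:
        return True

    return False


# Hypothetical scores: perfect = 0, minimum valid score = -30.
print(is_mapq_zero_end_to_end(best=-12, second_best=-12, perfect=0, minimum=-30))
print(is_mapq_zero_end_to_end(best=0, second_best=0, perfect=0, minimum=-30))
```

With these made-up scores, the first call matches condition 1 (tied, inexact alignments), while the second is the tied exact-match case that the post says receives MAPQ 1 instead.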



            • #7
              Originally posted by dpryan
              Yes, this is highly confusing


              Right, then the reasonable way to proceed is to keep the aligned tags with, say, MAPQ > 10 and call peaks with them. Does that make sense?
              Could the low-scoring tags still be informative, e.g. about binding to repetitive sequences?
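
A minimal sketch of that idea, assuming the alignments are available as SAM text lines: reads above the cutoff go to peak calling, and the low-scoring ones are set aside for later inspection rather than discarded. The function name and the example records are hypothetical.

```python
def partition_by_mapq(sam_lines, min_mapq=10):
    """Split SAM alignment lines into (kept, low) using MAPQ > min_mapq.

    Header lines (starting with '@') are skipped. MAPQ is the fifth
    tab-separated column of a SAM record.
    """
    kept, low = [], []
    for line in sam_lines:
        if line.startswith("@"):
            continue
        fields = line.rstrip("\n").split("\t")
        mapq = int(fields[4])
        if mapq > min_mapq:
            kept.append(line)
        else:
            low.append(line)
    return kept, low


# Made-up records for illustration only.
example = [
    "@HD\tVN:1.0\tSO:unsorted",
    "read1\t0\tchr1\t100\t42\t4M\t*\t0\t0\tACGT\tIIII",
    "read2\t0\tchr1\t200\t1\t4M\t*\t0\t0\tACGT\tIIII",
    "read3\t16\tchr2\t300\t10\t4M\t*\t0\t0\tACGT\tIIII",
]
kept, low = partition_by_mapq(example)
print(len(kept), "kept for peak calling;", len(low), "set aside for inspection")
```

Keeping the low-MAPQ group in its own list (or file) makes it possible to check later whether those reads pile up in repeat regions.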



              • #8
                That seems like a sensible MAPQ threshold. I agree that the multimappers can still be quite informative. It's likely a good idea to look at them in IGV and bring up a RepeatMasker track to see whether they turn out to be interesting. The last thing you want to do is throw out multimappers if it turns out that your protein does bind to a repeat region!



                • #9
                  Yes, I agree.
                  See how the ENCODE people do it:
                  samtools view -b -F 1548 -q 30 chipSampleRep1.bam

                  They (Anshul Kundaje) use -q 30 in their guidelines.
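
To see what that command filters, the sketch below decodes the -F 1548 mask using the FLAG bits from the SAM specification and mirrors the keep/drop rule in plain Python. The helper names are illustrative, and this is not how samtools itself is implemented.

```python
# FLAG bits as defined in the SAM specification.
SAM_FLAGS = {
    0x1: "paired",
    0x2: "proper_pair",
    0x4: "unmapped",
    0x8: "mate_unmapped",
    0x10: "reverse",
    0x20: "mate_reverse",
    0x40: "read1",
    0x80: "read2",
    0x100: "secondary",
    0x200: "qc_fail",
    0x400: "duplicate",
    0x800: "supplementary",
}


def decode_flag(flag):
    """List the names of all FLAG bits set in an integer."""
    return [name for bit, name in SAM_FLAGS.items() if flag & bit]


def passes_filter(flag, mapq, exclude=1548, min_mapq=30):
    """Mirror `-F exclude -q min_mapq`: no excluded bit set and MAPQ >= min_mapq."""
    return (flag & exclude) == 0 and mapq >= min_mapq


print(decode_flag(1548))
```

So -F 1548 drops reads that are unmapped, have an unmapped mate, fail vendor QC, or are marked as duplicates (4 + 8 + 512 + 1024), and -q 30 additionally drops anything with MAPQ below 30.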
