Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Bowtie: How to retain only uniquely mapped reads?

    Hi!

    I have been searching for the answer to this questions on the internet (google, this forum, bowtie manual), but can't find a clear answer.

    I am mapping RNASeq reads to the mouse genome . If one read aligns to multiple positions on the genome I want it to not be mapped at all.
    I used the --best option for this, but I am not sure if it's right. The bowtie manual only says: "--best: Make Bowtie guarantee that reported singleton alignments are "best" in terms of stratum [...] and in terms of the quality values at the mismatched position(s)." But what happens if two alignments are equally good?

    If it is not right to use --best for this purpose, which combination of output options can I use?

    Thanks for any help.

    Cheers,
    Esther

  • #2
    -m 1

    exemple on bowtie website :

    Example 7: -a -m 3

    $ ./bowtie -a -m 3 -v 2 e_coli -c ATGCATCATGCGCCAT
    No alignments
    Specifying -m 3 instructs bowtie to refrain from reporting any alignments for reads having more than 3 reportable alignments. The -m option is useful when the user would like to guarantee that reported alignments are "unique", for some definition of unique.

    Comment


    • #3
      Thank you very much, Nico. I'll try with that one.

      I am still wondering though, what happens with the output using --best if there are two (or more) best (equally good) alignments.

      And am I right if I say that it doesn't make sense to combine -m 1 with -n 2 (allow 2 mismatches), because this would throw out a read entirely if it matches one time perfect and another time with mismatches? I am having a dilemma here, because I want to allow for mismatches.

      If --best threw out a read entirely if it has two equally good best alignments and still reported reads that have more than one alignment (including one best) this would solve my problem, wouldn't it? Does --best behave this way?

      Cheers,
      Esther

      Comment


      • #4
        yeah indeed allowing two mismatches and only one alignment in the whole genome will not give you good results ( I guess, try it to see the results ).

        Comment


        • #5
          I think the solution to your problem would be a combination of several options, such as -n 2 -l {your_read_length} --best -m 1.

          If you do it this way --best is trying to report the best alignment while allowing up to 2 mismatches. If there are more equally good alignments which would be considered 'best', none of them will be reported.

          Comment


          • #6
            Thanks very much to the both of you!

            I now figured out entirely how all the different bowtie output options work together. Sorry for my question, I could have answered it by reading the manual very carefully, but as a starter in this topic I was very confused.

            So, thanks again!

            Comment


            • #7
              Hi there
              For human analysis I used -n 2 and -m 1 but I obtained variations within repeat area. What do you think I should do to avoid such results?
              Thanks

              Comment


              • #8
                I forget to say :
                My reads are 76 bases long

                Comment


                • #9
                  No idea?
                  Thanks for your time

                  Comment


                  • #10
                    It is not a good idea to be using Bowtie to map RNA-seq reads. Why aren't you using TopHat ? You are mapping mouse reads and they do have introns, because a mouse is a Eukaryote.

                    Comment


                    • #11
                      I may be mistaken. But I think you also need the --strata option to do what you want. E.g.
                      -a -n 2 -m 1 --best --strata

                      You need -a otherwise -m 1 is meaningless (e.g. -k 1 ensures more than 1 alignment will never be reported anyway, so -m 1 will never get evaluated.)
                      In that case --best by itself just lists all the alignments from best to worst, whereas --strata will cause it to only report alignments with the best strata, then -m 1 can filter out cases where there are more than 1 alignment matching the best strata.
                      Last edited by dhirallin; 03-24-2018, 11:42 PM.

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Essential Discoveries and Tools in Epitranscriptomics
                        by seqadmin


                        The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
                        Yesterday, 07:01 AM
                      • seqadmin
                        Current Approaches to Protein Sequencing
                        by seqadmin


                        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                        04-04-2024, 04:25 PM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, 04-11-2024, 12:08 PM
                      0 responses
                      44 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-10-2024, 10:19 PM
                      0 responses
                      43 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-10-2024, 09:21 AM
                      0 responses
                      38 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-04-2024, 09:00 AM
                      0 responses
                      55 views
                      0 likes
                      Last Post seqadmin  
                      Working...
                      X