Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Bowtie option --best --strata for unique mapper in the genome

    Hi,

    I have some trouble with alignment using bowtie.
    I'm searching for reads that map at a unique place on the genome, so, i take 1 reads that i know it's a unique mapper and it map the genome 100%, and i try to align it with bowtie with the following command, allowing up to 2 mismatches (because for the other reads i want to allow up to 2 mismatches).

    For verification :
    Code:
    [B]bowtie -v 0 -m 1 -t droso_build -c ATGCGAGAGTGGT[/B]
    Time loading forward index: 00:00:01
    Time for 0-mismatch search: 00:00:00
    0	+	chr2L	5308	ATGCGAGAGTGGT	IIIIIIIIIIIII	0	
    # reads processed: 1
    # reads with at least one reported alignment: 1 (100.00%)
    # reads that failed to align: 0 (0.00%)
    Reported 1 alignments to 1 output stream(s)
    Time searching: 00:00:01
    Overall time: 00:00:01
    But, when i try to align the other reads (allowing up to 2mismatches), this reads have no hit :
    Code:
    [B]bowtie -m 1  -v 2 -t droso_build -c ATGCGAGAGTGGT[/B]
    Time loading forward index: 00:00:00
    Time loading mirror index: 00:00:01
    End-to-end 2/3-mismatch full-index search: 00:00:00
    # reads processed: 1
    # reads with at least one reported alignment: 0 (0.00%)
    # reads that failed to align: 0 (0.00%)
    # reads with alignments suppressed due to -m: 1 (100.00%)
    No alignments
    Time searching: 00:00:01
    Overall time: 00:00:01
    But,when i do:

    And whan i add -a --best --strata i get :
    Code:
    bowtie -a -v 2 -m 1 --best --strata -t droso_build -c ATGCGAGAGTGGT
    Time loading forward index: 00:00:00
    Time loading mirror index: 00:00:01
    End-to-end 2/3-mismatch full-index search: 00:00:00
    0	+	chr2L	5308	ATGCGAGAGTGGT	IIIIIIIIIIIII	0	
    # reads processed: 1
    # reads with at least one reported alignment: 1 (100.00%)
    # reads that failed to align: 0 (0.00%)
    Reported 1 alignments to 1 output stream(s)
    Time searching: 00:00:01
    Overall time: 00:00:01
    I'm really lost, is the -a --best --strata a good solution ?

    Thanks in advance for your reply.

  • #2
    Reporting uniquely mapped reads with bowtie

    I have a similar problem - I want bowtie to report only uniquely mapped reads. I figured using the flags -n 3 --best -m1 will accomplish this:
    --best should ensure that only the best alignments among the hits are considered, and -m 1 should ensure that the alignment is reported only when there is a unique best alignment.

    But, it doesn't seem to work that way.
    bowtie -n 3 -a returns two alignments for the read:
    Code:
    alignment1 CIGAR: 48M flags: XA:i:0	MD:Z:48	NM:i:0
    alignment2 CIGAR: 48M flags: XA:i:0	MD:Z:0C47	NM:i:1
    Clearly, the second alignment is not the same as the first.
    bowtie -n 3 --best returns only the first alignment:
    Code:
    alignment1 CIGAR: 48M flags: XA:i:0	MD:Z:48	NM:i:0
    Everything looks good so far. Now, I add the -m 1 flag, to report only unique reads, but:
    bowtie -n 3 --best -m 1 reports no alignments for this read:

    Code:
    # reads processed: 1
    # reads with at least one reported alignment: 0 (0.00%)
    # reads that failed to align: 0 (0.00%)
    # reads with alignments suppressed due to -m: 1 (100.00%)
    No alignments
    Am I doing anything wrong? Any help is appreciated.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Essential Discoveries and Tools in Epitranscriptomics
      by seqadmin


      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
      Yesterday, 07:01 AM
    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    39 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    41 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 09:21 AM
    0 responses
    35 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-04-2024, 09:00 AM
    0 responses
    55 views
    0 likes
    Last Post seqadmin  
    Working...
    X