Several questions regarding BBMap/BBSplit:
1. The ambig flag -- Brian Bushnell states:
Set behavior on ambiguously mapped reads (with multiple top-scoring mapping locations)
--> How is "ambiguously mapped reads"/"top-scoring" defined exactly? Is it about reads which map "good enough" to be mapped on several positions in the first place, but the mapping quality can differ? Or what is the rule/statistic you use? Do they have to be "significantly better" than some lower-scoring reads (which still would map). Is is there some exact doc / or where in the code would I find it to see what exactly is going on? :-)
2. BBSplit: Are the criteria for ambig2 the same as ambig? (Except the fact that we are talking about different ref. genomes).
What would happen if we have the following three scenarios:
We have three top-scoring hits for one read (let's say they have score1 to score3, score1 is the best, but all are "very good" hits). We have two hits to ref1 with score1 and score3, one to ref2 with score2
Scenario 1: I have ambig=best and ambig2=best --> which aligments get reported?
Scenario 2: I have ambig=all and ambig2=best --> which aligments get reported?
Scenario 3: I have ambig=best and ambig2=all --> which aligments get reported?
3. How to intrepret the "XT" flag in the sam file (like shown in IGV):
- What does "XT = R" mean? Repeat?
- What does the flag "AM" mean?
Many thanks for this good tool!
Michael
1. The ambig flag -- Brian Bushnell states:
Set behavior on ambiguously mapped reads (with multiple top-scoring mapping locations)
--> How is "ambiguously mapped reads"/"top-scoring" defined exactly? Is it about reads which map "good enough" to be mapped on several positions in the first place, but the mapping quality can differ? Or what is the rule/statistic you use? Do they have to be "significantly better" than some lower-scoring reads (which still would map). Is is there some exact doc / or where in the code would I find it to see what exactly is going on? :-)
2. BBSplit: Are the criteria for ambig2 the same as ambig? (Except the fact that we are talking about different ref. genomes).
What would happen if we have the following three scenarios:
We have three top-scoring hits for one read (let's say they have score1 to score3, score1 is the best, but all are "very good" hits). We have two hits to ref1 with score1 and score3, one to ref2 with score2
Scenario 1: I have ambig=best and ambig2=best --> which aligments get reported?
Scenario 2: I have ambig=all and ambig2=best --> which aligments get reported?
Scenario 3: I have ambig=best and ambig2=all --> which aligments get reported?
3. How to intrepret the "XT" flag in the sam file (like shown in IGV):
- What does "XT = R" mean? Repeat?
- What does the flag "AM" mean?
Many thanks for this good tool!
Michael
Comment