**mastal** · 04-23-2014, 11:51 AM

I don't think you can use the --very-sensitive setting if you plan to vary the number of mismatches. I think --very-sensitive corresponds to a predefined set of parameters, namely '-D 20 -R 3 -N 0 -L 20 -i S,1,0.50', where -N is the number of mismatches allowed in the seed region.

**dpryan** · 04-23-2014, 12:46 PM

The presets don't alter the --score-min setting, so you can mix them at will. The default --score-min will allow up to 15 mismatches for your reads (I'm ignoring indels here and assuming that you're either using --ignore-quals or that the mismatches occur at positions with high phred scores), so you can gauge the effect of a stricter threshold by just filtering on MAPQ, the AS tag, or the MD tag.

A constant (C) score-min is only useful if you want a maximum edit distance regardless of read length (again, I'm ignoring the effect of phred scores on the mismatch penalty). In general, with end-to-end alignments you would want the minimum alignment score to change a bit with read length. The main way to determine the exact equation to use is with simulated reads, since the --score-min setting directly affects the MAPQ calculation. The defaults generally produce MAPQ scores pretty similar to those of other programs (namely BWA).
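To make the arithmetic behind the mismatch limit concrete, here is a minimal Python sketch. It assumes the Bowtie2 function syntax (C = constant, L = linear, S = square root, G = natural log), the end-to-end default `--score-min L,-0.6,-0.6`, and a flat penalty of 6 per mismatch (the default maximum from `--mp`, which applies when qualities are ignored or high). Indels and Ns are ignored, and the helper names `min_score` and `max_mismatches` are made up for illustration; they are not part of Bowtie2.

```python
import math


def min_score(spec: str, read_len: int) -> float:
    """Evaluate a Bowtie2-style function spec (e.g. 'L,-0.6,-0.6') at read_len."""
    parts = spec.split(",")
    kind = parts[0].strip().upper()
    a = float(parts[1])
    # The constant form 'C,<A>' has no second coefficient.
    b = float(parts[2]) if len(parts) > 2 else 0.0
    if kind == "C":
        return a
    if kind == "L":
        return a + b * read_len
    if kind == "S":
        return a + b * math.sqrt(read_len)
    if kind == "G":
        return a + b * math.log(read_len)
    raise ValueError(f"unknown function type: {kind!r}")


def max_mismatches(spec: str, read_len: int, penalty: int = 6) -> int:
    """Largest mismatch count whose total penalty stays within the minimum score.

    Assumes every mismatch costs `penalty` (an assumption: no indels, no Ns,
    and no quality-dependent scaling of the mismatch penalty).
    """
    budget = -min_score(spec, read_len)
    if budget < 0:
        return 0
    return int(budget // penalty)


if __name__ == "__main__":
    for length in (50, 100, 150):
        print(length,
              max_mismatches("L,-0.6,-0.6", length),  # default, scales with length
              max_mismatches("C,-90", length))        # constant, same at any length
```

Under these assumptions the default linear function allows 5, 10, and 15 mismatches for 50, 100, and 150 nt reads, whereas the constant `C,-90` allows 15 at every length, which is the difference between the two function types described above.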

**sanjeevksh** · 04-24-2014, 07:53 AM

Hi mastal and dpryan,

Thanks for your replies; they are very helpful indeed.

@mastal:
I was not referring to mismatches in the seed region, which is what -N (0 or 1) pertains to, but to mismatches across the entire end-to-end alignment. On the same subject: if I want to use '--very-sensitive' and change only the '-N' parameter, do I submit '--very-sensitive -N 1'? Or does '--very-sensitive' become obsolete once any parameter it covers is altered, so that one has to submit the entire parameter set separately, like '-D 20 -R 3 -N 1 -L 20 -i S,1,0.50'?

@dpryan:
So this means I should just map once with the default '--score-min' setting, then apply filters and use the results as I see fit? I will now try to find out how to filter by MAPQ, the AS tag, or the MD tag.

I am not specifying anything like --ignore-quals.

Regarding the function to use, I would stick to 'L' which I have used in some test runs.

Regards,
Sanjeevksh

**dpryan** · 04-24-2014, 10:50 AM

Yup, exactly. You'll want to filter according to the MD tag, possibly also looking at the sequence in the case of Ns, which you presumably don't want to count.

**sanjeevksh** · 04-24-2014, 11:11 AM

MD gives a string, so the 'XM:i:<N>' field may be more helpful in this context?

Cheers!

**dpryan** · 04-24-2014, 11:24 AM

Indeed, yes, I'd forgotten about that tag!
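The XM-based filtering agreed on above could be sketched roughly as follows in plain Python, with no external libraries. It assumes tab-separated SAM text in which aligned records carry Bowtie2's optional `XM:i:<N>` field (number of mismatches in the alignment). The function names `xm_mismatches` and `filter_by_mismatches` are hypothetical, and the sketch does not treat Ns separately.

```python
from typing import Iterable, Iterator, Optional


def xm_mismatches(sam_line: str) -> Optional[int]:
    """Return the XM:i value of a SAM record, or None if the tag is absent."""
    fields = sam_line.rstrip("\n").split("\t")
    # Optional TAG:TYPE:VALUE fields start after the 11 mandatory SAM columns.
    for tag in fields[11:]:
        if tag.startswith("XM:i:"):
            return int(tag[5:])
    return None


def filter_by_mismatches(sam_lines: Iterable[str],
                         max_mismatches: int) -> Iterator[str]:
    """Yield header lines plus records with at most max_mismatches mismatches.

    Records without an XM tag (e.g. unaligned reads) are dropped.
    """
    for line in sam_lines:
        if line.startswith("@"):
            yield line  # keep header lines untouched
            continue
        n = xm_mismatches(line)
        if n is not None and n <= max_mismatches:
            yield line
```

Because it works on any iterable of lines, the same function can be run over an open SAM file handle or over standard input, so one default-settings alignment can be re-filtered at several mismatch thresholds without remapping.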

Bowtie2 --score-min or mismatch setting
