Hello,
I'm running some metagenomic data through blastn, and I'd like to have the output sorted by e-value.
The default output does sort by e-value, however it first sorts by % coverage. If I limit the search only to a certain family, the top hits have lower e-values, albeit only ~98% coverage.
I'd like to blast these sequences across the entire nucleotide database to eliminate biases introduced by limiting to certain families/orders, however I'd like the top hits to be the lowest e-values, even if the total coverage is a bit lower.
The problem I am running into is that even if I allow 20,000 hits, each has total coverage of 100%, so I can't simply sort by the e-value, since many hits aren't showing up.
Thank you!
I'm running some metagenomic data through blastn, and I'd like to have the output sorted by e-value.
The default output does sort by e-value, however it first sorts by % coverage. If I limit the search only to a certain family, the top hits have lower e-values, albeit only ~98% coverage.
I'd like to blast these sequences across the entire nucleotide database to eliminate biases introduced by limiting to certain families/orders, however I'd like the top hits to be the lowest e-values, even if the total coverage is a bit lower.
The problem I am running into is that even if I allow 20,000 hits, each has total coverage of 100%, so I can't simply sort by the e-value, since many hits aren't showing up.
Thank you!