
**Brian Bushnell** · 09-30-2015, 06:18 PM

If you are looking for specific microbial species (or really doing anything with NGS reads), BLAST is not really the best tool. You can calculate abundance by mapping or kmer-matching your sequences to just the set of references of interest.

And certainly, you should first validate that your approach works on synthetic data before proceeding. That will allow you to quantify true positive, false positive, and false negative rates, which may lead you to adjust your methodology prior to doing any real work.
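As a rough illustration of that validation step, here is a minimal Python sketch that counts true positives, false positives, and false negatives for one organism. It assumes you already have, for each synthetic read, the organism it was generated from and the organism your pipeline assigned it to (None for unassigned). The function names and labels are hypothetical and not part of any tool.

```python
def confusion_counts(truth, assigned, organism):
    """Count (TP, FP, FN) for one organism from per-read labels.

    truth    -- organism each synthetic read was generated from
    assigned -- organism the pipeline reported for each read (None if unassigned)
    """
    tp = fp = fn = 0
    for true_label, call in zip(truth, assigned):
        if call == organism and true_label == organism:
            tp += 1  # correctly assigned to this organism
        elif call == organism:
            fp += 1  # assigned to this organism but came from elsewhere
        elif true_label == organism:
            fn += 1  # came from this organism but was missed or misassigned
    return tp, fp, fn


def precision_recall(tp, fp, fn):
    """Return (precision, recall), using 0.0 when a denominator is zero."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


if __name__ == "__main__":
    # Toy labels for five synthetic reads (made up for illustration).
    truth = ["ecoli", "ecoli", "kleb", "kleb", "ecoli"]
    assigned = ["ecoli", "kleb", None, "kleb", "ecoli"]
    for name in ("ecoli", "kleb"):
        counts = confusion_counts(truth, assigned, name)
        print(name, counts, precision_recall(*counts))
```

Running this per organism on a synthetic dataset gives you numbers to compare before and after any change to your methodology.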

For kmer-matching, I recommend Seal (part of the BBMap package), which is fast and easy to use for quantifying sequence expression levels. For example:

seal.sh in=reads.fq ref=ecoli.fa,klebsiella1.fa,klebsiella2.fa stats=stats.txt refstats=refstats.txt

If you generate synthetic reads with errors, and annotate the reads with the name of the organism they came from, you can then objectively calculate the accuracy. Alternatively, if you just want to measure levels of various bacteria, you can generate synthetic reads and mix them in a specific ratio, then evaluate your approach by calculating how closely the output of expression levels matches the ratio of reads you mixed together. That's easier as it does not require parsing the read names.
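The ratio-based check could look something like the Python sketch below. It assumes you know how many reads of each organism you mixed in, and that you have copied the per-reference read counts out of your tool's output by hand. The dictionaries, reference names, and numbers are hypothetical, and nothing here parses Seal's output files.

```python
def normalize(counts):
    """Convert raw read counts into fractions that sum to 1."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no reads counted")
    return {name: n / total for name, n in counts.items()}


def abundance_errors(expected_counts, observed_counts):
    """Per-organism difference (observed fraction minus expected fraction)."""
    expected = normalize(expected_counts)
    observed = normalize(observed_counts)
    names = sorted(set(expected) | set(observed))
    return {n: observed.get(n, 0.0) - expected.get(n, 0.0) for n in names}


def total_absolute_error(errors):
    """Sum of absolute per-organism errors; 0 means a perfect match."""
    return sum(abs(e) for e in errors.values())


if __name__ == "__main__":
    # Reads mixed in (known) versus reads reported per reference (made up).
    mixed = {"ecoli": 600, "klebsiella1": 300, "klebsiella2": 100}
    reported = {"ecoli": 580, "klebsiella1": 310, "klebsiella2": 110}
    errors = abundance_errors(mixed, reported)
    for name, err in errors.items():
        print(f"{name}\t{err:+.3f}")
    print("total absolute error:", round(total_absolute_error(errors), 3))
```

A single summary number such as the total absolute error makes it easy to compare parameter settings, though the per-organism differences are usually more informative when closely related references (like the two Klebsiella genomes) are involved.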


Making a in silico mock community
