SEQanswers

08-19-2013, 06:40 AM   #1
Petrichor
Member
 
Location: Germany

Join Date: Apr 2013
Posts: 10
How to compare two HSPs with each other?

Hi there! I have a small problem.

I use BioPerl's SearchIO to parse my BLAST output. SearchIO represents the output as Results, each of which contains Hits, which in turn contain HSPs.

How can I compare two HSPs with each other?
The two HSPs I want to compare are the current one and the one before it. If the current HSP is longer than the previous one and covers the same region of the query, I want to keep the current HSP and replace the old one (e.g. in an array).
I want to do that for all hits of one query entry, so that I end up with at most one hit per region of the query.
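
To make the intended filtering concrete, here is a minimal sketch of the logic in plain Python, independent of any BLAST parser. The `Hsp` record and its field names are hypothetical; with BioPerl the coordinates would presumably come from `$hsp->start('query')` and `$hsp->end('query')`. It is a variant of the idea above rather than a literal "compare with the previous HSP" loop: sorting by length first means a longer HSP never has to displace several shorter ones that were kept earlier. It assumes `q_start <= q_end` and treats any shared query position as "the same region".

```python
from collections import namedtuple

# Hypothetical minimal record: hit name plus 1-based, inclusive query coordinates.
Hsp = namedtuple("Hsp", ["hit_name", "q_start", "q_end"])


def overlaps(a, b):
    """Return True if two HSPs share at least one position on the query."""
    return a.q_start <= b.q_end and b.q_start <= a.q_end


def query_length(h):
    """Length of the HSP on the query."""
    return h.q_end - h.q_start + 1


def keep_longest_per_region(hsps):
    """Keep at most one HSP per query region, preferring longer HSPs."""
    kept = []
    # Visit the longest HSPs first; a shorter HSP is only kept
    # if it does not overlap anything already kept.
    for h in sorted(hsps, key=query_length, reverse=True):
        if not any(overlaps(h, k) for k in kept):
            kept.append(h)
    # Report the survivors in query order.
    return sorted(kept, key=lambda h: h.q_start)


# Made-up HSPs from several hits of one query.
hsps = [
    Hsp("hitA", 1, 100),
    Hsp("hitB", 50, 250),
    Hsp("hitC", 300, 400),
    Hsp("hitD", 310, 350),
]
result = keep_longest_per_region(hsps)
for h in result:
    print(h.hit_name, h.q_start, h.q_end)
```

Here hitA is dropped because it overlaps the longer hitB, and hitD is dropped because it lies inside hitC. If "same place" should mean something stricter, such as a minimum overlap fraction, only `overlaps` would need to change.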

Thanks for any help in advance!

For reference: http://www.bioperl.org/wiki/HOWTO:SearchIO
08-26-2013, 01:12 AM   #2
mbayer
Member
 
Location: Dundee, Scotland

Join Date: Mar 2009
Posts: 29

Hi Petrichor,

By default, when a BLAST hit has multiple HSPs to the same subject, they are sorted by bit score in descending order, i.e. the best (and usually longest) HSP is topmost.

Here is what I have done in the past to extract these:

1. Generate tabular BLAST output (use "-outfmt 6" option in BLAST+ executables)
2. Load this into Excel or whatever you use to view spreadsheets
3. Remove duplicates based on the subject column. This should get rid of any secondary HSPs for each subject and leave only the topmost ones, as the output is already sorted. If there are more than two HSPs per subject, you may need to repeat this until you are left with just one each.
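
The steps above can also be sketched without a spreadsheet. The following is a minimal Python example, assuming the default 12-column "-outfmt 6" layout (qseqid, sseqid, pident, length, mismatch, gapopen, qstart, qend, sstart, send, evalue, bitscore) and relying on the file order described above. The function name and the sample rows are made up for illustration. One small deviation from step 3: it keys on the query and subject columns together, so a file with several queries is handled too.

```python
import csv
import io


def top_hsp_per_subject(handle):
    """Keep only the first row seen for each (query, subject) pair."""
    seen = set()
    kept = []
    for row in csv.reader(handle, delimiter="\t"):
        # Skip blank lines and comment lines (as written by -outfmt 7).
        if not row or row[0].startswith("#"):
            continue
        key = (row[0], row[1])  # qseqid, sseqid
        if key not in seen:
            seen.add(key)
            kept.append(row)
    return kept


# Made-up tabular output: two queries, secondary HSPs listed after the best one.
example = io.StringIO(
    "q1\ts1\t98.0\t200\t4\t0\t1\t200\t10\t209\t1e-100\t360\n"
    "q1\ts1\t90.0\t50\t5\t0\t300\t349\t400\t449\t1e-15\t80\n"
    "q1\ts2\t95.0\t90\t4\t0\t1\t90\t5\t94\t1e-40\t150\n"
    "q2\ts1\t97.0\t120\t3\t0\t1\t120\t500\t619\t1e-55\t200\n"
    "q2\ts1\t88.0\t40\t5\t0\t200\t239\t700\t739\t1e-10\t60\n"
)
top = top_hsp_per_subject(example)
for row in top:
    print("\t".join(row))
```

For a real file, `open("blast.tsv", newline="")` could be passed in place of the `io.StringIO` object (the file name is just a placeholder). Because only the first row per pair is kept, a single pass is enough however many HSPs each subject has.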

cheers

Micha
Tags
bioperl, perl
