**GenoMax** · 11-25-2014, 04:00 PM

If these two strains are relatively closely related then you can identify the similarities using BLAT (https://genome.ucsc.edu/FAQ/FAQblat.html). Post-alignment processing will have to be done to extract the information you need from the results.

You could learn to do some of this, but if you are working against a deadline then it may be better to find a programmer friend or your local bioinformatics support facility. They should be able to do this for you.
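
For anyone who wants to try the post-alignment processing themselves, here is a minimal Python sketch. It assumes BLAT's default tab-separated PSL output, where column 1 is the match count, column 3 the repeat-match count, column 10 the query name and column 11 the query size. The function name `queries_without_hit` and the `min_fraction` cutoff are illustrative choices, not part of BLAT, and the cutoff would need tuning for real data.

```python
def queries_without_hit(psl_lines, all_queries, min_fraction=0.9):
    """Return query names that have no alignment matching at least
    min_fraction of the query length (hypothetical cutoff)."""
    matched = set()
    for line in psl_lines:
        fields = line.rstrip("\n").split("\t")
        # PSL data rows have 21 columns and start with an integer match
        # count; this also skips the header block and blank lines.
        if len(fields) < 21 or not fields[0].isdigit():
            continue
        matches = int(fields[0])
        rep_matches = int(fields[2])
        q_name = fields[9]
        q_size = int(fields[10])
        if q_size and (matches + rep_matches) / q_size >= min_fraction:
            matched.add(q_name)
    # Preserve the caller's ordering of query names.
    return [q for q in all_queries if q not in matched]
```

You would pass it the opened PSL file plus the list of sequence names from your query FASTA; whatever comes back are the candidates that are unique to the query set under that cutoff.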

**GenoMax** · 11-25-2014, 04:14 PM

CD-HIT-2D may be useful: http://weizhong-lab.ucsd.edu/cdhit_s...?cmd=cd-hit-2d

Best of all, you can try it yourself without waiting for someone's help. You may still need to do some parsing afterwards.
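
As a rough idea of what that parsing could look like, here is a small Python sketch for a CD-HIT style `.clstr` file. It assumes the usual layout, where each cluster starts with a `>Cluster N` line and each member line holds the sequence name between `>` and `...`. The function names `parse_clstr` and `singletons` are made up for this example, so check the layout against your own output before relying on it.

```python
import re

def parse_clstr(lines):
    """Map cluster number -> list of member sequence names."""
    clusters = {}
    current = None
    for line in lines:
        line = line.strip()
        if line.startswith(">Cluster"):
            current = int(line.split()[1])
            clusters[current] = []
        elif line and current is not None:
            # Member lines look like: "0<TAB>250aa, >name... *"
            m = re.search(r">(.+?)\.\.\.", line)
            if m:
                clusters[current].append(m.group(1))
    return clusters

def singletons(clusters):
    """Names of sequences that sit alone in their cluster."""
    return [members[0] for members in clusters.values() if len(members) == 1]
```

Clusters with a single member are the sequences for which no similar partner was reported, which is usually the list you are after when looking for strain-specific proteins.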

**zerhacker** · 12-02-2014, 03:26 PM

Thank you! I checked out the programs that you suggested, but I ended up **generating fake sets of Illumina reads from both sequences using SimSeq (https://github.com/jstjohn/SimSe),
then I used bowtie2 to align each read set to the other sequence, pulled out the reads that don't align, de novo assembled them into short contigs, and extracted their ORFs, which code for the unique proteins.**
I'm bookmarking BLAT as it seems like a fairly useful program.

Edited: bolded my procedure to make it easier to read.
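
For readers following the same route, the last step (pulling ORFs out of the assembled contigs) can be sketched in plain Python. This is only an illustration, not the tool used in this thread: it assumes the standard genetic code, scans all six frames for ATG-to-stop stretches, and uses a hypothetical `min_aa` length cutoff. ORFs that run off the end of a contig without a stop codon are dropped.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def reverse_complement(seq):
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def find_orfs(seq, min_aa=30):
    """Return protein sequences of complete ORFs (ATG ... stop)
    with at least min_aa residues, from all six reading frames."""
    seq = seq.upper()
    proteins = []
    for strand in (seq, reverse_complement(seq)):
        for frame in range(3):
            protein = []
            in_orf = False
            for i in range(frame, len(strand) - 2, 3):
                # Codons with ambiguous bases are translated as X.
                aa = CODON_TABLE.get(strand[i:i + 3], "X")
                if not in_orf:
                    if aa == "M":
                        in_orf = True
                        protein = ["M"]
                elif aa == "*":
                    if len(protein) >= min_aa:
                        proteins.append("".join(protein))
                    in_orf = False
                else:
                    protein.append(aa)
    return proteins
```

Calling `find_orfs(contig_sequence)` on each contig gives candidate protein sequences that can then be checked against the other strain.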

**GenoMax** · 12-02-2014, 03:58 PM

As long as you were able to get what you needed :-)

What program did you use to generate the "Illumina" reads? Just for the record, for someone running across this thread later on via a search.

**zerhacker** · 12-02-2014, 04:37 PM

https://github.com/jstjohn/SimSe

GitHub is where people build software. More than 100 million people use GitHub to discover, fork, and contribute to over 420 million projects.

I think SimSeq works great, but I used a Python script written by the department's programmer that works similarly.
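
That script is not shown here, so the following is only a guess at the general idea, not the actual code and not how SimSeq works: it samples error-free, single-end, fixed-length reads uniformly from a sequence. The function name `simulate_reads` and its parameters (`read_len`, `coverage`, `seed`) are hypothetical, and there is no error model, pairing, or quality scores.

```python
import random

def simulate_reads(name, seq, read_len=100, coverage=10, seed=0):
    """Return (read_id, read_sequence) tuples sampled uniformly
    from seq at roughly the requested coverage."""
    if len(seq) < read_len:
        return []
    rng = random.Random(seed)  # fixed seed keeps runs reproducible
    n_reads = max(1, coverage * len(seq) // read_len)
    reads = []
    for i in range(n_reads):
        start = rng.randrange(len(seq) - read_len + 1)
        reads.append((f"{name}_read{i}_pos{start}", seq[start:start + read_len]))
    return reads
```

Writing each tuple out as a FASTA record gives a file that an aligner can take as input. Because the reads carry no errors, any read that fails to align to the other sequence reflects a real difference rather than simulated noise.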

Align two sets of amino acid sequences