Seqanswers Leaderboard Ad

**dariober** · 10-27-2010, 01:10 AM

What about...

Code:

x <- "ATGCGACTG"
y <- "ATGGNACTG"

seqdiff<- function(seq1, seq2){
    seq<- strsplit(c(seq1, seq2), split= '')
    mismatches<- which(seq[[1]] != seq[[2]])
    return(mismatches)
    }

seqdiff(x, y)
--> [1] 4 5

(Note: No attempt is made to cope with sequences of different length, which() will throw a warning in such case).

Dario

**NicoBxl** · 10-27-2010, 01:15 AM

thanks but in most of the cases the sequences have different length

**dariober** · 10-27-2010, 01:37 AM

...The variant below will right-pad the shorter sequence with 'X', which in turn will be considered as differences (not sure this is what you want...)

I assume that your sequences have been already aligned by some other program and what you want is to pull out the mismatches.

If instead you really want to do sequence alignment within R, I accidentally found the package bio3d which has a function called seqaln, see if it helps.

Anyway, I think R is not ideal for such jobs, I'd rather go for python or perl after BLAST or other aligner.

Dario

Code:

seqdiff<- function(seq1, seq2){
    seq<- strsplit(c(seq1, seq2), split= '')

    ## If the length of the two sequences differs, 
    ## pad the shorter one with X
    seqlen<- length(seq[[1]]) - length(seq[[2]])
    if(seqlen > 0){
        seq[[2]]<- append(seq[[2]], rep('X', seqlen))
        }
    if(seqlen < 0){
        seq[[1]]<- append(seq[[1]], rep('X', abs(seqlen)))
        }

    mismatches<- which(seq[[1]] != seq[[2]])
    return(mismatches)
    }

**svl** · 10-27-2010, 08:36 AM

Are you using bioconductor?

This should be doable with bioconductor as well.

How to install bioconductor:
source(“http://www.bioconductor.org/biocLite.R”)
biocLite()

How to create an aligment:

Sign in - Google Accounts

http://manuals.bioinformatics.ucr.edu/home/ht-seq#TOC-Computing-Pairwise-Sequence-Alignme

And here's the alignments vignette/doc:

http://bioconductor.org/packages/2.5/bioc/vignettes/Biostrings/inst/doc/Alignments.pdf

... in this pdf focus on the function: mismatchTable -- Creates a table for the mismatching positions

Seems to be what you want. Let us know if and how you got it to work!

/Stef

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 25 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 28 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 24 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 52 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

R mismatch position in pairwise alignment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News