SEQanswers

01-28-2010, 12:34 PM   #1
bioenvisage
Member
 
Location: it

Join Date: Oct 2009
Posts: 40
velvet

I am working on transcriptome data from an Illumina Genome Analyzer II. I would like to do a Velvet assembly of these short reads, but I see a lot of repeats and homopolymers such as AAAAAAAAA and TTTTTTTTTTTTTTT. Should I mask all the repeats before starting the de novo assembly with Velvet?
01-29-2010, 01:27 PM   #2
apratap
Member
 
Location: Bay Area

Join Date: Jan 2009
Posts: 58

Hi ..

I am copying a reply from the velvet mailing list which I feel fits in here.

=====

If the absolute error length (as in the length difference between the
actual polymer and the one measured) is longer than 3 bp and this error
has been observed more times than the chosen coverage cutoff then
the error is maintained open. This is a lot of ifs, so hopefully it does
not occur too often.

The reason for this conservative stance on the error length is to avoid
drastic merging of genuine homopolymers (which are quite frequent). I
have not found any method to relax this constraint without having all
sorts of repeats collapse into each other.
=====
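
To make the rule in the quoted reply concrete, here is a minimal Python sketch of the two conditions it describes. This is not Velvet's actual code: the function and parameter names are made up for illustration, and reading "maintained open" as "left unresolved rather than merged" is an interpretation. Only the 3 bp threshold and the comparison against the coverage cutoff come from the reply itself.

```python
def error_stays_open(actual_len, measured_len, times_observed,
                     coverage_cutoff, max_len_diff=3):
    """Illustrative only: True if a homopolymer length discrepancy
    would be left unresolved under the rule quoted above.

    All names here are hypothetical; max_len_diff=3 is the 3 bp
    threshold mentioned in the mailing list reply.
    """
    # Absolute difference between the real run length and the measured one.
    length_error = abs(actual_len - measured_len)
    # Both conditions must hold: the discrepancy is longer than 3 bp AND
    # it was seen more often than the chosen coverage cutoff.
    return length_error > max_len_diff and times_observed > coverage_cutoff


# A 5 bp discrepancy seen 6 times with a cutoff of 4: stays open.
print(error_stays_open(10, 15, 6, 4))
# A 2 bp discrepancy is within the threshold, whatever the coverage.
print(error_stays_open(10, 12, 6, 4))
```

Since both conditions have to be true at the same time, a short discrepancy or a rarely seen one should not trigger this case, which matches the remark that it hopefully does not occur too often.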

Best,
-Abhi