![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
[Velvet,assembly] core dumped occured by runnning velvet | matador3000 | De novo discovery | 0 | 12-17-2011 08:31 AM |
Velvet Assembler: expected coverage versus estimated coverage versus effective covera | DMCH | Bioinformatics | 1 | 11-30-2011 05:21 AM |
Velvet assembler | bioinf | Bioinformatics | 31 | 08-24-2011 10:19 AM |
velvet's output: Roadmaps | shuang | Bioinformatics | 1 | 08-18-2011 02:43 PM |
velvet assembler contigs into gbrowse | Zimbobo | Bioinformatics | 0 | 04-15-2010 02:36 PM |
![]() |
|
Thread Tools |
![]() |
#1 |
Member
Location: Germany Join Date: Nov 2010
Posts: 25
|
![]()
Hello. Velvet firstly hashes all the reads, creating s.c "roadmaps". Could someone explain how exactly that helps in the construction of de Bruijn graph? Thank you.
|
![]() |
![]() |
![]() |
#2 |
Senior Member
Location: Birmingham, UK Join Date: Jul 2009
Posts: 356
|
![]()
Your question is probably better directed to the velvet-users list. Daniel Zerbino recently posted this answer to a similar question:
Also, have you read the Velvet paper and manual? http://genome.cshlp.org/content/18/5/821 http://www.ebi.ac.uk/~zerbino/velvet/Manual.pdf Code:
The Roadmap file is not normally meant to be parsed by the user, it is simply internal to velvet. However, if you really want to know the format is: ROADMAP $query_id $target_id $prev_kmers $target_start $target_finish etc. Where: $target_id is negative if on reverse strand [ $target_start, $target_finish [ is the domain of the target sequence being aligned (note that it is inclusive at the start and exclusive at the finish) $prev_kmers is the number of unaligned k-mers in the query sequence before the alignment to the target (note in the examples below how this value does not necessarily change from alignment to alignment, this means that they are contiguous on the query.) |
![]() |
![]() |
![]() |
#3 | |
Member
Location: Germany Join Date: Nov 2010
Posts: 25
|
![]()
I've read those papers and searched/thought a lot on that. Still no idea how it works exactly. Only assumptions. I need it for a presentation. Hashing k-mers seems to have no sense for winning the time if some regions in two reads overlap in k-1 chars. Only in cases when you have at least k length identical sequences in two or more reads.
Example: Quote:
PS. Thx for the tip, just found the list and wrote to them. Last edited by bioinf; 01-11-2011 at 04:59 AM. |
|
![]() |
![]() |
![]() |
#4 |
Senior Member
Location: Birmingham, UK Join Date: Jul 2009
Posts: 356
|
![]() |
![]() |
![]() |
![]() |
#5 | |
Member
Location: Germany Join Date: Nov 2010
Posts: 25
|
![]()
Daniel was very kind to write to me personally. What he wrote is:
Quote:
![]() |
|
![]() |
![]() |
![]() |
#6 | ||
Senior Member
Location: NL, Leiden Join Date: Feb 2010
Posts: 245
|
![]() Quote:
Where does it say you need k identical chars? You need k-1 identical overlap.. The example he shows means that a read needs to have a read containing CCTCT before it is picked up. Both your reads in the example (ACCTCGAT and GCTCTAGG) do not contain CCTCT. |
||
![]() |
![]() |
![]() |
#7 |
Member
Location: Germany Join Date: Nov 2010
Posts: 25
|
![]()
Ok, I see now. If the second read has CCTCT inside of it, it means it has two k-mers there CCTC and CTCT, which leads to a situation where I have the same CCTC k-mer in both reads. I guess what he implicitly meant was that you actually have to have identical k-mers in two reads in order to draw an arc from that X k-mer of the first read to a Y k-mer of the second read following right after that identical X k-mer of the very second read.
It is then in some sense related to the transitivity rule. There is X in read 1 There is X in read 2 There is Y in read 2 connected to X in read 2 ----- Conclusion: draw arc from X of read 1 to Y of 2; thus you clearly see the connection between read 1 and read 2 Am I warmer? Last edited by bioinf; 01-12-2011 at 02:15 AM. |
![]() |
![]() |
![]() |
#8 |
Senior Member
Location: NL, Leiden Join Date: Feb 2010
Posts: 245
|
![]()
Yes, i think that is it. However, since you have already had contact with Daniel Zerbino, you could better ask him, because he is the one who developed Velvet and knows it better than anyone here on this forum
![]() |
![]() |
![]() |
![]() |
#9 | |||||
Member
Location: Germany Join Date: Nov 2010
Posts: 25
|
![]() Quote:
I tried to make a small demo of how velvet works from the command line. Here is my fasta file: Quote:
Quote:
Here is the output: Quote:
Quote:
Last edited by bioinf; 01-12-2011 at 02:15 AM. |
|||||
![]() |
![]() |
![]() |
#10 |
Senior Member
Location: NL, Leiden Join Date: Feb 2010
Posts: 245
|
![]()
Probably has something to do with your coverage
![]() Try to increase the size of your first read and double them for increased coverage; >SEQUENCE_0 ACCTAGAACGT >SEQUENCE_1 ACCTAGAACGT >SEQUENCE_2 CCTAGAACGTT >SEQUENCE_3 CCTAGAACGTT This should work... |
![]() |
![]() |
![]() |
#11 |
Member
Location: Germany Join Date: Nov 2010
Posts: 25
|
![]()
Yes, worked for me. Thx!
|
![]() |
![]() |
![]() |
#12 |
Member
Location: helsinki Join Date: Oct 2010
Posts: 14
|
![]()
Hi all,
I am running denovo assembly with velvet color space build and when i run velveth_de, it terminates halfway before it writes the ./Roadmap file. I have no idea what the problem is and i am running this on a cluster with 12 cores and 2400mb memory. Version is 1.1.04. this is what i see on the terminal: [2468.477166] 73728280 sequences found [2468.477184] Done [2468.477501] Reading FastA file /v/users/okeke/reads/547_3H_doubleEncoded_input.de; [2641.750344] 103910882 sequences found [2641.750359] Done [2641.750610] Reading FastA file /v/users/okeke/reads/547_1D_doubleEncoded_input.de; [2765.845768] 79934018 sequences found [2765.845785] Done [2765.846042] Reading FastA file /v/users/okeke/reads/547_4D_doubleEncoded_input.de; [3176.325687] 251833816 sequences found [3176.325703] Done [3176.434276] Reading read set file pine_denovo/Sequences; Killed 2804.848u 794.606s 1:04:54.36 92.4% 0+0k 0+0io 30pf+0w Thanks for your help in advance. Last edited by urchgene; 06-21-2011 at 06:08 AM. |
![]() |
![]() |
![]() |
#13 |
Member
Location: Russia, Irkutsk Join Date: Feb 2011
Posts: 40
|
![]()
Same stuff as in urchgene's post observed with velvet 1.2.08.
Seems like a problem is just a lack of RAM. I moved to cluster with more of it and everything works fine. Would be nice if velvet could write normal crash logs, though. Last edited by A_Morozov; 02-12-2013 at 09:39 PM. |
![]() |
![]() |
![]() |
Thread Tools | |
|
|