SEQanswers

Go Back   SEQanswers > Bioinformatics > Bioinformatics



Similar Threads
Thread Thread Starter Forum Replies Last Post
TGICL for denovo transcriptome gfmgfm Bioinformatics 16 01-24-2013 04:59 PM
[Velvet,assembly] core dumped occured by runnning velvet matador3000 De novo discovery 0 12-17-2011 08:31 AM
Denovo before and after an alignment scami Bioinformatics 0 05-13-2011 06:11 AM
Need help on soap denovo sundar De novo discovery 4 11-29-2010 04:03 AM
velvet denovo bioenvisage De novo discovery 2 02-11-2010 08:35 AM

Reply
 
Thread Tools
Old 08-24-2010, 08:50 AM   #1
bair
Member
 
Location: London

Join Date: Jan 2010
Posts: 65
Default Velvet denovo

Dear all,

We have an alignment file in bam format including three type of reads with different length ( pair-end and mate-pair), we want to do denovo assembly for the novel sequence by using velvet.

We got three fastq files with unmapped reads from the bam file by using picard ViewSam.jar.

How to input these reads to velvet? Since the reads in fastq file are not in proper order, e.g no pair-end relation, can we treat these reads as singletons?

Can we use velvet to handle the three fastq files together?

Many thanks!
bair is offline   Reply With Quote
Old 08-25-2010, 12:24 AM   #2
francesco.vezzi
Member
 
Location: Udine (Italy)

Join Date: Jan 2009
Posts: 50
Default

You are free to treat the reads as they are singletons but this will probably lead to a bad assembly with a lot of short contigs. If you have a way to keep the paired read information (with the SAM file it must be possible) this will help a lot the assembly phase.

As far as the last question on the on line manual you will find an example to how give to velvet more then one file in input

Francesco
francesco.vezzi is offline   Reply With Quote
Old 08-25-2010, 01:21 AM   #3
bair
Member
 
Location: London

Join Date: Jan 2010
Posts: 65
Default

Thank you Francesco,

So pair-end information will lead to better assembly, I need to extract this from the sam file.
bair is offline   Reply With Quote
Old 08-25-2010, 03:50 AM   #4
maubp
Peter (Biopython etc)
 
Location: Dundee, Scotland, UK

Join Date: Jul 2009
Posts: 1,543
Default

Sorting the SAM/BAM file by name may help - or you could do the sorting after converting to FASTQ. See also the thread "BAM to fastq how?" on the samtools-help mailing list this August:

http://sourceforge.net/mailarchive/f...=samtools-help
maubp is offline   Reply With Quote
Old 08-26-2010, 01:10 AM   #5
Torst
Senior Member
 
Location: The University of Melbourne, AUSTRALIA

Join Date: Apr 2008
Posts: 275
Default

Quote:
Originally Posted by bair View Post
We have an alignment file in bam format including three type of reads with different length ( pair-end and mate-pair), we want to do denovo assembly for the novel sequence by using velvet.We got three fastq files with unmapped reads from the bam file by using picard ViewSam.jar.
So you only want to de novo the unmapped reads to find novel segments?

Quote:
How to input these reads to velvet? Since the reads in fastq file are not in proper order, e.g no pair-end relation, can we treat these reads as singletons?
You should write a script to split your unmapped reads into pairs & singletons, and do this for each of the three libraries.
eg. lib1p.fa lib1s.fa lib2p.fa lib2s.fa lib3p.fa lib3s.fa

Quote:
Can we use velvet to handle the three fastq files together?
Yes, but you may need to re-compile velvet to support CATEGORIES=4. Here is a suitable partial command line:

% velveth -shortPaired lib1p.fa -shortPaired2 lib2p.fa -shortPaired3 lib3p.fa -short4 lib1s.fa lib2s.fa lib3s.fa
% velvetg -exp_cov auto -cov_cutoff auto
Torst is offline   Reply With Quote
Reply

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off




All times are GMT -8. The time now is 10:46 AM.


Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2020, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO