**NeilMcCauley** · 09-14-2011, 06:44 AM

I am also very interested in this comparison, or in any experience people have had with either tool. Accuracy is what matters to me; computational intensity and RAM requirements are not a problem since we have enough computational resources.

**Thorondor** · 09-15-2011, 12:47 AM

I assembled a eukaryotic transcriptome with both Oases and Trinity. As far as I checked, the results are highly similar, and it is not easy to say which assembly is strictly better. Trinity produces more, shorter transcripts because it does not use scaffolding (e.g. inserting Ns into the sequences like Oases does), and at least the version I used only supported a k-mer size of 25 (at the moment their website still states that this is the only possible value if you want to run Inchworm, Chrysalis and Butterfly).
Trinity also predicts fewer splice variants, at least with the default edge-thr value. Some highly similar genes seem to be resolved by Oases with higher k-mers but not by Trinity, but Trinity also assembles some transcripts better. :-/

So, as always, there is no clearly better assembly. A de novo assembly with Oases and Trinity of something for which a good reference is available would maybe yield some clues. ;-)

**NeilMcCauley** · 09-16-2011, 03:31 AM

Thanks for your comprehensive answer! What kind of sequence data did you use (Illumina or 454)?
I have raw 454 sequence data and I was wondering whether Oases can produce good assemblies with 454 data. According to the paper "Comparing de novo assemblers for 454 transcriptome data", short-read assemblers are not very suitable for 454 data since they require high and even coverage depth. Indeed, I did some preliminary tries with Oases and it gave me very short contigs.

Now I'm trying Trinity.

**Thorondor** · 09-19-2011, 05:50 AM

I am working with Illumina reads (PE 100bp).

Well, the default minimum k-mer coverage is 1 for Trinity and 3 for Oases, so maybe the results will be better with Trinity. But of course a lot depends on your expected coverage.
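
To make the coverage point concrete, here is a toy Python sketch of how a minimum k-mer coverage threshold changes what survives from low-coverage data. It is only an illustration: the reads, the k value of 5, and the function names are made up, it counts k-mers on one strand only, and it is not how Trinity or Oases actually apply their cutoffs.

```python
from collections import Counter


def count_kmers(reads, k):
    """Count every k-mer occurrence across all reads (forward strand only)."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def filter_by_min_coverage(counts, min_cov):
    """Keep only k-mers seen at least min_cov times."""
    return {kmer: n for kmer, n in counts.items() if n >= min_cov}


# Hypothetical reads: three overlap heavily, one comes from a rarely sampled region.
reads = ["ACGTACGT", "CGTACGTA", "ACGTACGT", "TTTTGGGG"]
counts = count_kmers(reads, k=5)

kept_min1 = filter_by_min_coverage(counts, 1)  # threshold of 1 keeps everything
kept_min3 = filter_by_min_coverage(counts, 3)  # threshold of 3 drops singletons

print(len(kept_min1), "k-mers kept with min coverage 1")
print(len(kept_min3), "k-mers kept with min coverage 3")
```

With a threshold of 3, every k-mer from the singly sampled read is discarded, which is the kind of loss that could matter for uneven or shallow coverage.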

**hiddenrisk** · 10-13-2011, 07:21 AM

Trinity questions...

Since we are talking about Trinity here, does anyone know the answers to the following questions:

1) What, explicitly, is a "full-length transcript"? According to the second sentence of their paper, it is a "...complete and contiguous mRNA sequence from the transcription start site to the transcription end...". However, I was wondering how they were able to demonstrate this. It seems to me entirely possible that, with a single k-mer size of 25, it might collapse predicted transcripts where there are repeat regions, and though it returns a contiguous piece from start to stop codon, it might not really be a complete transcript.

2) Are these not predicted transcripts? They don't refer to them as such, but at least for the whitefly stuff I didn't see any biological verification of the predicted transcript sequences....

3) What organism(s) do the "all reference protein-coding sequences that are reconstructable to full length given the read set" (p 647, left column, 3rd sentence under the header "Sensitivity limit for full-length reconstruction") come from? It sounded to me, from the paper, that they used Schizosaccharomyces pombe as their reference organism... if this is true, and this is the Oracle set with which they determined the working parameters of their program, didn't they basically optimize their program to run best with Illumina reads from fission yeast?

**enkia** · 03-26-2012, 07:10 AM

I thought I would revive this thread to see if anyone has any more recent input on the comparison between Velvet and Trinity.

I am working with an RNA data set that presumably has a mixture of viruses present in it and am looking to assemble them. For one of the viruses, I have a reference genome to assemble against; the others are novel and will need to be assembled de novo.

Another thought is whether either of these programs is better able to handle contigs with very different copy numbers. Based on some preliminary dsRNA sequencing, one virus is present at about 25-fold higher levels than the others.

**ians** · 03-29-2012, 07:12 AM

Since the main discussion, Oases-M was released. The authors did a direct comparison of Oases, Trinity, Trans-ABySS, and Cufflinks.

**schalivendra** · 10-26-2013, 11:56 AM

Hi,

I am using Oases to assemble a eukaryote transcriptome from Illumina reads using different k-mer values. However, I am getting the same stats (abyss-fac output: N50, min, max, median, total number of contigs) for all the k-mers tested. This is the script I am using:
velveth transcripts_kxx xx -short -fastq Inputfile1.fastq Inputfile2.fastq Inputfile3.fastq Inputfile4.fastq

velvetg transcripts_kxx -read_trkg yes

oases transcripts_kxx

I would appreciate knowing whether there is anything wrong with the script.

thank you very much,
Subbaiah Chalivendra
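
For readers following along, the three-step script above can be sketched as a small Python helper that builds the same velveth / velvetg / oases command lines for several k values, each with its own output directory. The k values (21, 25, 29), the `transcripts_k` prefix, and the function name are assumptions for illustration; the sketch only prints the commands and does not run them, and it is not a diagnosis of why the stats came out identical.

```python
import shlex


def build_oases_commands(k, fastq_files, prefix="transcripts_k"):
    """Return the three command lines for one k, using a per-k output directory."""
    out_dir = f"{prefix}{k}"  # e.g. transcripts_k25, distinct for each k
    return [
        ["velveth", out_dir, str(k), "-short", "-fastq", *fastq_files],
        ["velvetg", out_dir, "-read_trkg", "yes"],
        ["oases", out_dir],
    ]


# File names taken from the post; k values are hypothetical examples (odd numbers).
fastq_files = [
    "Inputfile1.fastq",
    "Inputfile2.fastq",
    "Inputfile3.fastq",
    "Inputfile4.fastq",
]

for k in (21, 25, 29):
    for cmd in build_oases_commands(k, fastq_files):
        print(shlex.join(cmd))
```

Printing the commands first makes it easy to check that each k really points at a different directory before comparing the per-directory stats.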


de novo assembly using Trinity versus Velvet-Oases
