SEQanswers

05-10-2011, 05:09 AM   #1
raonyguimaraes
Member
 
Location: Belo Horizonte - Brazil

Join Date: Jun 2010
Posts: 38
De Novo assembly of a plant transcriptome

Hello all,

I work in bioinformatics, and our group has been given the task of assembling a plant transcriptome from 454 reads.

The genome has not been sequenced yet, so we are now defining our strategy for this project.

The first problem concerns the assembly itself.

We ran both Newbler and MIRA, and we now lack metrics for comparing the two assemblers and deciding which one is better and why.

After the assembly, I think the obvious next step is to align the transcriptome against the Arabidopsis genome, and maybe use that alignment to decide which assembly is better and why.

Another question is which aligner we should use for this.

Does anyone have suggestions or experience to share on a project like this?

Thanks for your help.

Last edited by raonyguimaraes; 05-10-2011 at 10:37 AM.
05-10-2011, 06:16 AM   #2
circlez
Junior Member
 
Location: London

Join Date: Apr 2011
Posts: 1

Regarding the comparison of assemblers, here is a paper that might help:
Comparing de novo assemblers for 454 transcriptome data
http://www.ncbi.nlm.nih.gov/pubmed/20950480
I used Newbler 2.5 to assemble my data, and the result looks quite good.
05-10-2011, 10:32 AM   #3
raonyguimaraes
Member
 
Location: Belo Horizonte - Brazil

Join Date: Jun 2010
Posts: 38

Thanks a lot!

I also found this website, http://www.plantagora.org/, which lists a lot of metrics I can use to evaluate my assemblies.

Check it out:
http://www.plantagora.org/metrics/
http://www.plantagora.org/Plantagora...valuation.html

Quote:
The following metrics were gathered as part of this project: total number of contigs, contig N50, total contig length, average contig length, largest contig length, contigs > 1kb, contigs > 5kb, number of scaffolds, total scaffold length, average scaffold length, largest scaffold length, and scaffold N50.
looks promising...
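
The contig-level metrics in that list are simple to compute from the contig lengths alone. Here is a minimal sketch in Python. The function names are made up for illustration (they are not from Plantagora), and it assumes the assembly is a plain FASTA file read line by line. N50 is taken here as the length of the contig at which the running sum of lengths, sorted longest first, reaches half of the total.

```python
def fasta_lengths(lines):
    """Return the sequence length of each record in an iterable of FASTA lines."""
    lengths = []
    current = None  # length of the record being read, None before the first header
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if current is not None:
                lengths.append(current)
            current = 0
        elif line and current is not None:
            current += len(line)
    if current is not None:
        lengths.append(current)
    return lengths


def n50(lengths):
    """Length of the contig at which the cumulative sum (longest first)
    reaches at least half of the total assembly length."""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    return 0


def assembly_metrics(lengths):
    """Collect the contig-level metrics named in the quoted list."""
    total = sum(lengths)
    return {
        "num_contigs": len(lengths),
        "total_length": total,
        "average_length": total / len(lengths) if lengths else 0,
        "largest_contig": max(lengths, default=0),
        "contigs_over_1kb": sum(1 for n in lengths if n > 1000),
        "contigs_over_5kb": sum(1 for n in lengths if n > 5000),
        "n50": n50(lengths),
    }
```

Running `assembly_metrics(fasta_lengths(open(path)))` on the Newbler contigs and on the MIRA contigs would give two tables to compare side by side. The scaffold metrics in the list would need scaffold sequences instead of contigs.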

Last edited by raonyguimaraes; 05-10-2011 at 10:34 AM.
05-16-2011, 09:07 AM   #4
MG1655
Junior Member
 
Location: US

Join Date: May 2011
Posts: 5

Hi raon,
I've been doing de novo plant transcriptome assembly as well (although with Illumina reads) and have had good results with Trinity: http://trinityrnaseq.sourceforge.net/

Also, PlantGDB has a lot of resources (such as a download of all known plant protein sequences):

http://www.plantgdb.org/download/download.php/
05-17-2011, 12:02 AM   #5
Jenzo
Member
 
Location: Bad Nauheim, Germany

Join Date: Feb 2011
Posts: 31

MG1655, may I ask if your data was from a normalized or non-normalized cDNA library?
Thanks!
05-17-2011, 09:09 AM   #6
MG1655
Junior Member
 
Location: US

Join Date: May 2011
Posts: 5

Quote:
Originally Posted by Jenzo
MG1655, may I ask if your data was from a normalized or non-normalized cDNA library?
Thanks!
Hi Jenzo, our cDNA was non-normalized because we also wanted to look at differential expression.
06-02-2011, 07:42 PM   #7
jmartin127
Member
 
Location: San Francisco

Join Date: Mar 2010
Posts: 15

Aligning your de novo assembled transcriptome to a genomic reference can help determine which assembly is better. Presumably, the plant you are sequencing is closely related to Arabidopsis?

Regardless, BLAT (http://genome.ucsc.edu/cgi-bin/hgBlat?command=start) is a good tool for splice-aligning assembled transcripts to a genome.
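
If you run the standalone BLAT program rather than the web form, its default output is PSL, a tab-separated format with 21 columns per alignment. Below is a rough sketch in Python of one way to summarize such a file per assembly: for each contig, take the best fraction of its length that aligned, then count how many contigs pass a cutoff. The function names and the 80% cutoff are assumptions for illustration, not an established standard.

```python
def best_query_coverage(psl_lines):
    """Map each query (contig) name to the best aligned fraction of its length.

    PSL columns used: 0 matches, 1 misMatches, 2 repMatches,
    9 qName, 10 qSize.
    """
    best = {}
    for line in psl_lines:
        fields = line.rstrip("\n").split("\t")
        # Skip the PSL header and anything that is not a 21-column record.
        if len(fields) < 21 or not fields[0].isdigit():
            continue
        aligned = int(fields[0]) + int(fields[1]) + int(fields[2])
        q_name, q_size = fields[9], int(fields[10])
        coverage = aligned / q_size
        best[q_name] = max(best.get(q_name, 0.0), coverage)
    return best


def fraction_aligned(best, total_contigs, min_coverage=0.8):
    """Fraction of all contigs whose best alignment covers >= min_coverage."""
    if total_contigs == 0:
        return 0.0
    passing = sum(1 for cov in best.values() if cov >= min_coverage)
    return passing / total_contigs
```

Computing `fraction_aligned` for each assembly against the same reference gives one number per assembler to compare. With a distant reference, expect the absolute values to be low; the comparison between assemblies is the informative part.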

Quote:
Originally Posted by raonyguimaraes
Hello all,

After the assembly, I think the obvious next step is to align the transcriptome against the Arabidopsis genome, and maybe use that alignment to decide which assembly is better and why.

Another question would be about which aligner we should use for this.
07-05-2011, 01:17 PM   #8
dilaraally
Junior Member
 
Location: San Diego

Join Date: Sep 2010
Posts: 2
de novo assembled transcriptome to a genomic reference

I am also doing a de novo assembly of a plant transcriptome, using ABySS and CLC Bio. One thing I did after assembling the transcriptome was to run a legacy BLAST search on the contigs. Not all contigs will necessarily be plant; some could come from fungi, bacteria, or viruses. If you BLAST only against Arabidopsis, a contig from a highly conserved gene can still get a hit there even though its actual top hit is a bacterium rather than a plant, so BLASTing only against Arabidopsis does not let you screen out contamination.

My pipeline is the following:

blastn against the entire NCBI database (run locally)
|
remove non-plant contigs: keep those whose top 10 hits were to plants and those that had no hit
|
blastx
|
annotate the final set of contigs using the closest hit in the BLAST search
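
The filtering step in the middle of that pipeline could be sketched in Python roughly as follows. This is only an illustration under stated assumptions: the hits for each contig are assumed to have already been reduced to a ranked list of kingdom labels (how you map BLAST hits to taxonomy is not shown), "top 10 hits to plants" is read as "all of the top 10 hits are plants", and the function names and the `"plant"` label are hypothetical.

```python
def keep_contig(hit_kingdoms, top_n=10, wanted="plant"):
    """Decide whether a contig survives the contamination filter.

    hit_kingdoms: kingdom labels of the contig's BLAST hits, best hit first.
    """
    if not hit_kingdoms:
        return True  # no hit at all: kept, as in the pipeline
    # Assumption: every one of the top_n hits must be a plant.
    return all(kingdom == wanted for kingdom in hit_kingdoms[:top_n])


def filter_contigs(contig_ids, hits_by_contig, top_n=10):
    """Return the contig IDs to carry forward to the blastx step."""
    return [
        contig for contig in contig_ids
        if keep_contig(hits_by_contig.get(contig, []), top_n)
    ]
```

For example, with `hits_by_contig = {"c1": ["plant"] * 10, "c2": ["bacteria", "plant"]}` and contigs `["c1", "c2", "c3"]`, the filter keeps `c1` (all plant hits) and `c3` (no hit) and drops `c2`. A looser rule, such as requiring only the top hit to be a plant, is a one-line change in `keep_contig`.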