SEQanswers

05-02-2010, 08:57 AM   #1
454andSolid
Junior Member
 
Location: USA

Join Date: May 2009
Posts: 8
gsAssembler behaving funny with PE data

Hi, I have a couple of questions about assembling 454 data with gsAssembler.

Some background: We are sequencing and assembling a eukaryotic genome using 454 Titanium data. First we ran 7 slides of shotgun sequencing, and later one slide of paired-end (PE) sequencing with 3 kb inserts. We have made several assemblies:

Assembly A: Here we used only the shotgun data. All parameters were left at their defaults, except that we used "-large". We got 114 Mbp in total: 7600 contigs with L50=52 Kb.

Assembly B: This time we used both the shotgun and PE data and assembled everything together, with the same parameters as in A. Now we got 110 Mbp in total: 9000 contigs with L50=35 Kb (and 1600 scaffolds with L50=160 Mbp).

Assembly C: Here we first made an assembly with just the shotgun data, then added the PE data and updated the assembly. This gave 114 Mbp in total: 7700 contigs with L50=53 Kb (and 1600 scaffolds with L50=160 Mbp).
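
For anyone comparing figures like these, here is a minimal sketch of how such a statistic is usually computed. It assumes "L50" above means the contig length at which half of the assembled bases are covered (most tools report this as N50). The function name `n50` and the toy lengths are made up for illustration and are not taken from gsAssembler.

```python
def n50(lengths):
    """Return the length L such that contigs of length >= L
    together cover at least half of the total assembled bases."""
    ordered = sorted(lengths, reverse=True)  # longest contigs first
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    raise ValueError("no contig lengths given")


# Made-up contig lengths, not data from the assemblies above.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))
```

By this definition the value can never exceed the total assembled length, so a scaffold figure of 160 Mbp against a total of 110 to 114 Mbp may be a unit slip (possibly Kbp was meant).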

Now the questions:

1. It's strange that we get less assembled sequence when we add the PE reads (assembly B vs assembly A). Does anybody have any possible explanation for this?

2. I would rather use assembly C than assembly B for the subsequent analysis since the contigs are longer. But I don't know which assembly to trust. Is there any way of knowing which of the two assemblies is more "correct"? (I'm thinking about computational things that don't involve any more sequencing...)
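
One computational check that needs no extra sequencing is to map the PE reads back to each assembly and see how often both ends land on the same contig at roughly the expected distance. The sketch below only does the counting step. The input format (tuples of contig name and position for each end), the function name `pair_consistency`, and the 50% tolerance around the 3 kb insert size are all assumptions for illustration; producing the mapped positions is left to whatever aligner is used.

```python
def pair_consistency(pairs, expected=3000, tolerance=0.5):
    """Count read pairs whose two ends map to the same contig, and the
    fraction of those whose separation is near the expected insert size.

    pairs: iterable of (contig_a, pos_a, contig_b, pos_b), hypothetical format.
    Returns (pairs_on_same_contig, fraction_consistent).
    """
    low = expected * (1 - tolerance)
    high = expected * (1 + tolerance)
    same = 0
    consistent = 0
    for contig_a, pos_a, contig_b, pos_b in pairs:
        if contig_a != contig_b:
            continue  # ends on different contigs say nothing about contig-level distance
        same += 1
        if low <= abs(pos_b - pos_a) <= high:
            consistent += 1
    fraction = consistent / same if same else 0.0
    return same, fraction


# Made-up mappings, not real data.
toy_pairs = [
    ("contig1", 100, "contig1", 3200),   # about 3.1 kb apart: consistent
    ("contig1", 500, "contig1", 9500),   # 9 kb apart: suspicious
    ("contig2", 40, "contig3", 700),     # different contigs: skipped
]
print(pair_consistency(toy_pairs))
```

Running this on the same PE reads mapped to assembly B and to assembly C would give two fractions to compare; a clearly lower fraction in one of them would hint at more misjoined contigs there, though it is not proof.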


Long post, I hope somebody has some input...

Thanks
/Jakub
05-02-2010, 02:45 PM   #2
RCJK
Senior Member
 
Location: Australia

Join Date: May 2009
Posts: 155

Hi there,

Roche recommends assembling shotgun and PE reads the way you did in C: first assemble the shotgun reads, then add the PE reads. From the software manual:

When planning to run an incremental assembly, it is best to first add the shotgun reads, and then add reads from Paired End libraries with increasing insert spans. This is because longer contigs can be assembled using the longer shotgun reads. These longer contigs then have a better chance of having both ends of paired end reads aligned within them. This in turn allows more robust library span estimates to be made. (Software v.2.3, October 2009)
05-03-2010, 04:08 AM   #3
454andSolid
Junior Member
 
Location: USA

Join Date: May 2009
Posts: 8

Thanks for your help! Guess I should get a new manual then. (I have one from October 2008, and I couldn't find the info there...)