SEQanswers

Go Back   SEQanswers > Bioinformatics > Bioinformatics



Similar Threads
Thread Thread Starter Forum Replies Last Post
Should I try hybrid assembly with my PacBio data? MiniMicrobe Pacific Biosciences 1 12-09-2015 10:12 AM
PBcR hybrid assembly of 6.5MB genome with PacBio and MiSeq DrSpace Bioinformatics 9 06-11-2015 01:40 PM
Tool for viral metagenome assembly with extremely low coverage? Rammaria Metagenomics 1 05-11-2015 07:06 PM
hybrid assembly for PacBio and Illumina data laelaps Bioinformatics 1 05-01-2012 06:35 AM
Hybrid assembly of PacBio and Illumina allo Bioinformatics 3 05-01-2012 06:27 AM

Reply
 
Thread Tools
Old 12-19-2016, 11:38 PM   #1
dsher
Member
 
Location: Haifa, Israe;

Join Date: Feb 2013
Posts: 23
Default Hybrid assembly with low coverage PacBio libraries

Hi everyone,

We are working on sequencing the genomes of a few bacteria and microalgae. We recently had a PacBio run which provided pretty bad results (only 76 MB of data - ~15x coverage of our ~4.5MB genome, average read length ~5,800 bp) which could not be assembled into anything coherent using HGAP. Canu assembler provided what looked like better results (106-110 contigs, N50 ~23kb) but these still covered less than one-half of the expected genome size (sum of contig length was ~2.3 MB, and our genome is estimated at ~4.5MB).

Given that we are having issues with getting high-quality DNA for PacBio from some of our cultures, we were thinking of making an Illumina library to use these reads for a hybrid assembly. Does anyone have any experience with using such a hybrid method with low-coverage PacBio data? Specifically, are there any recommended tools or parameters suggested for such an assembly?

One point to keep in mind is that our microalgae cultures are not axenic so we expect a (low) level of contaminating DNA.

Any recommendations will be very helpful.

Thanks
Daniel
dsher is offline   Reply With Quote
Old 01-14-2017, 05:06 AM   #2
capricy
Senior Member
 
Location: 63130

Join Date: Apr 2012
Posts: 119
Default

Hi, Daniel,

I have similar question. My illumina reads assembly covered about 50% of the genome. Do you have some updated ideas to share?

Thanks.

C
capricy is offline   Reply With Quote
Old 01-14-2017, 06:03 AM   #3
dsher
Member
 
Location: Haifa, Israe;

Join Date: Feb 2013
Posts: 23
Default

Hi Capricy - no updated ideas yet... How do you know your Illumina assembly covers only 50% of the genome? What kind of libraries did you use, and what coverage did you get on the assembled parts?
Cheers
Daniel
dsher is offline   Reply With Quote
Old 01-14-2017, 07:28 AM   #4
capricy
Senior Member
 
Location: 63130

Join Date: Apr 2012
Posts: 119
Default

The data were from E coli. New isolates. Library is illumina miseq. Coverage was estimated from the assembly/reference alignment.
capricy is offline   Reply With Quote
Old 01-14-2017, 09:49 AM   #5
dsher
Member
 
Location: Haifa, Israe;

Join Date: Feb 2013
Posts: 23
Default

I don't know much about the genomics of different E. coli strains but isn't that coverage relatively low? If you just mapped the reads to one or more of the known E. coli genomes, what coverage would you get? That might help you see whether you have a very weird genome (e.g. very large genomic islands which do not map to your reference sequence) or whether your assembly isn't very good. With our larger cyanobacterial genomes I think the low coverage is because the PacBio assembly isn't very good.

BTW we recently published a comparative analysis of what you get when you close a genome compared to fragmented de-novo assemblies for a different heterotrophic bacterium (http://journal.frontiersin.org/artic...016.00248/full). This has some information on the number of sequences and coverage we got that might be worth comparing with your results.
dsher is offline   Reply With Quote
Reply

Tags
hybrid, illumina, low coverage, pacbio

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off




All times are GMT -8. The time now is 03:20 AM.


Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2017, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO