SEQanswers

Go Back   SEQanswers > Bioinformatics > Bioinformatics



Similar Threads
Thread Thread Starter Forum Replies Last Post
Genome assembly workflows for NGS elisacarli Bioinformatics 2 09-28-2015 11:32 PM
NGS assembly analysis anuragk Introductions 3 12-09-2014 11:30 AM
NGS assembly pavan Bioinformatics 8 09-03-2013 10:46 PM
Assembly files in new Assemblers Tools for NGS. aloliveira Bioinformatics 2 10-22-2010 04:27 AM

Reply
 
Thread Tools
Old 03-13-2018, 10:29 PM   #1
wanadou
Junior Member
 
Location: Tunis

Join Date: Jan 2018
Posts: 8
Default help for NGS and assembly

Hello

I think to do the whole genome sequencing of a halophyte plant (350 Mbp) using the Hiseq Pe150 which generate data with 30X coverage
1- this strategy could it be effective to make an de novo assembly in order to have a draft genome?
or
the sequencing using PacBio (1 cell SMRT) to generate sequences with more bigger size is it effective to make de novo assembly of the draft genome?
wanadou is offline   Reply With Quote
Old 03-14-2018, 02:38 AM   #2
nucacidhunter
Jafar Jabbari
 
Location: Melbourne

Join Date: Jan 2013
Posts: 1,172
Default

The genome size is relatively small so you will have better assembly if you sequence with PacBio but one SMRT cell will not be enough. You should consider 50x coverage (17.5 Gb) which will require 3-4 Sequel SMRT cells. You can also sequence 2 SMRT cells and then do a preliminary assembly and decide on further sequencing.

Illumina sequencing will result in lots of short contigs especially if the plant genome is composed of repeat regions.

I do not think any of these platforms will enable complete assembly but the results will be better if DNA is from a homozygous plant.
nucacidhunter is offline   Reply With Quote
Old 03-14-2018, 11:16 PM   #3
wanadou
Junior Member
 
Location: Tunis

Join Date: Jan 2018
Posts: 8
Default

Hello nucacidhunter,
Thanks for your reply
The cost of sequencing using PacBio is relatively high. the sequencing of 1 cell SMRT can give 5Gb. I think to do an preliminary assembly for a draft genome. In second time i will do sequencing using Illumina platform which provide 50X coverage. this second sequencing can be used to do hybrid assembly in ordre to have complete genome.
you think this strategy is feasible? or 10-15X coverage given by PacBio is insuffusant to do draft genome assembly
wanadou is offline   Reply With Quote
Old 03-15-2018, 01:26 AM   #4
nucacidhunter
Jafar Jabbari
 
Location: Melbourne

Join Date: Jan 2013
Posts: 1,172
Default

It depends on the genome. You might look at repeat content of a related species if it is available. I have seen Illumina based assembled contigs which were couple of hundred bases larger than the sequencing length.

I do not think either of these platforms alone or in combination will give complete assembly but I would recommend to do 2 SMRT cells and assemble then depending on results decide for more sequencing on one. For instance, if you are interested in particular genes or region covered by PacBio contigs then Illumina will be good to increase accuracy. Depending on your application an incomplete assembly might be enough.

PacBio is sensitive to carry over of contaminants from DNA extraction so you should try an extraction method that result in long fragments and is free from carbohydrates and phenolic compounds. Link for PacBio calculator:
https://www.pacb.com/cn/calculator-w...me-sequencing/
nucacidhunter is offline   Reply With Quote
Old 04-16-2018, 06:10 AM   #5
colindaven
Senior Member
 
Location: Germany

Join Date: Oct 2008
Posts: 401
Default

Its' very tricky to analyse 10X pacbio data and do a de novo assembly. I have had very bad experiences in the past with this setup, getting poor N50s and putting lots of effort in. 30x+ is far, far better since you need a lot of the longest reads to span gaps AND self correct reads for a decent assembly .

There were a few tools for low coverage pacbio hybrid assembly, DBG2OLC for example
colindaven is offline   Reply With Quote
Old 04-16-2018, 08:13 AM   #6
Markiyan
Senior Member
 
Location: Cambridge

Join Date: Sep 2010
Posts: 115
Lightbulb Use Illumina 2x250 or 2x300, not 2x150 in de novo!

Please use Illumina 2x250 or 2x300 in any de novo appication! (in addition to the pacbio).

It means 1-2 runs (2x250 or 2x300) using MiSeq or 1/2 of the HiSeq 2500 RR 2x250.

Usually it gives you significantly better assemblies than 2x150 or 2x100 bp runs.

PS: Make sure a PCR-free library prep is used.
Markiyan is offline   Reply With Quote
Old 04-17-2018, 04:37 PM   #7
gringer
David Eccles (gringer)
 
Location: Wellington, New Zealand

Join Date: May 2011
Posts: 821
Default

If you want cheap, [MinION](https://store.nanoporetech.com/basic.html) is always available (starter discount is $1000 for two flow cells and a rapid kit). The consensus assemblies are less accurate than PacBio, but if you're doing Illumina as well for local assembly correction (which I also recommend) that probably won't matter as much.
gringer is offline   Reply With Quote
Old 04-20-2018, 04:16 PM   #8
luc
Senior Member
 
Location: US

Join Date: Dec 2010
Posts: 337
Default

The sequencer is certainly cheap, but to my knowledge not the reagents and additional flowcells. PacBio might be more affordable?

Quote:
Originally Posted by gringer View Post
If you want cheap, [MinION](https://store.nanoporetech.com/basic.html) is always available (starter discount is $1000 for two flow cells and a rapid kit). The consensus assemblies are less accurate than PacBio, but if you're doing Illumina as well for local assembly correction (which I also recommend) that probably won't matter as much.
luc is offline   Reply With Quote
Old 04-20-2018, 04:25 PM   #9
gringer
David Eccles (gringer)
 
Location: Wellington, New Zealand

Join Date: May 2011
Posts: 821
Default

As far as I know, ONT is the only sequencing machine supplier that publishes all their prices on their store website; you can have a look and see for yourself!

At the moment marginal cost per base for the MinION is approximately equal to PacBio Sequel or Illumina MiSeq, with longer reads, a shorter sample preparation time, and no net capital expense for the sequencer. Flow cells are cheaper if ordered in bulk, and those who want the maximum affordability (and have lots of money to throw around) can purchase a PromethION.

Reminder: all those costs are on the ONT store website.
gringer is offline   Reply With Quote
Old 04-22-2018, 02:46 AM   #10
soungalo
Junior Member
 
Location: Israel

Join Date: Apr 2018
Posts: 3
Default Wrong way, sir

You won't be able to get a de novo assembly this way.
Regarding the Illumina strategy - to get a decent assembly you'll have to:
a. Increase coverage (50-60X should be OK)
b. Use longer reads (250bp), preferably use an insert size allowing for merging paired end reads.
c. Use various insert size libraries (for example by applying mate-pair technology).

As for PacBio - except for the simplest genomes, pure PacBio data cannot produce a decent assembly. Hybrid approaches are nice, but you'll still need lots of Illumina data, which means considerable costs.
soungalo is offline   Reply With Quote
Old 04-22-2018, 03:00 AM   #11
gringer
David Eccles (gringer)
 
Location: Wellington, New Zealand

Join Date: May 2011
Posts: 821
Default

You won't be able to get a good de-novo assembly from Illumina data alone, and longer SBS reads won't fix that. Consider things like 28S rRNA gene regions, where kilobase-length regions are repeated many times in tandem; those will never be properly assembled from just Illumina data.

A hybrid approaches with reasonable long-read coverage (e.g. 40-100X) for generating assembly scaffolds and low short-read coverage (e.g. 40X) for local correction should produce somewhat contiguous sequences (e.g. N50 > 100 kb) that have a high accuracy. Contiguity can be improved with more long-range techniques like Hi-C and optical mapping.

Single-technology sequencing is (at the moment) a false economy. The additional money spent on hybrid sequencing approaches is more than made up for by the reduced pain on the bioinformatics end of things.
gringer is offline   Reply With Quote
Old 04-22-2018, 03:14 AM   #12
nucacidhunter
Jafar Jabbari
 
Location: Melbourne

Join Date: Jan 2013
Posts: 1,172
Default

Quote:
Originally Posted by gringer View Post
As far as I know, ONT is the only sequencing machine supplier that publishes all their prices on their store website; you can have a look and see for yourself!
Reminder: all those costs are on the ONT store website.
This is because ONT sells their consumables directly but other companies such as PacBio sell them through a local distributor that have different margin and shipping costs depending on global location.

A 50-80x coverage with PacBio alone depending on genome heterogeneity results in very good assembly.
nucacidhunter is offline   Reply With Quote
Old 04-22-2018, 04:25 AM   #13
gringer
David Eccles (gringer)
 
Location: Wellington, New Zealand

Join Date: May 2011
Posts: 821
Default

Quote:
other companies such as PacBio sell them through a local distributor that have different margin and shipping costs depending on global location.
That's not the only reason. Sigma also has region-specific pricing. Without logging in, I choose the location when I first visit the site (presumably stored as a cookie for subsequent visits), and their website gives me prices for items.
gringer is offline   Reply With Quote
Old 04-22-2018, 02:16 PM   #14
luc
Senior Member
 
Location: US

Join Date: Dec 2010
Posts: 337
Default

Final costs depend on reagent costs and yields (and reliability). Results for both systems will depend a lot on DNA sample quality. At current yields the Sequel runs will yield more data/dollar in our hands, if one is not ordering a pack of 48 or 300 nanopore flowcells. The average nanopore read lengths will be longer, but results/flowcells seem much more variable.

Quote:
Originally Posted by gringer View Post
As far as I know, ONT is the only sequencing machine supplier that publishes all their prices on their store website; you can have a look and see for yourself!

At the moment marginal cost per base for the MinION is approximately equal to PacBio Sequel or Illumina MiSeq, with longer reads, a shorter sample preparation time, and no net capital expense for the sequencer. Flow cells are cheaper if ordered in bulk, and those who want the maximum affordability (and have lots of money to throw around) can purchase a PromethION.

Reminder: all those costs are on the ONT store website.
luc is offline   Reply With Quote
Reply

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off




All times are GMT -8. The time now is 08:52 AM.


Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2018, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO