SEQanswers

Go Back   SEQanswers > Bioinformatics > Bioinformatics
Similar Threads
Thread Thread Starter Forum Replies Last Post
Using Mosaik to assemble bacterial genome 454 sequencing jpearl01 Bioinformatics 2 03-27-2013 06:29 AM
Can the upcoming Sandy Bridge i7 Extreme assemble a genome? ymc Bioinformatics 30 06-06-2012 06:38 AM
Assemble tools dicty Bioinformatics 8 02-23-2011 01:02 AM
cufflinks assemble syslm01 Bioinformatics 0 05-05-2010 03:43 AM
Who is the best way to align/assemble to a reference? anyone1985 Bioinformatics 3 04-30-2009 05:40 PM

Reply
 
Thread Tools
Old 01-28-2010, 12:53 AM   #1
bp2010
Junior Member
 
Location: Asia

Join Date: Jan 2010
Posts: 3
Question Do we still need to assemble a genome?

Hi,

This may sound like a naive question, but I have been trying to come up with answers for a couple days and haven't yet been able to. Thank you in advance for any input.

Now that we can detect human sequence variations (SNPs, indels, Structural Variants, etc) based on the set of paired-end reads, I wonder if there is still a need to assemble the original sequence. Wasn't the point to detect the variations?

And we don't need to know the assembled sequence for the new sequence anymore to gain its gene positions because its paired-end reads can be mapped back to the human reference genome, so we can learn the gene positions from there.

So aside from saving space (100 something GB vs 3GB) and time to analyze the data, do we really need to assemble any new human genome that has been resequenced?

Thank you!
bp2010 is offline   Reply With Quote
Old 01-28-2010, 01:04 AM   #2
Natalya
Junior Member
 
Location: China

Join Date: Jan 2010
Posts: 5
Default

We know Beijing, New York, Paris, London, Tokyo... , but we still need a World map.
Natalya is offline   Reply With Quote
Old 01-28-2010, 05:21 AM   #3
bp2010
Junior Member
 
Location: Asia

Join Date: Jan 2010
Posts: 3
Default

No no, please don't get me wrong here. Let me clarify a bit.

I understand we still need to sequence personalized genomes. However, the question is once we get the reads, do we need to assemble them?

My thinking is that the need to have a fully assembled sequence arose from the fact that we need to know how this particular sequence vary from the human reference genome (hg18, for example). But based on just the reads, we can use programs like the SOAP package to locate variations already. For everything else, it is supposed to be identical to the reference genome we use.

So why bother assembling, once we have the reads? Cannot we get all information we need from the reads alone?
bp2010 is offline   Reply With Quote
Old 01-28-2010, 05:28 AM   #4
Zigster
(Jeremy Leipzig)
 
Location: Philadelphia, PA

Join Date: May 2009
Posts: 116
Default

I would venture to say that most of the interest in assembly is in de novo assembly of novel organisms.

I believe the number of organisms that have been fully sequenced is still in the low hundreds.
__________________
--
Jeremy Leipzig
Bioinformatics Programmer
--
My blog
Twitter
Zigster is offline   Reply With Quote
Old 01-28-2010, 08:29 AM   #5
krobison
Senior Member
 
Location: Boston area

Join Date: Nov 2007
Posts: 747
Default

Take a look at the recent pan genome paper (not to be confused with the Pan genome paper :-). There may be significant portions of human genome which are not yet represented in any genome database because they are structural variants restricted to populations not yet sampled.

Full scale de novo sequencing may not always be necessary -- some sort of intelligent local reassembly / reassembly of everything that doesn't map followed by integration with that which does.
krobison is offline   Reply With Quote
Old 01-28-2010, 08:49 AM   #6
NextGenSeq
Senior Member
 
Location: USA

Join Date: Apr 2009
Posts: 482
Default

For reliable detection of variations you need fairly high coverage (at least 10X and more is better), thus you need to assemble the multiple reads to determine the coverage. Regions with low coverage give less certainty in whether a variation is real and high coverage gives more confidence (obviously).
NextGenSeq is offline   Reply With Quote
Old 01-28-2010, 06:23 PM   #7
bp2010
Junior Member
 
Location: Asia

Join Date: Jan 2010
Posts: 3
Default

I guess what you guys are trying to say here is that, to detect the variations specific to the individual whose genome is being sequenced, we have to assemble the reads anyway. Ok that I agree.

But do we have a need for the finished personalized human genome sequence? (Assuming that all the variations would have already been detected during the process of genome assembling.)

Thanks for any input.
bp2010 is offline   Reply With Quote
Reply

Tags
human genome assembler, sequence variation

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off



All times are GMT -8. The time now is 03:54 AM.


Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2022, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO