![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Using Mosaik to assemble bacterial genome 454 sequencing | jpearl01 | Bioinformatics | 2 | 03-27-2013 06:29 AM |
Can the upcoming Sandy Bridge i7 Extreme assemble a genome? | ymc | Bioinformatics | 30 | 06-06-2012 06:38 AM |
Assemble tools | dicty | Bioinformatics | 8 | 02-23-2011 01:02 AM |
cufflinks assemble | syslm01 | Bioinformatics | 0 | 05-05-2010 03:43 AM |
Who is the best way to align/assemble to a reference? | anyone1985 | Bioinformatics | 3 | 04-30-2009 05:40 PM |
![]() |
|
Thread Tools |
![]() |
#1 |
Junior Member
Location: Asia Join Date: Jan 2010
Posts: 3
|
![]()
Hi,
This may sound like a naive question, but I have been trying to come up with answers for a couple days and haven't yet been able to. Thank you in advance for any input. Now that we can detect human sequence variations (SNPs, indels, Structural Variants, etc) based on the set of paired-end reads, I wonder if there is still a need to assemble the original sequence. Wasn't the point to detect the variations? And we don't need to know the assembled sequence for the new sequence anymore to gain its gene positions because its paired-end reads can be mapped back to the human reference genome, so we can learn the gene positions from there. So aside from saving space (100 something GB vs 3GB) and time to analyze the data, do we really need to assemble any new human genome that has been resequenced? Thank you! |
![]() |
![]() |
![]() |
#2 |
Junior Member
Location: China Join Date: Jan 2010
Posts: 5
|
![]()
We know Beijing, New York, Paris, London, Tokyo... , but we still need a World map.
|
![]() |
![]() |
![]() |
#3 |
Junior Member
Location: Asia Join Date: Jan 2010
Posts: 3
|
![]()
No no, please don't get me wrong here. Let me clarify a bit.
I understand we still need to sequence personalized genomes. However, the question is once we get the reads, do we need to assemble them? My thinking is that the need to have a fully assembled sequence arose from the fact that we need to know how this particular sequence vary from the human reference genome (hg18, for example). But based on just the reads, we can use programs like the SOAP package to locate variations already. For everything else, it is supposed to be identical to the reference genome we use. So why bother assembling, once we have the reads? Cannot we get all information we need from the reads alone? |
![]() |
![]() |
![]() |
#4 |
(Jeremy Leipzig)
Location: Philadelphia, PA Join Date: May 2009
Posts: 116
|
![]()
I would venture to say that most of the interest in assembly is in de novo assembly of novel organisms.
I believe the number of organisms that have been fully sequenced is still in the low hundreds. |
![]() |
![]() |
![]() |
#5 |
Senior Member
Location: Boston area Join Date: Nov 2007
Posts: 747
|
![]()
Take a look at the recent pan genome paper (not to be confused with the Pan genome paper :-). There may be significant portions of human genome which are not yet represented in any genome database because they are structural variants restricted to populations not yet sampled.
Full scale de novo sequencing may not always be necessary -- some sort of intelligent local reassembly / reassembly of everything that doesn't map followed by integration with that which does. |
![]() |
![]() |
![]() |
#6 |
Senior Member
Location: USA Join Date: Apr 2009
Posts: 482
|
![]()
For reliable detection of variations you need fairly high coverage (at least 10X and more is better), thus you need to assemble the multiple reads to determine the coverage. Regions with low coverage give less certainty in whether a variation is real and high coverage gives more confidence (obviously).
|
![]() |
![]() |
![]() |
#7 |
Junior Member
Location: Asia Join Date: Jan 2010
Posts: 3
|
![]()
I guess what you guys are trying to say here is that, to detect the variations specific to the individual whose genome is being sequenced, we have to assemble the reads anyway. Ok that I agree.
But do we have a need for the finished personalized human genome sequence? (Assuming that all the variations would have already been detected during the process of genome assembling.) Thanks for any input. |
![]() |
![]() |
![]() |
Tags |
human genome assembler, sequence variation |
Thread Tools | |
|
|