![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Why using short reads? | dbrg77 | General | 1 | 03-25-2011 09:01 AM |
short sequencing reads | vlee2 | 454 Pyrosequencing | 4 | 02-28-2011 08:34 AM |
454 Reads correction with Short reads ? | yvan.wenger | Bioinformatics | 3 | 11-26-2010 05:17 AM |
Pre-assembly for short-reads to minimize RAM usage | Alex8 | Bioinformatics | 6 | 11-05-2010 06:58 AM |
clustering short reads | lpantano | Bioinformatics | 2 | 02-02-2010 06:56 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Senior Member
Location: Seattle Join Date: Feb 2010
Posts: 109
|
![]()
Someone asked me whether it makes sense to remove duplicate reads to get the library size down to fit RAM limit. I think it is a bad strategy as explained here -
http://www.homolog.us/blogs/2011/09/...n-k-mer-world/
__________________
http://homolog.us |
![]() |
![]() |
![]() |
#2 |
Member
Location: Israel Join Date: Dec 2010
Posts: 25
|
![]()
I think duplicated reads removed to avoid biases that resulted from library preparation (for example) and not for reduction of data for de-novo assembly.
Ilia |
![]() |
![]() |
![]() |
#3 |
Senior Member
Location: Seattle Join Date: Feb 2010
Posts: 109
|
![]()
That's a good point. Some filtering is necessary to take care of pileup of reads due to biases. I do that for alignment and SNP discovery, but think twice about it during de novo assembly. If no underlying genome is known, it is hard to tell whether the duplicated reads come from error or real sequence.
__________________
http://homolog.us |
![]() |
![]() |
![]() |
#4 |
Member
Location: Israel Join Date: Dec 2010
Posts: 25
|
![]()
So when you assemble reads in to contigs, you will prefer that at least several reads will support the assembly. If you will have identical reads, you might obtain false contigs.
Ilia |
![]() |
![]() |
![]() |
#5 |
Senior Member
Location: Seattle Join Date: Feb 2010
Posts: 109
|
![]()
It does not work that way for K-mer based assembler. Would you please explain your rationale? Why would one get false contigs?
__________________
http://homolog.us |
![]() |
![]() |
![]() |
#6 |
Member
Location: Israel Join Date: Dec 2010
Posts: 25
|
![]()
Let me rephrase my last comment:
If duplicated reads don't contribute to downstream the de novo assembly pipe, it will be good idea to remove them. Ilia |
![]() |
![]() |
![]() |
Thread Tools | |
|
|