**bastianwur** · 10-06-2016, 06:54 AM

Originally posted by JVGen

Would error correcting be beneficial?
[...]
How frequent are miscalls in Illumina sequencing?

Miscalls in Illumina sequencing are not very frequent, and probably shouldn't be a consideration at this step.
As a QC measure, you can use Pilon for error correction afterwards.

General comment: There is normally no single optimal setting. It's usually best to test a range of values that you think could be reasonable.
(We have a pipeline for this, which pushes the data through 5 assemblers with different parameters and then evaluates which assembly is potentially the best. There's at least one published pipeline for this out there, but I don't know the name right now.)
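To make the "test a range, then evaluate" idea concrete, here is a minimal Python sketch. It assumes you have already run the assembler at several settings and collected the contig lengths for each run; the labels (`k21`, `k31`) and the helper names are hypothetical. It ranks runs by N50, which is only one crude contiguity metric and not necessarily what the pipeline mentioned above uses.

```python
def n50(contig_lengths):
    """Return the N50: the length L such that contigs of length >= L
    cover at least half of the total assembly size."""
    total = sum(contig_lengths)
    running = 0
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    return 0  # empty assembly


def pick_best_run(runs):
    """Given {label: [contig lengths]}, return the label with the highest N50.

    N50 alone can reward misassemblies, so treat the result as a first
    filter rather than a final answer.
    """
    return max(runs, key=lambda label: n50(runs[label]))


if __name__ == "__main__":
    # Hypothetical contig lengths from two assembler settings.
    runs = {
        "k21": [100] * 10,
        "k31": [400, 300, 200, 100],
    }
    for label, lengths in runs.items():
        print(label, "N50 =", n50(lengths))
    print("best by N50:", pick_best_run(runs))
```

In practice you would combine this with other checks (total assembly size, number of contigs, read mapping rate) before settling on one assembly.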

**JVGen** · 10-06-2016, 06:57 AM

Thanks Bastian. I'm learning this is quite a complex process. I intend to look for open reading frames, so misassembly/miscalls are quite worrisome. They could introduce nonsense mutations, and we wouldn't know the difference (because we're sequencing viruses which are highly mutated). We need a bioinformatician :P
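To illustrate the concern, here is a small Python sketch showing how a single-base miscall can introduce a premature stop codon and shorten an open reading frame. The sequences and the function name are made up for illustration, and the scan only covers the forward strand starting at the first ATG.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def orf_codon_count(seq):
    """Count codons from the first ATG up to (not including) the first
    in-frame stop codon. If no stop is found, all complete codons after
    the ATG are counted."""
    seq = seq.upper()
    start = seq.find("ATG")
    if start == -1:
        return 0
    count = 0
    for i in range(start, len(seq) - 2, 3):
        if seq[i:i + 3] in STOP_CODONS:
            break
        count += 1
    return count


if __name__ == "__main__":
    true_seq = "ATGAAATACGGGTAA"     # ATG AAA TAC GGG TAA
    miscalled = "ATGAAATAAGGGTAA"    # C->A miscall turns TAC into TAA
    print(orf_codon_count(true_seq), orf_codon_count(miscalled))
```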

**bastianwur** · 10-06-2016, 07:19 AM

Yeah, that can for sure happen during the assembly process; we've seen this in some comparative genomics tests.
But as suggested, use Pilon for error correction afterwards. It needs the reads mapped to the assembly, and will then check whether the majority of the reads agree with the assembly, and correct it where they don't.
The tool is relatively easy to use if you're familiar with the command line and know what a BAM file and a FASTA file are.
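As a rough sketch of that workflow, the Python snippet below only builds the shell commands for mapping paired-end reads back to the assembly and then running Pilon; it does not execute anything. Using BWA and samtools for the mapping step is an assumption (any aligner that produces a sorted, indexed BAM should do), and the file names, the `polished` prefix and the `pilon.jar` path are placeholders.

```python
import shlex


def build_polish_commands(assembly, reads_1, reads_2,
                          prefix="polished", pilon_jar="pilon.jar"):
    """Return shell command strings: index, map, sort, index BAM, run Pilon."""
    asm = shlex.quote(assembly)
    r1, r2 = shlex.quote(reads_1), shlex.quote(reads_2)
    bam = shlex.quote(f"{prefix}.sorted.bam")
    return [
        # Index the draft assembly for BWA.
        f"bwa index {asm}",
        # Map the reads and write a coordinate-sorted BAM.
        f"bwa mem {asm} {r1} {r2} | samtools sort -o {bam} -",
        # Pilon needs the BAM to be indexed.
        f"samtools index {bam}",
        # --frags takes the paired-end BAM; --changes also writes a list of edits.
        f"java -jar {shlex.quote(pilon_jar)} --genome {asm} --frags {bam} "
        f"--output {shlex.quote(prefix)} --changes",
    ]


if __name__ == "__main__":
    for cmd in build_polish_commands("contigs.fasta", "R1.fastq.gz", "R2.fastq.gz"):
        print(cmd)
```

Printing the commands first makes it easy to check paths before running them by hand or through a job scheduler.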

**JVGen** · 10-07-2016, 05:03 AM

Thanks Bastian. Which de novo assemblers do you use? Many do not appear to save the reads; the output is a contig consensus sequence (Tadpole & Velvet, for instance). The only one I've used that saves the aligned reads with the assembled contig is the assembler within Geneious. I'd be interested to hear what you use.

Thanks!

Optimal De Novo Assembly Parameters