Thread | Thread Starter | Forum | Replies | Last Post |
Ion Torrent $1000 Genome!? Benchtop Ion Proton Sequencer | aeonsim | Ion Torrent | 88 | 10-28-2012 05:50 AM |
GATK and Ion Torrent data | NextGenSeb | Bioinformatics | 2 | 02-12-2012 09:36 PM |
Technical Brief: Processing Ion Torrent Data with RTG Investigator | Stewart Noyce | Vendor Forum | 0 | 10-19-2011 01:08 PM |
Ion Torrent PGM data with Mosaik-aligner | Magnus | Bioinformatics | 0 | 05-19-2011 04:59 AM |
CLC Genomics Workbench goes hand in hand with Ion Torrent data | CLC bio | Vendor Forum | 0 | 05-12-2011 06:34 AM |
#1 |
Member
Location: canada Join Date: Sep 2009
Posts: 29
|
Hi.
We have an Ion Torrent PGM on which we have sequenced a number of genomes. Looking at the TMAP alignments, we definitely see indel sequencing errors around homopolymer regions in the reference. The homopolymer runs don't necessarily have to be more than two consecutive bases either. Some quick analysis showed me that the positions of the indels in the reads aren't completely random and do stack up in the alignments, which could cause false positives in indel calling (as well as potentially interfering with true variants in the region). Downloading example E. coli data from the Life website, I see the same sorts of errors.
To my dismay, when I google this I find all sorts of technical and sponsored reports from Illumina et al. pointing out the errors in Ion Torrent data. Furthermore, I see reports getting fired back from Life discounting all the analysis in the first paper, and so it continues.
My question: can anyone who has sequenced and analyzed data on the PGM objectively comment on the rate of indels in the reads? I would like to hear if other people have seen what I've seen, or even better if they know of a magic fix.
thanks! |
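For anyone wanting to repeat the "do the indels stack up at homopolymers?" check described above, here is a minimal sketch in plain Python. It is not the original poster's analysis. It assumes the alignments have already been reduced to (0-based start position, CIGAR string) pairs against a single reference sequence, and the function names (`homopolymer_runs`, `indel_positions`, `indel_pileup`) are made up for illustration.

```python
import re
from collections import Counter

# One CIGAR element: a length followed by an operation code.
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")


def homopolymer_runs(ref, min_len=2):
    """Return (start, end, base) for runs of identical bases.

    Coordinates are 0-based and end-exclusive. min_len=2 is an assumption
    taken from the observation that runs of just two bases are affected.
    """
    runs = []
    i = 0
    while i < len(ref):
        j = i
        while j < len(ref) and ref[j] == ref[i]:
            j += 1
        if j - i >= min_len:
            runs.append((i, j, ref[i]))
        i = j
    return runs


def indel_positions(pos, cigar):
    """Return (kind, ref_pos) for each indel in one alignment.

    An insertion is reported at the reference base that follows it,
    a deletion at its first deleted base.
    """
    out = []
    ref_pos = pos
    for length, op in CIGAR_OP.findall(cigar):
        length = int(length)
        if op == "I":
            out.append(("I", ref_pos))
        elif op == "D":
            out.append(("D", ref_pos))
            ref_pos += length
        elif op in "M=XN":
            ref_pos += length
        # S, H and P do not consume reference bases.
    return out


def indel_pileup(ref, alignments, min_len=2):
    """Count indels per reference position and flag those touching a run."""
    counts = Counter()
    for pos, cigar in alignments:
        for kind, p in indel_positions(pos, cigar):
            counts[(kind, p)] += 1
    runs = homopolymer_runs(ref, min_len)
    report = []
    for (kind, p), n in sorted(counts.items(), key=lambda kv: (kv[0][1], kv[0][0])):
        in_hp = any(start <= p <= end for start, end, _ in runs)
        report.append((p, kind, n, in_hp))
    return report


if __name__ == "__main__":
    ref = "ACGTTAGCCTTAGA"
    alignments = [(0, "4M1D9M"), (0, "4M1D9M"), (2, "3M1I8M"), (0, "1M1D12M")]
    for row in indel_pileup(ref, alignments):
        print(row)
```

Positions where the count is well above one and the last column is True are the "stacked" homopolymer indels. Note that aligners differ in where they place an indel inside a repeat, so the same event can land on different coordinates in different reads unless the alignments are left-normalized first.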
#2 | |
Senior Member
Location: Massachusetts Join Date: May 2009
Posts: 116
|
Quote:
I had waited to see what information I got from Life Tech about it before coming to this site. They suggested it was a problematic sequence. Would be happy to provide more details or discuss it offline. Hilary Morrison |
|
#3 |
Member
Location: Guilford, CT and S.F., CA Join Date: Jan 2010
Posts: 64
|
Here's a recording of Dr. Niall Lennon from the Broad on their experiences with semiconductor sequencing.
http://www.youtube.com/watch?v=N2nbbBo0zT0 |
#4 |
Member
Location: Switzerland Join Date: May 2010
Posts: 19
|
We are looking into amplicon sequencing for variant detection. Some of the genes have several repeat regions, which can generate a lot of false-positive heterozygous indels.
Is there a model describing how the Ion Torrent generates read errors in these regions? With such a model we could adapt our filtering strategy to reduce the false-positive rate (although we want to be sure not to miss a true positive). |
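In the absence of a published error model, one common fallback is a heuristic flag rather than a hard filter. The sketch below is only that: a hypothetical rule that marks an indel call as suspect when it sits in a homopolymer run and either has a low allele fraction or is supported almost entirely by one strand. The `IndelCall` fields and all three thresholds are assumptions chosen for illustration, not values from Life Technologies or from this thread, and would need tuning against known controls.

```python
from dataclasses import dataclass


@dataclass
class IndelCall:
    pos: int      # reference position of the call
    hp_len: int   # homopolymer run length at the site (1 if none)
    alt_fwd: int  # forward-strand reads supporting the indel
    alt_rev: int  # reverse-strand reads supporting the indel
    depth: int    # total reads covering the site


def is_suspect(call, min_hp=2, min_af=0.3, max_strand_ratio=0.9):
    """Flag a call for manual review; thresholds are illustrative only."""
    alt = call.alt_fwd + call.alt_rev
    if call.depth == 0 or alt == 0:
        return True  # nothing supports the call
    allele_fraction = alt / call.depth
    strand_ratio = max(call.alt_fwd, call.alt_rev) / alt
    in_homopolymer = call.hp_len >= min_hp
    return in_homopolymer and (
        allele_fraction < min_af or strand_ratio > max_strand_ratio
    )


if __name__ == "__main__":
    calls = [
        IndelCall(pos=100, hp_len=4, alt_fwd=3, alt_rev=0, depth=50),
        IndelCall(pos=200, hp_len=1, alt_fwd=10, alt_rev=12, depth=50),
        IndelCall(pos=300, hp_len=3, alt_fwd=12, alt_rev=11, depth=48),
    ]
    for c in calls:
        print(c.pos, "suspect" if is_suspect(c) else "keep")
```

Because the rule only flags calls, a true heterozygous indel inside a repeat is sent for review rather than silently dropped, which matters if missing a true positive is the bigger risk.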
#5 | |
Senior Member
Location: Massachusetts Join Date: May 2009
Posts: 116
|
Quote:
|
|
#6 |
Member
Location: Switzerland Join Date: May 2010
Posts: 19
|
Hilary,
I tried to follow your link to the IT community website, but I get an error message: "It appears you're not allowed to view what you requested" (I am registered, but not as an IT customer).
David |
#7 |
Member
Location: Switzerland Join Date: May 2010
Posts: 19
|
Video from Broad: I just lost 10 min. with it, no info on indel read errors
|
#8 | |
Senior Member
Location: Massachusetts Join Date: May 2009
Posts: 116
|
Quote:
"I've just finished looking through a set of reads from control templates (16S tag sequencing using fusion primers) and see a very interesting (and sad) error pattern. In this image, the top sequence is the most abundant *incorrect* read; the bottom (blue) is the correct read. Number of each is at the left. E. coli tag results--less than 4% perfect reads. We have 43 controls including K12; the percent correct varied from 0% up to 82%. Seems to have happened on both runs, same day; one was 314 and the other 316. Some more investigation to do."
They told us it was a difficult sequence and to try the enzyme in the 200 nt sequencing kit.
IT_ErrorPattern.jpg
Last edited by HMorrison; 03-02-2012 at 06:14 AM. Reason: added text |
|
#9 |
Member
Location: Switzerland Join Date: May 2010
Posts: 19
|
Thanks Hilary for the details, interesting case indeed.
|
#10 |
Member
Location: NYC Join Date: Feb 2012
Posts: 10
|
Unfortunately this is an inherent problem of the 454, Ion Torrent and probably the Proton chemistry. It is well documented. This is from the NEJM article on the sequencing of the German E. coli outbreak:
"We also performed sequencing on the Illumina HiSeq platform in accordance with the manufacturer's instructions. An initial single-end run was used to correct errors in the Ion Torrent sequence, principally in homopolymeric tracts."
http://www.nejm.org/doi/full/10.1056...featured_home& |
#11 |
Member
Location: Philadelphia, PA Join Date: Mar 2010
Posts: 19
|
Wow, this would be a very big issue if proven to be a reproducible error for the PGM. But given how many CCTTs there are in genomes (occurring about once every 256 bp in totally random sequence), one would imagine this would have been identified much earlier in-house by LT. Looks like it may have something to do with the specific context within which a CCTT lies?
Thanks Hilary for the very intriguing observation. Have any other PGM users seen this?
Last edited by ngseq; 03-17-2012 at 05:33 PM. |
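The "once every 256 bp" figure is just 4^4 = 256 for a 4-mer, assuming the four bases are equally likely and independent, and counting one strand only. A small sketch to make that concrete and to count a motif in a real sequence for comparison; the helper names are made up for illustration.

```python
def expected_occurrences(seq_len, k):
    """Expected count of one specific k-mer in random sequence.

    Assumes uniform, independent bases and a single strand.
    """
    windows = max(seq_len - k + 1, 0)
    return windows / 4 ** k


def count_overlapping(seq, motif):
    """Count occurrences of motif in seq, allowing overlaps."""
    count = 0
    start = seq.find(motif)
    while start != -1:
        count += 1
        start = seq.find(motif, start + 1)
    return count


if __name__ == "__main__":
    # A 1 Mb random sequence would be expected to hold about 3906 CCTTs.
    print(expected_occurrences(1_000_000, 4))
    print(count_overlapping("GACCTTAGCCTTCCTTA", "CCTT"))
```

Real genomes are not uniform random sequence, so the observed count can differ a lot from this estimate, and reads from the opposite strand would see the reverse complement (AAGG) instead.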
#12 |
Senior Member
Location: Massachusetts Join Date: May 2009
Posts: 116
|
I would love to know what the two different enzymes are, but whatever enzyme is included in the 200 nt PGM sequencing kit has almost eliminated the problem I first reported. Errors are mainly in what I would consider true homopolymer runs (i.e. more than two of the same base). Much more likely to continue using the system for pyrotag-like (ph-tag?) sequencing.
|
#13 |
Member
Location: Rockville, MD Join Date: Apr 2011
Posts: 23
|
FWIW - there's a parallel discussion about this at the Ion Torrent community here:
http://lifetech-it.hosted.jivesoftwa.../2299?tstart=0
As HMorrison mentioned, the 200 nt kit largely eliminates this issue. |
#14 |
Senior Member
Location: Massachusetts Join Date: May 2009
Posts: 116
|
I know; I posted it there too. LifeTech doesn't seem to like me using SeqAnswers exclusively.
|
#15 |
Member
Location: Philadelphia, PA Join Date: Mar 2010
Posts: 19
|
thanks! looks like we should try to stick to 200nt kits.
|
#16 |
Senior Member
Location: Dronning Maud Land Join Date: Mar 2009
Posts: 129
|
Are there any datasets available to analyze with the 200 nt kit? I'm interested in the AmpliSeq cancer panel but need to understand the expected error profile.
Found they have data posted for the Paired-End AmpliSeq Cancer panel:
http://lifetech-it.hosted.jivesoftwa.../docs/DOC-2573
Last edited by epistatic; 03-29-2012 at 10:11 AM. Reason: Link added |
Tags |
deletion, insertion, ion torrent |