![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Ion Torrent Sequencing | qnc | General | 6 | 06-20-2014 12:57 AM |
Ion Torrent claims of MiSeq showing post-homopolymer substitution errors | ECO | Illumina/Solexa | 13 | 05-22-2012 05:31 AM |
Ion Torrent and desktop sequencing: The Death of the Core | genbio64 | Core Facilities | 8 | 04-08-2012 06:01 PM |
Bisulphite sequencing - Ion Torrent | arnaud.kr | Ion Torrent | 1 | 11-11-2011 04:06 AM |
Can Ion Torrent be useful for microRNA sequencing? | aushev | The Pipeline | 4 | 11-25-2010 10:42 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Junior Member
Location: Edmonton, Canada Join Date: Aug 2012
Posts: 4
|
![]()
I am trying to detect rare variants in pooled mutated samples. I have barcoded my pools and I have 3 genes I am analyzing in each of 28 pools. Since I am doing this as a test with diluted samples bearing a known mutation I can confirm if the technique is working. I can clearly see the expected mutations (around 2% change), but also see a bunch of sequencing errors, mainly at the end of the reads, which at the end show frequencies close to the real mutation. I been struggling for the last couple of days with SAMtools, CRISP and even the ion torrent variant call plugin without success (this last one only calls alleles with a minimum of 5% frequency and rare mutations are lower than that). I know there are other Biologist-friendly tools out there, but they are usually expensive, so I am also on the lookout for something free and efficient.
I would mainly like to find a way to tell those sequencing errors from the real variants...any advise? |
![]() |
![]() |
![]() |
#2 |
Member
Location: 02119 Join Date: Jul 2011
Posts: 29
|
![]()
This isn't an ion torrent specific problem. As the frequency of the mutation you're looking for falls near and below the sequencing error rate, things get hard. I think you need to consider an approach like this:
http://labs.bio.unc.edu/CDJones/PrimerID/Home.html where you introduce random bases into your primers such that you can tell which sequencing reads came from which first-round copies. Basically, you generate a first set of molecules from the template with a random sequence on the end, then amplify and sequence those. Now rather than 10k reads from your amplicon, you have 100 reads of 100 original molecules. The 99 original molecules without the mutation produce mostly WT bases. The 1 original molecule with the mutation produces mostly reads with the variant base. You may also see this technique referred to as dogtagging. |
![]() |
![]() |
![]() |
#3 |
Junior Member
Location: Edmonton, Canada Join Date: Aug 2012
Posts: 4
|
![]()
thanks I will look into this
![]() |
![]() |
![]() |
![]() |
#4 |
Member
Location: Rockville, MD Join Date: Apr 2011
Posts: 23
|
![]()
A couple suggestions that might help:
First - depending on your application, you may also want to look into a viral quasispecies analysis tool such as ShoRAH or QuRe - the viral quasispecies problem is somewhat similar to rare variant calling. Also - I may be wrong, but I believe the Ion Torrent Variant Caller tool is tuned for human SNP calling. In my own experience it is not the best tool for use with microbial or viral sequences. I generally don't use it. You may want to aggressively pre-filter your FASTQ from the torrent server before read mapping. Filter your reads to only Q20 or higher. Remove all duplicate reads (I'm assuming this is from PCR amplicon sequencing data?) Then trim the ends of all reads by 3 - 5 bases and discard any read that is 10 - 20 nt or shorter in length. You will end up throwing out a fair amount of data, but your mapping and variant calling will improve. Then use tmap3 or some other ion torrent friendly read-mapper. We use CLCbio's software for this - but there are some open source solutions out there as well. hope that helps. |
![]() |
![]() |
![]() |
#5 |
Junior Member
Location: Edmonton, Canada Join Date: Aug 2012
Posts: 4
|
![]()
Thanks Jonathan,
The quasispecies might not apply to this study, but is still good to know since I will be working with retrotransposons in another area. I think you are right about the quality filtering. I also use the CLC bio and it seems that with the new quality variant caller I am able to distinguish the mutations I was expecting. thanks Leo. |
![]() |
![]() |
![]() |
Thread Tools | |
|
|