![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Trying to apply overlap to each row of a "GRangesList" | francy | Bioinformatics | 13 | 09-29-2015 09:29 AM |
Paired reads overlap "too much" | Retro | Illumina/Solexa | 7 | 12-04-2014 04:33 AM |
MiSeq gDNA reads still fail "Kmer content" and "per base seq content" after trimming" | ysnapus | Illumina/Solexa | 4 | 11-12-2014 08:25 AM |
DEXSeq error in estimateDispersions: match.arg(start.method, c("log(y)", "mean")) | fpadilla | Bioinformatics | 14 | 07-03-2013 03:11 PM |
The position file formats ".clocs" and "_pos.txt"? Ist there any difference? | elgor | Illumina/Solexa | 0 | 06-27-2011 08:55 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Member
Location: Germany Join Date: Sep 2013
Posts: 11
|
![]()
The question ermerged after reviewing this older application note by Illumina:
https://www.illumina.com/content/dam..._miseq_16S.pdf Their figure 1 indicates that the first position which is counted towards the targeted read length is the first base following the 3' end of the gene-specific primer. Hence, the 515F-806R system targeting the V4 of the 16S rRNA gene is perfectly useable with a 2 x 150 base pairs MiSeq run, because of an 46 bp overlap within the 253 bp fragment covered between both gene-specific primers. However, if one would assume that the first base counting for the target read length is the base following the sequencing primer, that obviously changes. So, if one would count in both primers, the overlap is reduced to <10 bases, even in the predictably worst part of boths read, qualitywise. Adding barcode(s) would even result in no overlap. I hope this examples clarifies the question. The underlying task is to find a primer pair that is feasible with MiSeq 2 x 250 with very good coverage and HiSeq 2 x 150 with lower coverage but higher yields. I doubt that 2x150 bp HiSeq is a good system for hiseq. However, it seems that its OK to just use the forward read (according to caporaso et al 2011). What do you think? Last edited by nouse; 02-14-2017 at 02:26 PM. |
![]() |
![]() |
![]() |
#2 |
Super Moderator
Location: Walnut Creek, CA Join Date: Jan 2014
Posts: 2,707
|
![]()
For amplicons, my understanding is (and I would appreciate if someone closer to the wet-lab could confirm or deny this) that the molecules being sequenced are laid out like this:
[adapter1][barcode1][more adapter1][sequencing primer1][pcr primer1] actual genomic sequence [pcr primer2][sequencing primer2][more adapter2][barcode2][adapter2] The sequencing primers are not (usually) part of the read, unless you are using staggered variable-length primers to increase library diversity, but in that case only a few bp of it get sequenced. The PCR primers are always part of the read. I think that whether the PCR primers are genomic or synthetic depends on the process; I've never really gotten a conclusive answer on that. |
![]() |
![]() |
![]() |
#3 |
Member
Location: Germany Join Date: Sep 2013
Posts: 11
|
![]()
Thanks for your answer.
From my experience with the HiSeq, the raw sequences i got included barcodes and pcr primers (which makes sense, since they have been sequenced after all). This indicates that the figure 1 of the illumina application note is either misleading or wrong or they used their pcr primer regions as a target for another sequencing round. |
![]() |
![]() |
![]() |
#4 |
Jafar Jabbari
Location: Melbourne Join Date: Jan 2013
Posts: 1,238
|
![]()
Brian’s explanation showing amplicon library structure is correct. However, I would add that if there are variable length diversity nucleotides or barcodes at 5’ end of either PCR primers they will be sequenced as well along with the PCR primers. If someone uses custom sequencing primers that binds to the PCR primers then PCR primers will not be sequenced (sequencing primers will not be required to be included in adapter design). In this case diversity nucleotides added to 5’ end of primers will not be useful because they cannot be sequenced.
Fig 1 in Illumina’s note indicates that the hypervariable region is 254 bp and the minimum length of amplified region including conserved 5’ and 3’ flanking regions (used for priming) is 291 bp so 2x150 will not be enough to provide 46 bp overlap unless custom primers were used for sequencing. But the figure indicates that standard Illumina sequencing primers were used for sequencing thus the figure is incorrect. |
![]() |
![]() |
![]() |
#5 | |
Senior Member
Location: USA, Midwest Join Date: May 2008
Posts: 1,178
|
![]() Quote:
Caporaso, J. G., Lauber, C. L., Walters, W. A., Berg-Lyons, D., Lozupone, C. A., Turnbaugh, P. J., et al. (2011). Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. Proceedings of the National Academy of Sciences, 108 Suppl 1, 4516–4522. http://doi.org/10.1073/pnas.1000080107 Kozich, J. J., Westcott, S. L., Baxter, N. T., Highlander, S. K., & Schloss, P. D. (2013). Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform. Applied and Environmental Microbiology, 79(17), 5112–5120. http://doi.org/10.1128/AEM.01043-13 |
|
![]() |
![]() |
![]() |
#6 | |
Super Moderator
Location: Walnut Creek, CA Join Date: Jan 2014
Posts: 2,707
|
![]() Quote:
![]() |
|
![]() |
![]() |
![]() |
Thread Tools | |
|
|