**Brian Bushnell** · 02-14-2017, 05:38 PM

For amplicons, my understanding is (and I would appreciate if someone closer to the wet-lab could confirm or deny this) that the molecules being sequenced are laid out like this:

[adapter1][barcode1][more adapter1][sequencing primer1][pcr primer1] actual genomic sequence [pcr primer2][sequencing primer2][more adapter2][barcode2][adapter2]

The sequencing primers are not (usually) part of the read, unless you are using staggered variable-length primers to increase library diversity, but in that case only a few bp of it get sequenced. The PCR primers are always part of the read. I think that whether the PCR primers are genomic or synthetic depends on the process; I've never really gotten a conclusive answer on that.

**nouse** · 02-15-2017, 01:04 AM

Thanks for your answer.
In my experience with the HiSeq, the raw sequences I got included barcodes and PCR primers (which makes sense, since they were sequenced after all).

This indicates that Figure 1 of the Illumina application note is either misleading or wrong, or that they used their PCR primer regions as the target for another sequencing round.

**nucacidhunter** · 02-15-2017, 02:39 PM

Brian’s explanation of the amplicon library structure is correct. However, I would add that if there are variable-length diversity nucleotides or barcodes at the 5’ end of either PCR primer, they will be sequenced along with the PCR primers. If someone uses custom sequencing primers that bind to the PCR primers, then the PCR primers will not be sequenced (and the sequencing primer sites do not need to be included in the adapter design). In this case, diversity nucleotides added to the 5’ end of the primers will not be useful because they cannot be sequenced.

Fig. 1 in Illumina’s note indicates that the hypervariable region is 254 bp and that the minimum length of the amplified region, including the conserved 5’ and 3’ flanking regions used for priming, is 291 bp. With standard sequencing primers, 2x150 reads spanning 291 bp overlap by only 300 - 291 = 9 bp; a 46 bp overlap (300 - 254) is possible only if custom primers were used for sequencing. But the figure indicates that standard Illumina sequencing primers were used, so the figure is incorrect.
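To make the arithmetic above concrete, here is a minimal sketch of the overlap calculation. The function name `paired_end_overlap` is made up for illustration, and it assumes both reads are full length with no diversity nucleotides or quality trimming; the 254 bp and 291 bp figures are the ones quoted from the Illumina note.

```python
def paired_end_overlap(read_len, sequenced_span):
    """Overlap (bp) between two paired reads of read_len that together
    cover a span of sequenced_span bases. A negative value means a gap."""
    return 2 * read_len - sequenced_span

# Standard sequencing primers: reads start at the PCR primers,
# so the pair has to span the whole 291 bp amplicon.
with_pcr_primers = paired_end_overlap(150, 291)

# Custom sequencing primers annealing to the PCR primers: reads start
# right after them, so the pair only has to span the 254 bp region.
without_pcr_primers = paired_end_overlap(150, 254)

print(with_pcr_primers, without_pcr_primers)  # 9 46
```

So which position counts as the "first sequenced position" decides whether the overlap is 9 bp or 46 bp for the same library.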

**kmcarr** · 02-16-2017, 06:37 AM

> Originally posted by Brian Bushnell:
>
> For amplicons, my understanding is (and I would appreciate if someone closer to the wet-lab could confirm or deny this) that the molecules being sequenced are laid out like this:
>
> [adapter1][barcode1][more adapter1][sequencing primer1][pcr primer1] actual genomic sequence [pcr primer2][sequencing primer2][more adapter2][barcode2][adapter2]
>
> The sequencing primers are not (usually) part of the read, unless you are using staggered variable-length primers to increase library diversity, but in that case only a few bp of it get sequenced. The PCR primers are always part of the read. I think that whether the PCR primers are genomic or synthetic depends on the process; I've never really gotten a conclusive answer on that.

It is not always the case that the PCR primers are part of the read. In the two most cited 16S-V4 protocols (Caporaso & Knight, Kozich & Schloss), custom sequencing primers that match the target-specific PCR primers are added to the MiSeq run. The resulting reads start immediately after the 3' ends of the PCR primers, so there is no PCR primer sequence to trim from your reads, and hence no wasted sequence.
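As a rough illustration of the difference, the sketch below walks the read 1 side of the layout Brian described and reports which segments a read covers, depending on where the sequencing primer anneals. The helper `read_coverage` and all segment lengths except the 254 bp region are hypothetical placeholders, not values from either protocol.

```python
# Read 1 side of the molecule, 5' to 3'. Lengths are illustrative
# assumptions, apart from the 254 bp region mentioned earlier in the thread.
LAYOUT = [
    ("adapter1", 29),
    ("barcode1", 8),
    ("more adapter1", 10),
    ("sequencing primer1", 33),
    ("pcr primer1", 19),
    ("genomic", 254),
]

def read_coverage(layout, anneal_site, read_len):
    """List (segment, bases) covered by a read that starts immediately
    3' of the segment the sequencing primer anneals to."""
    names = [name for name, _ in layout]
    start = names.index(anneal_site) + 1
    covered, remaining = [], read_len
    for name, length in layout[start:]:
        if remaining <= 0:
            break
        used = min(length, remaining)
        covered.append((name, used))
        remaining -= used
    return covered

# Standard sequencing primer: the read begins with the PCR primer.
standard = read_coverage(LAYOUT, "sequencing primer1", 150)
# Custom sequencing primer matching the PCR primer: the read begins in the target.
custom = read_coverage(LAYOUT, "pcr primer1", 150)
print(standard)
print(custom)
```

Under these assumed lengths, the standard setup spends 19 of 150 cycles on primer sequence, while the custom-primer setup spends all 150 on the target.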

Caporaso, J. G., Lauber, C. L., Walters, W. A., Berg-Lyons, D., Lozupone, C. A., Turnbaugh, P. J., et al. (2011). Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. Proceedings of the National Academy of Sciences, 108 Suppl 1, 4516–4522. http://doi.org/10.1073/pnas.1000080107

Kozich, J. J., Westcott, S. L., Baxter, N. T., Highlander, S. K., & Schloss, P. D. (2013). Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform. Applied and Environmental Microbiology, 79(17), 5112–5120. http://doi.org/10.1128/AEM.01043-13

**Brian Bushnell** · 02-16-2017, 07:09 AM

> Originally posted by kmcarr:
>
> In the two most cited 16S-V4 protocols (Caporaso & Knight, Kozich & Schloss) custom sequencing primers which match the target specific PCR primer are added to the MiSeq run. This results in read data which starts immediately after the 3' ends of the PCR primers so there is no PCR primer sequence to trim from your reads, and hence no wasted sequence.

Oh, that's clever. I wonder if that caused some compromises that limit the diversity of organisms that will amplify? I guess I need to read the papers.

What is the official "first sequenced position" (for overlap calculation)??