Hi all,
We received a set of ribosome profiling libraries to be sequenced on the NextSeq 500. We ran the pooled libraries using 1 x 40bp chemistry on the NextSeq. The pool was qPCR quantified and run on the bioanalyzer. Please see the attached data. The clustering density was very broad across all 4 lanes. The % base plot showed a clear G signal across all 40 cycles! When fastQC was done, all 8 samples showed ~80% of the sequence to be G's! Since these were ribosome profiling libraries, we thought that most of the libraries could be rRNA and that resulted in low diversity that could be leading to inaccurate basecalling. Illumina told us that lower clustering, lower signal, and something library-related is causing this weird result. However, when the investigator ligated these libraries into a vector and sanger sequenced a few clones, all sequences were from genes and not rRNA. However, when these libraries were PCR amplified, no signal was detected at 14 cycles (protocol suggested maximum) but with 4 additional cycles, the correct library fragment size showed up. That suggested either very low template or very low PCR amplifiable molecules.
Could it be that higher number of PCR cycles introduced duplicates? Sanger sequencing shows genes and not rRNA. What could be the cause of these G's?
We received a set of ribosome profiling libraries to be sequenced on the NextSeq 500. We ran the pooled libraries using 1 x 40bp chemistry on the NextSeq. The pool was qPCR quantified and run on the bioanalyzer. Please see the attached data. The clustering density was very broad across all 4 lanes. The % base plot showed a clear G signal across all 40 cycles! When fastQC was done, all 8 samples showed ~80% of the sequence to be G's! Since these were ribosome profiling libraries, we thought that most of the libraries could be rRNA and that resulted in low diversity that could be leading to inaccurate basecalling. Illumina told us that lower clustering, lower signal, and something library-related is causing this weird result. However, when the investigator ligated these libraries into a vector and sanger sequenced a few clones, all sequences were from genes and not rRNA. However, when these libraries were PCR amplified, no signal was detected at 14 cycles (protocol suggested maximum) but with 4 additional cycles, the correct library fragment size showed up. That suggested either very low template or very low PCR amplifiable molecules.
Could it be that higher number of PCR cycles introduced duplicates? Sanger sequencing shows genes and not rRNA. What could be the cause of these G's?
Comment