I am analysing a environmental barcoding 454 dataset. As this is the first run I have completed I individually identified all species in all samples prior to bulk PCR and 454 sequencing, so I know what should be in each sample. While it has worked well I have one quirky thing with the dataset where I get 1-4 sequence reads appearing in samples that represent species that should not be in the sample. I though it may represent gut content of my species (as guts where included in the 454 extractions), but these reads are sporadically appear and are not alway associated with all replicates of a sample. I used a universal tailed design (with 2 PCR step to attached the MID) so I not sure how it could happen but is it possible for a small number of reads to be associated with the wrong MID? Has anyone else noticed this? Any ideas??
Unconfigured Ad
Collapse
X
-
I saw a paper in press that describes this source of error, but I forget the title and journal. it describes something in the way of MID swapping or errors in the MIDs that create either new MID combination not employed in the study or mimic existing MID combos. I have seen the same thing in our dual MID data i.e. we get a bunch of duel mid combination that were not employed, at very low sequencing depths.
Just ignore them in my opinion
-
-
One other thing that could explain your observations is aerosol contamination of PCR products, or during DNA extraction.
DNA contamination may occur if you are using a robot for extractions where the same pipets are only rinsed. For all next-gen library preps you should follow strict pcr-PCR conditions. i.e. separate rooms, barrier filter tips etc
Comment
-
-
We consistently see something similar (8+ 454 Jr. runs).
Multiplex sequencing of samples that each have unique MIDs on 5' and 3' resulting in reads that have crossing of MIDs.
(samples prepared on separate days, using pre- and post-PCR rooms)
Occurring at about 1% of total reads.
Most likely this occurs during emulsion PCR by cross-priming of unincorporated primers (no matter how good your sample prep is).
Hope this is informative
Comment
-
-
Just out of curiosity I used sfftools to split one of earlier sff files from multiplexed run. In addition to expected barcodes 1 to 5 I found reads with unused barcodes 6-12 at the rates of few percent each. Interestingly, tubes with barcodes 6-12 were never opened yet at the time the libraries were prepared and I am the only one in the entire institute who uses 454 sequencing. So the possibility of any sort of contamination is out of question. Either reagents were cross-contaminated already or there got to be another reason for emergence of reads with unexpected barcodes.
Comment
-
-
Same goes here.Originally posted by yaximik View PostJust out of curiosity I used sfftools to split one of earlier sff files from multiplexed run. In addition to expected barcodes 1 to 5 I found reads with unused barcodes 6-12 at the rates of few percent each. Interestingly, tubes with barcodes 6-12 were never opened yet at the time the libraries were prepared and I am the only one in the entire institute who uses 454 sequencing. So the possibility of any sort of contamination is out of question. Either reagents were cross-contaminated already or there got to be another reason for emergence of reads with unexpected barcodes.
The paper I linked too earlier discusses.
Comment
-
Latest Articles
Collapse
-
by SEQadmin2
I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.
Here are nine questions we think about, in roughly the order they matter, before...-
Channel: Articles
06-18-2026, 07:11 AM -
-
by SEQadmin2
Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.
The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
...-
Channel: Articles
06-02-2026, 10:05 AM -
ad_right_rmr
Collapse
News
Collapse
| Topics | Statistics | Last Post | ||
|---|---|---|---|---|
|
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population
by SEQadmin2
Started by SEQadmin2, 06-17-2026, 06:09 AM
|
0 responses
30 views
0 reactions
|
Last Post
by SEQadmin2
06-17-2026, 06:09 AM
|
||
|
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism
by SEQadmin2
Started by SEQadmin2, 06-09-2026, 11:58 AM
|
0 responses
44 views
0 reactions
|
Last Post
by SEQadmin2
06-09-2026, 11:58 AM
|
||
|
Started by SEQadmin2, 06-05-2026, 10:09 AM
|
0 responses
51 views
0 reactions
|
Last Post
by SEQadmin2
06-05-2026, 10:09 AM
|
||
|
Started by SEQadmin2, 06-04-2026, 08:59 AM
|
0 responses
51 views
0 reactions
|
Last Post
by SEQadmin2
06-04-2026, 08:59 AM
|
Comment