Hi,
last year we sequenced two libraries of fish skin samples for one study, the V4 region of the 16S.
The first library was sequenced with the v3 kit because it was new and our sequencing service thought it's worth a try. However, for V4 I actually don't need the v3 kit since the amplicon is ~ 250 bp long. I received ok sequences but later had problems with an increase of unique sequences in the analysis pipeline, a well known problem very nicely described by Pat Schloss: http://blog.mothur.org/2014/09/11/Wh...nce-matrix%3F/.
So we sequenced the second library with the v2 kit and I included 15 samples from library 1 in there, to be able to correct for run variation. In fact, the 15 technical replicates were very similar in later analysis but when I checked the whole data set of 250 samples, I detected in part a large variation in the data according to the different runs. For instance, when I analyzed alpha-diversity, the factor run had the highest impact on the results, higher than the factors I was interested in.
Did somebody of you ever had the same problem?
I'm actually afraid that I can not publish my results because of that issue.
Thanks!
last year we sequenced two libraries of fish skin samples for one study, the V4 region of the 16S.
The first library was sequenced with the v3 kit because it was new and our sequencing service thought it's worth a try. However, for V4 I actually don't need the v3 kit since the amplicon is ~ 250 bp long. I received ok sequences but later had problems with an increase of unique sequences in the analysis pipeline, a well known problem very nicely described by Pat Schloss: http://blog.mothur.org/2014/09/11/Wh...nce-matrix%3F/.
So we sequenced the second library with the v2 kit and I included 15 samples from library 1 in there, to be able to correct for run variation. In fact, the 15 technical replicates were very similar in later analysis but when I checked the whole data set of 250 samples, I detected in part a large variation in the data according to the different runs. For instance, when I analyzed alpha-diversity, the factor run had the highest impact on the results, higher than the factors I was interested in.
Did somebody of you ever had the same problem?
I'm actually afraid that I can not publish my results because of that issue.
Thanks!
Comment