![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
smallRNA GAIIx raw fastq files - quality filter? | vebaev | Bioinformatics | 0 | 08-22-2011 11:30 AM |
PubMed: Accuracy and quality assessment of 454 GS-FLX Titanium pyrosequencing. | Newsbot! | Literature Watch | 0 | 05-20-2011 04:40 AM |
input files for IMAGE | Maegwin | Bioinformatics | 4 | 04-22-2011 05:54 PM |
IMAGE input files | skingan | Genomic Resequencing | 0 | 07-29-2010 01:02 PM |
ShortRead: a bioconductor package for input, quality assessment and exploration | Chien-Yuan Chen | Literature Watch | 0 | 11-09-2009 02:36 PM |
![]() |
|
Thread Tools |
![]() |
#1 |
Junior Member
Location: America Join Date: Apr 2010
Posts: 9
|
![]()
I just got done with an analysis based on a Sanger sequencing assay, where having the raw trace files was absolutely invaluable. They saved me from publishing some very "interesting" results which were actually experimental artifacts because they showed up in the trace files as sequencing anomalies which weren't caught by the SNP detection software we used.
Now I'm moving on to a project based on SOLiD sequencing, and I've learned that my collaborators are throwing away the raw fluorescence image files. The fact that I will not be able to go back to the raw data to check for anomalies makes me nervous, and I am trying to decide whether to push back on this policy. I am wondering whether people assessing SOLiD sequence calls ever get any added value from examining a sample of pertinent raw image files. |
![]() |
![]() |
![]() |
#2 |
Junior Member
Location: US Join Date: Apr 2010
Posts: 1
|
![]()
I would love to see this too. I have the same problem. Are there even any samples out there? I could imagine even going a step further back and looking at calibration files if they exist.
|
![]() |
![]() |
![]() |
#3 | |
Nils Homer
Location: Boston, MA, USA Join Date: Nov 2008
Posts: 1,285
|
![]() Quote:
|
|
![]() |
![]() |
![]() |
#4 |
Junior Member
Location: Ithaca, NY Join Date: Oct 2009
Posts: 5
|
![]()
..........
Last edited by Alex Coventry; 04-09-2010 at 03:35 PM. |
![]() |
![]() |
![]() |
#5 |
Junior Member
Location: America Join Date: Apr 2010
Posts: 9
|
![]()
So, what kind of debugging do you use the images for? I have heard of people using them to diagnose primer failures, but not much beyond that.
|
![]() |
![]() |
![]() |
#6 |
Nils Homer
Location: Boston, MA, USA Join Date: Nov 2008
Posts: 1,285
|
![]() |
![]() |
![]() |
![]() |
#7 |
Senior Member
Location: USA Join Date: Oct 2008
Posts: 158
|
![]()
The cost to store images often outweighs the value. I have never seen a reason to go back to the raw image files. If your primary metrics are fine, the images will not help.
|
![]() |
![]() |
![]() |
#8 |
Nils Homer
Location: Boston, MA, USA Join Date: Nov 2008
Posts: 1,285
|
![]()
It also depends on the company, but you might need a subset of images to provide to the company to prove that there is a problem with the sequencer or reagents. Just a thought.
Last edited by nilshomer; 04-11-2010 at 10:16 PM. Reason: speak-and-spell |
![]() |
![]() |
![]() |
#9 |
Member
Location: Oxford Join Date: Jul 2008
Posts: 24
|
![]()
Was very useful when setting up and debugging machines, chemistry and protocols at Solexa and Sanger.
Often systemic issues with the instrument showed up as easily recognisible problems in the images. It is easy to know what an ideal image should look like and any deviation from this can be quickly and easily recognized by eye (human brain is very good at this sort of pattern recognition). For example, contaminants in the reagents would show as bright blobs that would then get falsely called as clusters leading to pseudo sequences. Blobs adjacent to real clusters would spill over signal and skew base calls (and perhaps your SNP calls). Badly set up optics would lead to uneven illumination across the tile giving poor or artefactual base calls. Flaws in the focusing software, occuring sporadically would do the same. Manufacturing faults in the flowcells would lead to flowcell walls being images, altering focusing, and giving rise to pseudo-sequences. Other flaws in the flowcell coatings would show up in the images but not in the metrics, 'black holes' in the surfaces. Primer problems would give rise distinct sub-populations of 'speck' clusters. Optical duplicates could be spotted using the X,Y coordinates of similar sequences (I think based on alignment start and stops) and backtracking to the images demonstrated that some of this was due to egde effects and stage movements (and others) - and so on. In those days on GAI and IIs you could breeze into the lab and watch the images popping up on a selected number of tiles and hopefully they would be reassuringly normal. Yes indeed, if the manufacturers changed something in reagents or in the instrumentation and it was not beneficial or introduced a sporadic bug then the images provided a very powerful way to communicate that to them - especially when those bugs were only affecting some machines or batches and not others. We went back to images a lot in fact, contrary to what one of the posters asserts, although i do not know if this still holds true with more recent systems. The argument was - when the instruments and reagents become completely reliable black box, low variability systems with well defined behavior and outputs - of course then you don't need to back track to images. For sure, these days, a lot of QC metrics are provided and it may be possible to spot things more easily from these, and of course, far more images are produced now making it much more difficult to keep and review them. But personally, given the rate of change of these systems and the issues with variability, if i were working on a very large project spanning and long time period, I'd backtrack a bit and ensure that a run or base-call on a recent run equates to a base call on one of my earlier data sets where i may be combining these data or making comparisons. If you are modifying run protocols, cluster creation or other aspects of the system that are non-standard then its probably wise to back rack to images. Personally, I like to know whats going on under the hood, then i can head off a breakdown and get more mileage and economy. Some people like to just turn the key and drive. Last edited by clivey; 04-12-2010 at 02:09 AM. |
![]() |
![]() |
![]() |
#10 |
Junior Member
Location: America Join Date: Apr 2010
Posts: 9
|
![]()
Thanks for the information, clivey. It's hard to know how many of those issues will arise with SOLiD runs, but it provides a place to start looking.
|
![]() |
![]() |
![]() |
Thread Tools | |
|
|