First off, I apologize for the long post.
Second, I have not found anything via searches that is definitively able to help me, so I am making a new thread here
Ok, so let me give you some info on my project (I am a Ph.D. student in Microbiology) so that you can understand the question I am trying to answer.
1) I study a bacteria
2) If grown on a petri dish with nutrient agar, and allowed to grow up from a single cell, bacteria will form what we call "colonies" which are visible by eye and are usually quite similar in their physical characteristics within a given species
3) My bacteria likes to make different-looking colonies
4) These different-looking colonies are more than just interesting to look at, they also have implications for physiology, behavior and pathogenicity (my bug is an opportunistic pathogen).
5) A collaborator of ours offered to sequence some of these "colony variants" for us and I have the data back and assembled
5a) The data was gathered on a Roche 454 platform
5b) I have been using DNASTAR's seqMan NGen to assemble the data and DNASTAR's seqMan Pro to view the assembly
5c) I have not closed any of the genomes with new sequence data
6) I have not yet been able to find a piece of software that will allow me to compare all of my sequenced variants at one time to determine whether any given mutation is important or not so I built a spreadsheet by hand to do that and manually searched all of the assemblies for the read/base-pair composition at any site of interest (usually found in the SNP report for one of the given variants). Unfortunately, this produces about 1,000 SNPs and introduces an unacceptable amount of human error (discovered the hard way) - both of which cannot be brute-force-fixed either with re-sequencing or with man hours (trust me, I've tried)
I **desperately** need a tool that will:
-take different assemblies from a nearly isogenic collection of samples and arrange them to see what is similar/different about them
-highlight regions that *could* be of interest but would normally be filtered out due to low depth of coverage (and could be filled in by targeted re-sequencing)
I was under the impression that SAM (Sequence Assembly Manager) could do this for me with the pileup function and I am in the process of installing it; however I have run into several snags, the biggest of which is that GNU Compiler Collection (GCC) will not update from 4.1.2 to anything higher and it looks like I am going to have to find a new OS and re-install because apparently, my OS (CentOS_5.6) is no longer supported.
Yet I cannot keep throwing my time away on this project - it supposed to be a preliminary side project and we've been working on it for over a year now, so I have to stop with this trial-and-error nonsense and actually finish the data analysis. I am begging you guys, if there is any piece of software you know that would do this, what is it and what do I need to run it? (and will you coach me into getting the thing up and running) Or, ***if you have personal experience doing this kind of thing (or know someone who does), could you PLEASE contact me*** - I will seriously buy you beer and make you cookies or whatever you want
Second, I have not found anything via searches that is definitively able to help me, so I am making a new thread here
Ok, so let me give you some info on my project (I am a Ph.D. student in Microbiology) so that you can understand the question I am trying to answer.
1) I study a bacteria
2) If grown on a petri dish with nutrient agar, and allowed to grow up from a single cell, bacteria will form what we call "colonies" which are visible by eye and are usually quite similar in their physical characteristics within a given species
3) My bacteria likes to make different-looking colonies
4) These different-looking colonies are more than just interesting to look at, they also have implications for physiology, behavior and pathogenicity (my bug is an opportunistic pathogen).
5) A collaborator of ours offered to sequence some of these "colony variants" for us and I have the data back and assembled
5a) The data was gathered on a Roche 454 platform
5b) I have been using DNASTAR's seqMan NGen to assemble the data and DNASTAR's seqMan Pro to view the assembly
5c) I have not closed any of the genomes with new sequence data
6) I have not yet been able to find a piece of software that will allow me to compare all of my sequenced variants at one time to determine whether any given mutation is important or not so I built a spreadsheet by hand to do that and manually searched all of the assemblies for the read/base-pair composition at any site of interest (usually found in the SNP report for one of the given variants). Unfortunately, this produces about 1,000 SNPs and introduces an unacceptable amount of human error (discovered the hard way) - both of which cannot be brute-force-fixed either with re-sequencing or with man hours (trust me, I've tried)
I **desperately** need a tool that will:
-take different assemblies from a nearly isogenic collection of samples and arrange them to see what is similar/different about them
-highlight regions that *could* be of interest but would normally be filtered out due to low depth of coverage (and could be filled in by targeted re-sequencing)
I was under the impression that SAM (Sequence Assembly Manager) could do this for me with the pileup function and I am in the process of installing it; however I have run into several snags, the biggest of which is that GNU Compiler Collection (GCC) will not update from 4.1.2 to anything higher and it looks like I am going to have to find a new OS and re-install because apparently, my OS (CentOS_5.6) is no longer supported.
Yet I cannot keep throwing my time away on this project - it supposed to be a preliminary side project and we've been working on it for over a year now, so I have to stop with this trial-and-error nonsense and actually finish the data analysis. I am begging you guys, if there is any piece of software you know that would do this, what is it and what do I need to run it? (and will you coach me into getting the thing up and running) Or, ***if you have personal experience doing this kind of thing (or know someone who does), could you PLEASE contact me*** - I will seriously buy you beer and make you cookies or whatever you want
Comment