Hello,
I am a new student of Bioinformatics from Seattle and so far it's a fascinating field. I am starting to work on a school project and am still a little lost, since this is my first contact with the field and the tools.
As part of the project, I would like to test BWA with a genome (it does not have to be as long as the human one; something smaller and easier to work with would be great) and reads of different lengths and error rates. The goal of the test would be to see how accurately BWA aligns reads of different lengths and with different error rates, and how its performance degrades as the length of the reads grows.
I have the Windows versions of BWA and SAMtools from Codeplex, as recommended in a different thread.
My question is, where can I find data to test BWA as mentioned above? How could I test different lengths/error rates? Any quick, general instructions on how to start would be greatly appreciated.
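To make the question more concrete, here is a rough sketch of what I imagine for the read side: sample reads of a chosen length from a reference sequence, introduce substitution errors at a chosen rate, and keep each read's true start position in its name so the alignments can be checked afterwards. The function names, the uniform substitution-only error model, and the constant quality string are all my own assumptions for illustration; none of this comes from BWA itself, and real sequencing errors are more complicated.

```python
import random


def simulate_reads(reference, read_length, error_rate, n_reads, seed=0):
    """Sample reads from `reference` and add random substitution errors.

    Assumed error model: each base is independently replaced by a different
    base with probability `error_rate` (no insertions or deletions).
    Returns a list of (true_start, read) tuples; true_start is 0-based.
    """
    rng = random.Random(seed)
    bases = "ACGT"
    reads = []
    for _ in range(n_reads):
        # Pick a start so the whole read fits inside the reference.
        start = rng.randrange(len(reference) - read_length + 1)
        read = list(reference[start:start + read_length])
        for i, base in enumerate(read):
            if rng.random() < error_rate:
                # Substitute with one of the other bases.
                read[i] = rng.choice([b for b in bases if b != base])
        reads.append((start, "".join(read)))
    return reads


def to_fastq(reads, quality_char="I"):
    """Format reads as FASTQ text, storing the true start in the read name.

    The quality string is a constant placeholder, not a modelled quality.
    Note that the stored start is 0-based, while the POS field in SAM
    output is 1-based.
    """
    lines = []
    for idx, (start, seq) in enumerate(reads):
        lines.append(f"@read{idx}_pos{start}")
        lines.append(seq)
        lines.append("+")
        lines.append(quality_char * len(seq))
    return "\n".join(lines) + "\n"
```

The idea would be to call `simulate_reads` once per (read length, error rate) combination, write each result with `to_fastq`, align each file with BWA, and then compare the reported positions with the ones stored in the read names. I do not know whether this is a sensible approach, so corrections are welcome.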
Thanks again, it's a pleasure to be here.