Hopefully last release for the year: Bismark v0.13.1
We have just released a new version of Bismark (v0.13.1), which is available from the Babraham Bioinformatics website. This version adds several new useful options and bug fixes, perhaps most notably the new option --merge_CpG for the coverage2cytosine module that pools CpG information from both top and bottom strand methylation evidence and a fix for the methylation extractor option --multicore which now accurately shows the correct number of events in the M-bias plots.
Here is a more detailed list of all changes:
• Bismark Genome Preparation: Added a check for unique chromosome names to the Bismark indexer to avoid disappointments later
• Methylation Extractor: Fixed a bug for the M-bias reports when the option --multicore was used, in which case only the numbers of one core were used to constuct the report. Now every different thread writes out an individual M-bias table, and once the methylation extraction has completed all these individual files are merged into a single, cumulative table as it should be
• Methylation Extractor: Added a new option --mbias_off, which processes the files as normal but does not write out any M-bias files. This option is meant for users who run the methylation extractor two times, the first time to figure out whether there is a bias that needs to be removed, and the second time using the --ignore options, but without overwriting the already existent M-bias reports
• bismark2bedGraph: Deferred removal of the input file path information a little so that specifying file paths doesn't prevent bismark2bedGraph from finding the input files anymore
• bismark2bedGraph: If the specified output directory doesn't exist it will be created
• bismark2bedGraph: Changed the way scaffolds are sorted (with --gazillion/--scaffold specified) to -k3,3V (this was done following a suggestion by Volker Brendel, Indiana University: "The -k3,3V sort option is critical when the sequence names are numbered scaffolds (without left-buffering of zeros)
• coverage2cytosine: Added a new option --merge_CpG that will post-process the genome-wide report to write out an additional coverage file which has the top and bottom strand methylation evidence pooled into a single CpG dinucleotide entity. This may be the desirable input format for some downstream processing tools such as the R-package bsseq (by K.D. Hansen). An example would be:
This option is currently experimental, and only works if CpG context only and a single genome-wide report were specified (i.e. it doesn't work with the options --CX or --split_by_chromosome)
• coverage2cytosine: Changed the processing of not-covered chromosomes so that they are sorted and not processed randomly. This should make runs more reproducible
Comments or suggestions welcome. Happy New Year! Felix
We have just released a new version of Bismark (v0.13.1), which is available from the Babraham Bioinformatics website. This version adds several new useful options and bug fixes, perhaps most notably the new option --merge_CpG for the coverage2cytosine module that pools CpG information from both top and bottom strand methylation evidence and a fix for the methylation extractor option --multicore which now accurately shows the correct number of events in the M-bias plots.
Here is a more detailed list of all changes:
• Bismark Genome Preparation: Added a check for unique chromosome names to the Bismark indexer to avoid disappointments later
• Methylation Extractor: Fixed a bug for the M-bias reports when the option --multicore was used, in which case only the numbers of one core were used to constuct the report. Now every different thread writes out an individual M-bias table, and once the methylation extraction has completed all these individual files are merged into a single, cumulative table as it should be
• Methylation Extractor: Added a new option --mbias_off, which processes the files as normal but does not write out any M-bias files. This option is meant for users who run the methylation extractor two times, the first time to figure out whether there is a bias that needs to be removed, and the second time using the --ignore options, but without overwriting the already existent M-bias reports
• bismark2bedGraph: Deferred removal of the input file path information a little so that specifying file paths doesn't prevent bismark2bedGraph from finding the input files anymore
• bismark2bedGraph: If the specified output directory doesn't exist it will be created
• bismark2bedGraph: Changed the way scaffolds are sorted (with --gazillion/--scaffold specified) to -k3,3V (this was done following a suggestion by Volker Brendel, Indiana University: "The -k3,3V sort option is critical when the sequence names are numbered scaffolds (without left-buffering of zeros)
• coverage2cytosine: Added a new option --merge_CpG that will post-process the genome-wide report to write out an additional coverage file which has the top and bottom strand methylation evidence pooled into a single CpG dinucleotide entity. This may be the desirable input format for some downstream processing tools such as the R-package bsseq (by K.D. Hansen). An example would be:
Code:
genome-wide CpG report (old) gi|9626372|ref|NC_001422.1| 157 + 313 156 CG gi|9626372|ref|NC_001422.1| 158 - 335 156 CG merged CpG evidence coverage file (new) gi|9626372|ref|NC_001422.1| 157 158 67.500000 648 312
• coverage2cytosine: Changed the processing of not-covered chromosomes so that they are sorted and not processed randomly. This should make runs more reproducible
Comments or suggestions welcome. Happy New Year! Felix
Comment