Published by the Foundation for Open Access Statistics
Editors-in-chief: Bettina Grün, Torsten Hothorn, Edzer Pebesma, Achim Zeileis    ISSN 1548-7660; CODEN JSSOBK
Journal of Statistical Software
Authors: Bernd Bischl, Michel Lang, Olaf Mersmann, Jörg Rahnenführer, Claus Weihs
Title: BatchJobs and BatchExperiments: Abstraction Mechanisms for Using R in Batch Environments

Empirical analysis of statistical algorithms often demands time-consuming experiments. We present two R packages that greatly simplify working in batch computing environments. The package BatchJobs implements the basic objects and procedures to control any batch cluster from within R. It is structured around cluster versions of the well-known higher-order functions Map, Reduce and Filter from functional programming. Computations are performed asynchronously and all job states are persistently stored in a database, which can be queried at any point in time. The second package, BatchExperiments, is tailored for the still very general scenario of analyzing arbitrary algorithms on problem instances. It extends package BatchJobs by letting the user define an array of jobs of the kind “apply algorithm A to problem instance P and store results”. It is possible to associate statistical designs with parameters of problems and algorithms and therefore to systematically study their influence on the results.

The packages’ main features are: (a) Convenient usage: All relevant batch system operations are either handled internally or mapped to simple R functions. (b) Portability: Both packages use a clear and well-defined interface to the batch system which makes them applicable in most high-performance computing environments. (c) Reproducibility: Every computational part has an associated seed to ensure reproducibility even when the underlying batch system changes. (d) Abstraction and good software design: The code layers for algorithms, experiment definitions and execution are cleanly separated and enable the writing of readable and maintainable code.
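As a rough illustration of the Map and Reduce workflow described above, the following minimal sketch squares the numbers 1 to 10 as ten separate jobs and then sums the results. It assumes the BatchJobs package is installed and left at its default configuration, under which jobs run in the current R session rather than on a cluster. The registry id "squares", the temporary file directory, and the seed value are illustrative choices, not taken from the paper.

```r
library(BatchJobs)

# Create a registry: a directory plus a database that tracks all job states.
# The id, the temporary directory, and the seed are illustrative assumptions.
reg <- makeRegistry(id = "squares", file.dir = tempfile(), seed = 1)

# Cluster version of Map: define one job per input value.
batchMap(reg, function(x) x^2, 1:10)

# Submit all jobs and block until they have terminated.
submitJobs(reg)
waitForJobs(reg)

# The job states are stored persistently and can be queried at any time.
showStatus(reg)

# Cluster version of Reduce: fold the stored results into a single value.
total <- reduceResults(reg, fun = function(aggr, job, res) aggr + res, init = 0)
print(total)
```

On a real batch system the same script would be used unchanged, and only the package configuration selecting the cluster back end would differ.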

Page views: 3459. Submitted: 2012-05-30. Published: 2015-03-20.
Paper: BatchJobs and BatchExperiments: Abstraction Mechanisms for Using R in Batch Environments (PDF; downloads: 3355)
BatchJobs_1.6.tar.gz: R source package (downloads: 127; 107KB)
BatchExperiments_1.4.1.tar.gz: R source package (downloads: 117; 39KB)
v64i11.R: R example code from the paper (downloads: 177; 7KB)

DOI: 10.18637/jss.v064.i11

This work is licensed under the following licenses:
Paper: Creative Commons Attribution 3.0 Unported License
Code: GNU General Public License (at least one of version 2 or version 3) or a GPL-compatible license.