rose_bunch
For the running of multiple command variants in parallel under a single
job, as defined by the application configuration.
Each variant of the command is run in the same working directory with
its output directed to separate .out
and .err
files of the form:
Should you need separate working directories you should configure your
command to create the appropriate subdirectory for working in.
Note
Under load balancing systems such as PBS or Slurm, you will need to
set resource requests to reflect the resources required by running
multiple commands at once e.g. if one command would require 1GB
memory and you have configured your app to run up to four commands at
once then you will need to request 4GB of memory.
Configuration
The application is normally configured in the
rose_bunch[bunch]
and rose_bunch[bunch-args]
sections of the rose-app.conf
file, but
rose-app.conf[command]
can be used too, see below for details.
-
Rose App
rose_bunch
-
Config
[bunch]
-
Config
command-format
= FORMAT
A Pythonic printf
-style format string to construct the commands
to run. Insert placeholders %(argname)s
for substitution of the
arguments specified under [bunch-args]
to the invoked
command. The placeholder %(command-instances)s
is reserved for
inserting an automatically generated index for the command
invocation when using the command-instances setting. If not specified
then the command to run is determined following the mode=command
logic, see rose app-run for details, and arguments
are made accessible to the command as environment variables.
-
Config
command-instances
= N
Allows the user to specify an integer value for the number of
instances of a command they want to run. This generates the values
used by the %(command-instances)s
value in command-format.
If [bunch]command-format is not specified then the command instance
is passed as the COMMAND_INSTANCES
environment variable
instead. This is useful for cases where the only difference between
invocations would be an index number e.g. ensemble members. Note
indexes start at 0
.
-
Config
pool-size
= N
Allows the user to limit the number of concurrently running commands.
If not specified then all command variations will be run at the same
time.
-
Config
fail-mode
= continue|abort
-
Specify what action you want the job to take on the failure of a
command that it is trying to run. If set to continue all command
variants will be run by the job and the job will return a non-zero
exit code upon completion e.g. if three commands are to be run and
the second one fails, all three will be run and the job will exit
with a return code of 1
. Alternatively, if fail-mode
is set to abort then on failure of any one of the command variants
it will stop trying to run any further variants N.B. the job will
wait for any already running commands to finish before exiting.
Commands that won’t be run due to aborting will be reported in the
job output with a [SKIP]
prefix when running in verbose mode.
For example in the case of three command variants with a
pool-size
of 1
and fail-mode=abort
,
if the second variant failed then the job would exit with a
non-zero error code without having run the third variant.
-
Config
incremental
= true|false
-
If set to true
then only failed commands will be re-run on
retrying running of the job. If any changes are made to the
configuration being run then all variants will be re-run. Similarly,
running the app with the --new
option to
rose task-run
will result in all commands being run. In verbose mode the app
will report commands that won’t be run due to previous successes
in the job output with a [PASS]
prefix.
-
Config
names
= name1 name2 ...
Allows defining names for each of the command variants to be run,
facilitating identification in logs. If not set then commands will
be identified by their index. The number of entries in the names
must be the same as the number of entries in each of the args to
be used.
-
Config
argument-mode
= Default|izip|izip_longest|product
-
If set to a value other than Default
then the values for each
bunch-arg will be manipulated. izip
will shrink all values so
all have the same length as the shortest bunch-arg. izip_longest
will pad out values for each bunch-arg with an empty string so that
each bunch-arg is the same length as the longest one. product
will expand all provided bunch-args to create each possible
combination. See the itertools
documentation in Python for
more information.
-
Config
[bunch-args]
This section is used to specify the various combinations of args to be
passed to the command specified under [bunch]command-format
,
if defined, or rose-app.conf[command]
otherwise.
-
Config
argname
= val1 val2 ...
Allows defining named lists of argument values to pass to
the commands being run. Multiple named sets of arguments can be
defined. Each argname can be referenced using %(argname)s
in
[bunch]command-format
, if specified, or ${argname}
environment variable otherwise. The only disallowed names are
command-instances
and COMMAND_INSTANCES
, which are reserved
for the auto-generated list of instances when the
[bunch]command-instances=N
option is used.