Parameterised Tasks§
Parameterised tasks (see parameterisation) provide a way of implicitly looping over tasks without the need for Jinja2.
Parameterised tasks (see parameterisation) provide a way of implicitly looping over tasks without the need for Jinja2.
[cylc]
[[parameters]]
world = Mercury, Venus, Earth
[scheduling]
[[dependencies]]
graph = start => hello<world> => end
[runtime]
[[hello<world>]]
script = echo 'Hello World!'
[scheduling]
[[dependencies]]
graph = """
start => hello_Mercury => end
start => hello_Venus => end
start => hello_Earth => end
"""
[runtime]
[[hello_Mercury]]
script = echo 'Hello World!'
[[hello_Venus]]
script = echo 'Hello World!'
[[hello_Earth]]
script = echo 'Hello World!'
[runtime]
[[hello<world=Earth>]]
script = echo 'Greetings Earth!'
[runtime]
[[hello<world>]]
script = echo "Hello ${CYLC_TASK_PARAM_world}!"
Parameters can be either words or integers:
[cylc]
[[parameters]]
foo = 1..5
bar = 1..5..2
baz = pub, qux, bol
Hint
Remember that Cylc automatically inserts an underscore between the task and the parameter, e.g. the following lines are equivalent:
task<baz=pub>
task_pub
Hint
Cylc prefixes integer parameters with the parameter name:
[scheduling]
[[dependencies]]
graph = """
# task<bar> would result in:
task_bar1
task_bar3
task_bar5
# task<baz> would result in:
task_pub
task_qux
task_bol
"""
[scheduling]
[[dependencies]]
[[[T00/PT3H]]]
graph = """
get_observations<station> => consolidate_observations
"""
[runtime]
[[get_observations<station>]]
script = get-observations
[[[environment]]]
API_KEY = xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx
[[get_observations<station=aldergrove>]]
[[[environment]]]
SITE_ID = 3917
[[get_observations<station=camborne>]]
[[[environment]]]
SITE_ID = 3808
[[get_observations<station=heathrow>]]
[[[environment]]]
SITE_ID = 3772
[[get_observations<station=shetland>]]
[[[environment]]]
SITE_ID = 3005
This practical continues on from the Jinja2 practical.
Next section: Which approach to use
Practical§
This practical continues on from the Jinja2 practical.
Use Parameterisation To Consolidate The get_observations
Tasks.
Next we will parameterise the get_observations
tasks.
Add a parameter called station
:
[cylc]
UTC mode = True
+ [[parameters]]
+ station = aldergrove, camborne, heathrow, shetland
Remove the four get_observations
tasks and insert the following code
in their place:
[[get_observations<station>]]
script = get-observations
[[[environment]]]
API_KEY = {{ API_KEY }}
Using cylc get-config
you should see that Cylc replaces the
<station>
with each of the stations in turn, creating a new task for
each:
cylc get-config . --sparse -i "[runtime]"
The get_observations
tasks are now missing the SITE_ID
environment variable. Add a new section for each station with a
SITE_ID
:
[[get_observations<station=heathrow>]]
[[[environment]]]
SITE_ID = 3772
Hint
The relevant IDs are:
3917
3808
3772
3005
Solution
[[get_observations<station=aldergrove>]]
[[[environment]]]
SITE_ID = 3917
[[get_observations<station=camborne>]]
[[[environment]]]
SITE_ID = 3808
[[get_observations<station=heathrow>]]
[[[environment]]]
SITE_ID = 3772
[[get_observations<station=shetland>]]
[[[environment]]]
SITE_ID = 3005
Using cylc get-config
you should now see four get_observations
tasks, each with a script
, an API_KEY
and a SITE_ID
:
cylc get-config . --sparse -i "[runtime]"
Finally we can use this parameterisation to simplify the suite’s
graphing. Replace the get_observations
lines in the graph with
get_observations<station>
:
[[[PT3H]]]
# Repeat every three hours starting at the initial cycle point.
graph = """
- get_observations_aldergrove => consolidate_observations
- get_observations_camborne => consolidate_observations
- get_observations_heathrow => consolidate_observations
- get_observations_shetland => consolidate_observations
+ get_observations<station> => consolidate_observations
"""
Hint
The cylc get-config
command does not expand parameters or families
in the graph so you must use cylc graph
to inspect changes to the
graphing.
Use Parameterisation To Consolidate The post_process
Tasks.
At the moment we only have one post_process
task
(post_process_exeter
), but suppose we wanted to add a second task for
Edinburgh.
Create a new parameter called site
and set it to contain exeter
and edinburgh
. Parameterise the post_process
task using this
parameter.
Hint
The first argument to the post-process
task is the name of the
site. We can use the CYLC_TASK_PARAM_site
environment variable to
avoid having to write out this section twice.
Solution
First we must create the site
parameter:
[cylc]
UTC mode = True
[[parameters]]
station = aldergrove, camborne, heathrow, shetland
+ site = exeter, edinburgh
Next we parameterise the task in the graph:
-get_rainfall => forecast => post_process_exeter
+get_rainfall => forecast => post_process<site>
And also the runtime:
-[[post_process_exeter]]
+[[post_process<site>]]
# Generate a forecast for Exeter 60 minutes in the future.
- script = post-process exeter 60
+ script = post-process $CYLC_TASK_PARAM_site 60