kedro.runner.AbstractRunner

class kedro.runner.AbstractRunner(is_async=False)[source]

AbstractRunner is the base class for all Pipeline runner implementations.

Methods

create_default_data_set(ds_name)

Factory method for creating the default dataset for the runner.

run(pipeline, catalog[, run_id])

Run the Pipeline using the datasets provided by catalog and save results back to the same objects.

run_only_missing(pipeline, catalog)

Run only the missing outputs from the Pipeline using the datasets provided by catalog, and save results back to the same objects.

__init__(is_async=False)[source]

Instantiates the runner classs.

Parameters

is_async (bool) – If True, the node inputs and outputs are loaded and saved asynchronously with threads. Defaults to False.

abstract create_default_data_set(ds_name)[source]

Factory method for creating the default dataset for the runner.

Parameters

ds_name (str) – Name of the missing dataset.

Return type

AbstractDataSet

Returns

An instance of an implementation of AbstractDataSet to be used for all unregistered datasets.

run(pipeline, catalog, run_id=None)[source]

Run the Pipeline using the datasets provided by catalog and save results back to the same objects.

Parameters
  • pipeline (Pipeline) – The Pipeline to run.

  • catalog (DataCatalog) – The DataCatalog from which to fetch data.

  • run_id (Optional[str]) – The id of the run.

Raises

ValueError – Raised when Pipeline inputs cannot be satisfied.

Return type

Dict[str, Any]

Returns

Any node outputs that cannot be processed by the DataCatalog. These are returned in a dictionary, where the keys are defined by the node outputs.

run_only_missing(pipeline, catalog)[source]

Run only the missing outputs from the Pipeline using the datasets provided by catalog, and save results back to the same objects.

Parameters
  • pipeline (Pipeline) – The Pipeline to run.

  • catalog (DataCatalog) – The DataCatalog from which to fetch data.

Raises

ValueError – Raised when Pipeline inputs cannot be satisfied.

Return type

Dict[str, Any]

Returns

Any node outputs that cannot be processed by the DataCatalog. These are returned in a dictionary, where the keys are defined by the node outputs.