Skip to main content

Runner Configuration

Configuration for the flow runner

RunnerConfig

info

RunnerConfig is defined beneath the following ancestor nodes in the YAML structure:

Below are the properties for the RunnerConfig. Each property links to the specific details section further down in this page.

PropertyDefaultTypeRequiredDescription
sizeRuntimeSize
NoOverride the size of the flow runner. If not specified, the flow runner inherits the size from the deployment or workspace.

Property Details

BackfillRun

Defines the parameters for a backfill run.

PropertyDefaultTypeRequiredDescription
backfill_runYesBackfill run options.

BackfillRunOptions

Options for a backfill run.

PropertyDefaultTypeRequiredDescription
descriptionstring
NoBrief description of what the model does.
metadataNoMeta information of a resource. In most cases it doesn't affect the system behavior but may be helpful to analyze project resources.
namestringYesThe name of the model
flow_namestringYesName of the Flow that is to be backfilled.
start_timestringYesStart time of the time range to be backfilled.
end_timestringYesEnd time of the time range to be backfilled.
granularitystring ("day", "week", "month")YesTime granularity to use for backfill. Must be one of: 'day', 'week', 'month'. The backfill runner divides the date range into Flow runs of this granularity and launches these Flow runs.
max_concurrent_flow_runs1integerNoMaximum number of concurrent Flow runs used for backfill. This is used to limit the number of Flow runners (and hence cluster resources) that are launched simultaneously.
backfill_orderstring ("forward_chronological", "reverse_chronological")
NoOrder to use for backfilling - either forward or reverse chronological order.
flow_run_optionsNoAdditional options for each Flow run launched during the backfill.
run_final_syncboolean
NoBoolean flag indicating whether to run a final sync after concurrent backfill Flow runs. This final sync is a single Flow run that is executed without any time parameters, and is meant to sync the data to the latest state and capture any missing time intervals.

Flow

A Flow is the primary unit of execution in Ascend and contains a collection of Components assembled into a directed acyclic graph (DAG).

PropertyDefaultTypeRequiredDescription
flowYes

FlowOptions

Defines the options for a Flow

PropertyDefaultTypeRequiredDescription
parametersobject with property values of type None
NoDictionary of parameters to use for resource.
defaultsarray[None]
NoList of default configs with filters that can be applied to a resource config.
descriptionstring
NoBrief description of what the model does.
metadataNoMeta information of a resource. In most cases it doesn't affect the system behavior but may be helpful to analyze project resources.
namestringYesThe name of the model
data_planeNoData plane to use for the flow.
versionstring
NoFlow version.
bootstrapstring
NoBootstrap command to run within the Docker container.
runnerRunnerConfig
NoRunner configuration.
component_concurrencyinteger
NoMaximum number of concurrent Components to run within this Flow.

FlowRun

Defines the run-specific parameters for a Flow, one flow can have multiple Flow runs

PropertyDefaultTypeRequiredDescription
flow_runYes

FlowRunBaseOptions

Base options for a Flow Run

PropertyDefaultTypeRequiredDescription
parametersobject with property values of type None
NoDictionary of parameters to use for resource.
defaultsarray[None]
NoList of default configs with filters that can be applied to a resource config.
descriptionstring
NoBrief description of what the model does.
metadataNoMeta information of a resource. In most cases it doesn't affect the system behavior but may be helpful to analyze project resources.
run_testsTruebooleanNoBoolean flag indicating whether to run tests after processing data.
store_test_resultsboolean
NoBoolean flag indicating whether to store test results.
componentsarray[string]
NoList of Component names to run.
component_categoriesarray[string]
NoList of Component categories to run.
halt_flow_on_errorboolean
NoBoolean flag indicating whether to halt the Flow on error.
disable_optimizersboolean
NoBoolean flag indicating whether to disable optimizers.
disable_incremental_metadata_collectionboolean
NoBoolean flag indicating whether to disable collection of Incremental Read and Transform Component metadata.
full_refreshFalseboolean
NoBoolean flag indicating whether to perform a full refresh of each Component. ⚠ If true, will drop all internal data and metadata tables/views and re-compute them from scratch.
update_materialization_typeFalseboolean
NoBoolean flag indicating whether to update Component materialization types (e.g., changing types between 'simple', 'view', 'incremental', and 'smart'). ⚠ If materialization type changes are detected, existing data and metadata tables/views will be dropped and re-computed from scratch. Otherwise, existing data and metadata tables/views will be preserved and type changes will result in an error.
runner_overridesRunnerConfig
NoOverride runner configuration for this specific flow run. If not specified, inherits from the flow's runner configuration, or the deployment/workspace defaults.

FlowRunOptions

Options for a Flow Run

PropertyDefaultTypeRequiredDescription
parametersobject with property values of type None
NoDictionary of parameters to use for resource.
defaultsarray[None]
NoList of default configs with filters that can be applied to a resource config.
descriptionstring
NoBrief description of what the model does.
metadataNoMeta information of a Flow run. In most cases, it doesn't affect the system behavior but may be helpful to analyze project resources.
run_testsTruebooleanNoBoolean flag indicating whether to run tests after processing data.
store_test_resultsboolean
NoBoolean flag indicating whether to store test results.
componentsarray[string]
NoList of Component names to run.
component_categoriesarray[string]
NoList of Component categories to run.
halt_flow_on_errorboolean
NoBoolean flag indicating whether to halt the Flow on error.
disable_optimizersboolean
NoBoolean flag indicating whether to disable optimizers.
disable_incremental_metadata_collectionboolean
NoBoolean flag indicating whether to disable collection of Incremental Read and Transform Component metadata.
full_refreshFalseboolean
NoBoolean flag indicating whether to perform a full refresh of each Component. ⚠ If true, will drop all internal data and metadata tables/views and re-compute them from scratch.
update_materialization_typeFalseboolean
NoBoolean flag indicating whether to update Component materialization types (e.g., changing types between 'simple', 'view', 'incremental', and 'smart'). ⚠ If materialization type changes are detected, existing data and metadata tables/views will be dropped and re-computed from scratch. Otherwise, existing data and metadata tables/views will be preserved and type changes will result in an error.
runner_overridesRunnerConfig
NoOverride runner configuration for this specific flow run. If not specified, inherits from the flow's runner configuration, or the deployment/workspace defaults.
namestring
NoFlow run name.
flow_namestringYesName of the Flow to run.
event_start_timestring
NoEvent start time to be used for time-series processing.
event_end_timestring
NoEvent end time to be used for time-series processing.

RuntimeSize

Enumeration of available runtime sizes for deployments, workspaces, and flow runners. Each size corresponds to specific resource allocations (CPU, memory, disk).

No properties defined.