Definitions

class dagster.Definitions [source]

A set of definitions explicitly available and loadable by Dagster tools.

Parameters:

assets (Optional[Iterable[Union[AssetsDefinition, SourceAsset, CacheableAssetsDefinition]]]) – A list of assets. Assets can be created by annotating a function with @asset or @observable_source_asset. Or they can by directly instantiating AssetsDefinition, SourceAsset, or CacheableAssetsDefinition.
asset_checks (Optional[Iterable[AssetChecksDefinition]]) – A list of asset checks.
schedules (Optional[Iterable[Union[ScheduleDefinition, UnresolvedPartitionedAssetScheduleDefinition]]]) – List of schedules.
sensors (Optional[Iterable[SensorDefinition]]) – List of sensors, typically created with @sensor.
jobs (Optional[Iterable[Union[JobDefinition, UnresolvedAssetJobDefinition]]]) – List of jobs. Typically created with define_asset_job or with @job for jobs defined in terms of ops directly. Jobs created with @job must already have resources bound at job creation time. They do not respect the resources argument here.
resources (Optional[Mapping[str, Any]]) – Dictionary of resources to bind to assets. The resources dictionary takes raw Python objects, not just instances of ResourceDefinition. If that raw object inherits from IOManager, it gets coerced to an IOManagerDefinition. Any other object is coerced to a ResourceDefinition. These resources will be automatically bound to any assets passed to this Definitions instance using with_resources. Assets passed to Definitions with resources already bound using with_resources will override this dictionary.
executor (Optional[Union[ExecutorDefinition, Executor]]) – Default executor for jobs. Individual jobs can override this and define their own executors by setting the executor on @job or define_asset_job explicitly. This executor will also be used for materializing assets directly outside of the context of jobs. If an Executor is passed, it is coerced into an ExecutorDefinition.
loggers (Optional[Mapping[str, LoggerDefinition]) – Default loggers for jobs. Individual jobs can define their own loggers by setting them explictly.
metadata (Optional[MetadataMapping]) – Arbitrary metadata for the Definitions. Not displayed in the UI but accessible on the Definitions instance at runtime.
component_tree (Optional[ComponentTree]) – Information about the Components that were used to construct part of this Definitions object.

Example usage:

Definitions(
    assets=[asset_one, asset_two],
    schedules=[a_schedule],
    sensors=[a_sensor],
    jobs=[a_job],
    resources={
        "a_resource": some_resource,
    },
    asset_checks=[asset_one_check_one]
)

Dagster separates user-defined code from system tools such the web server and the daemon. Rather than loading code directly into process, a tool such as the webserver interacts with user-defined code over a serialization boundary.

These tools must be able to locate and load this code when they start. Via CLI arguments or config, they specify a Python module to inspect.

A Python module is loadable by Dagster tools if there is a top-level variable that is an instance of Definitions.

static merge [source]

Merges multiple Definitions objects into a single Definitions object.

The returned Definitions object has the union of all the definitions in the input Definitions objects.

Raises an error if the Definitions objects to be merged contain conflicting values for the same resource key or logger key, or if they have different executors defined.

Examples:

import submodule1
import submodule2

defs = Definitions.merge(submodule1.defs, submodule2.defs)

Returns: The merged definitions.Return type: Definitions

static validate_loadable [source]

Validates that the enclosed definitions will be loadable by Dagster:

No assets have conflicting keys.
No jobs, sensors, or schedules have conflicting names.
All asset jobs can be resolved.
All resource requirements are satisfied.
All partition mappings are valid.

Meant to be used in unit tests.

Raises an error if any of the above are not true.

get_all_asset_specs [source]: deprecated
This API will be removed in version 1.11. Use resolve_all_asset_specs instead.
Returns an AssetSpec object for AssetsDefinitions and AssetSpec passed directly to the Definitions object.

get_asset_value_loader [source]

Returns an object that can load the contents of assets as Python objects.

Invokes load_input on the IOManager associated with the assets. Avoids spinning up resources separately for each asset.

Usage:

with defs.get_asset_value_loader() as loader:
    asset1 = loader.load_asset_value("asset1")
    asset2 = loader.load_asset_value("asset2")

get_job_def [source]

Get a job definition by name. This will only return a JobDefinition if it was directly passed in to the Definitions object.

If that is not found, the Definitions object is resolved (transforming UnresolvedAssetJobDefinitions to JobDefinitions and an example). It also finds jobs passed to sensors and schedules and retrieves them from the repository.

After dagster 1.11, this resolution step will not happen, and will throw an error if the job is not found.

get_schedule_def [source]: Get a ScheduleDefinition by name. If your passed-in schedule had resource dependencies, or the job targeted by the schedule had resource dependencies, those resource dependencies will be fully resolved on the returned object.

get_sensor_def [source]: Get a SensorDefinition by name. If your passed-in sensor had resource dependencies, or the job targeted by the sensor had resource dependencies, those resource dependencies will be fully resolved on the returned object.

load_asset_value [source]

Load the contents of an asset as a Python object.

Invokes load_input on the IOManager associated with the asset.

If you want to load the values of multiple assets, it’s more efficient to use get_asset_value_loader(), which avoids spinning up resources separately for each asset.

Parameters:

asset_key (Union[AssetKey, Sequence[str], str]) – The key of the asset to load.
python_type (Optional[Type]) – The python type to load the asset as. This is what will be returned inside load_input by context.dagster_type.typing_type.
partition_key (Optional[str]) – The partition of the asset to load.
metadata (Optional[Dict[str, Any]]) – Input metadata to pass to the IOManager (is equivalent to setting the metadata argument in In or AssetIn).

Returns: The contents of an asset as a Python object.

map_asset_specs [source]

preview

This API is currently in preview, and may have breaking changes in patch version releases. This API is not considered ready for production use.

Map a function over the included AssetSpecs or AssetsDefinitions in this Definitions object, replacing specs in the sequence or specs in an AssetsDefinitions with the result of the function.

Parameters:

func (Callable[[AssetSpec], AssetSpec]) – The function to apply to each AssetSpec.
selection (Optional[Union[str, Sequence[str], Sequence[AssetKey], Sequence[Union[AssetsDefinition, SourceAsset]], AssetSelection]]) – An asset selection to narrow down the set of assets to apply the function to. If not provided, applies to all assets.

Returns: A Definitions object where the AssetSpecs have been replaced with the result of the function where the selection applies.Return type: Definitions

Examples:

import dagster as dg

my_spec = dg.AssetSpec("asset1")

@dg.asset
def asset1(_): ...

@dg.asset
def asset2(_): ...

defs = Definitions(
    assets=[asset1, asset2]
)

# Applies to asset1 and asset2
mapped_defs = defs.map_asset_specs(
    func=lambda s: s.merge_attributes(metadata={"new_key": "new_value"}),
)

map_resolved_asset_specs [source]

preview

This API is currently in preview, and may have breaking changes in patch version releases. This API is not considered ready for production use.

Map a function over the included AssetSpecs or AssetsDefinitions in this Definitions object, replacing specs in the sequence.

See map_asset_specs for more details.

Supports selection and therefore requires resolving the Definitions object to a RepositoryDefinition when there is a selection.

Examples:

import dagster as dg

my_spec = dg.AssetSpec("asset1")

@dg.asset
def asset1(_): ...

@dg.asset
def asset2(_): ...

# Applies only to asset1
mapped_defs = defs.map_resolved_asset_specs(
    func=lambda s: s.replace_attributes(metadata={"new_key": "new_value"}),
    selection="asset1",
)

resolve_all_asset_keys [source]: Returns an AssetKey object for every asset contained inside the resolved Definitions object.

resolve_all_asset_specs [source]: Returns an AssetSpec object for every asset contained inside the resolved Definitions object.

@dagster.definitions [source]

Decorator that marks a function as an entry point for loading Dagster definitions.

This decorator provides a lazy loading mechanism for Definitions objects, which is the preferred approach over directly instantiating Definitions at module import time. It enables Dagster’s tools to discover and load definitions on-demand without executing the definition creation logic during module imports. The user can also import this function and import it for test cases.

The decorated function must return a Definitions object and can optionally accept a ComponentLoadContext parameter, populated when loaded in the context of autoloaded defs folders in the dg project layout.

Parameters: fn – A function that returns a Definitions object. The function can either:

Accept no parameters: () -> Definitions
Accept a ComponentLoadContext: (ComponentLoadContext) -> DefinitionsReturns: A callable that will invoke the original function and return its Definitions object when called by Dagster’s loading mechanisms or directly by the user.Raises: DagsterInvariantViolationErrorDagsterInvariantViolationError – If the function signature doesn’t match the expected patterns (no parameters or exactly one ComponentLoadContext parameter).

Examples:

Basic usage without context:

import dagster as dg

@dg.definitions
def my_definitions():
    @dg.asset
    def sales_data():
        return [1, 2, 3]

    return dg.Definitions(assets=[sales_data])

Usage with ComponentLoadContext for autoloaded definitions:

import dagster as dg

@dg.definitions
def my_definitions(context: dg.ComponentLoadContext):
    @dg.asset
    def sales_data():
        # Can use context for environment-specific logic
        return load_data_from(context.path)

    return dg.Definitions(assets=[sales_data])

The decorated function can be imported and used by Dagster tools:

# my_definitions.py
@dg.definitions
def defs():
    return dg.Definitions(assets=[my_asset])

# dg dev -f my_definitions.py

Note: When used in autoloaded defs folders, the ComponentLoadContext provides access to environment variables and other contextual information for dynamic definition loading.

See also: - dagster.Definitions: The object that should be returned by the decorated function

dagster.ComponentLoadContext: Context object for autoloaded definitions

dagster.create_repository_using_definitions_args [source]

Create a named repository using the same arguments as Definitions. In older versions of Dagster, repositories were the mechanism for organizing assets, schedules, sensors, and jobs. There could be many repositories per code location. This was a complicated ontology but gave users a way to organize code locations that contained large numbers of heterogenous definitions.

As a stopgap for those who both want to 1) use the new Definitions API and 2) but still want multiple logical groups of assets in the same code location, we have introduced this function.

Example usage:

named_repo = create_repository_using_definitions_args(
    name="a_repo",
    assets=[asset_one, asset_two],
    schedules=[a_schedule],
    sensors=[a_sensor],
    jobs=[a_job],
    resources={
        "a_resource": some_resource,
    }
)

dagster.load_definitions_from_current_module [source]

preview

This API is currently in preview, and may have breaking changes in patch version releases. This API is not considered ready for production use.

Constructs the dagster.Definitions from the module where this function is called. Automatically discovers all objects defined at module scope that can be passed into the dagster.Definitions constructor.

Parameters:

resources (Optional[Mapping[str, Any]]) – Dictionary of resources to bind to assets in the loaded dagster.Definitions.
loggers (Optional[Mapping[str, LoggerDefinition]]) – Default loggers for jobs in the loaded dagster.Definitions. Individual jobs can define their own loggers by setting them explicitly.
executor (Optional[Union[Executor, ExecutorDefinition]]) – Default executor for jobs in the loaded dagster.Definitions. Individual jobs can define their own executors by setting them explicitly.

Returns: The dagster.Definitions defined in the current module.Return type: Definitions

dagster.load_definitions_from_module [source]

preview

This API is currently in preview, and may have breaking changes in patch version releases. This API is not considered ready for production use.

Constructs the dagster.Definitions from the given module. Automatically discovers all objects defined at module scope that can be passed into the dagster.Definitions constructor.

Parameters:

module (ModuleType) – The Python module to look for dagster.Definitions inside.
resources (Optional[Mapping[str, Any]]) – Dictionary of resources to bind to assets in the loaded dagster.Definitions.
loggers (Optional[Mapping[str, LoggerDefinition]]) – Default loggers for jobs in the loaded dagster.Definitions. Individual jobs can define their own loggers by setting them explicitly.
executor (Optional[Union[Executor, ExecutorDefinition]]) – Default executor for jobs in the loaded dagster.Definitions. Individual jobs can define their own executors by setting them explicitly.

Returns: The dagster.Definitions defined in the given module.Return type: Definitions

dagster.load_definitions_from_modules [source]

preview

This API is currently in preview, and may have breaking changes in patch version releases. This API is not considered ready for production use.

Constructs the dagster.Definitions from the given modules. Automatically discovers all objects defined at module scope that can be passed into the dagster.Definitions constructor.

Parameters:

modules (Iterable[ModuleType]) – The Python modules to look for dagster.Definitions inside.
resources (Optional[Mapping[str, Any]]) – Dictionary of resources to bind to assets in the loaded dagster.Definitions.
loggers (Optional[Mapping[str, LoggerDefinition]]) – Default loggers for jobs in the loaded dagster.Definitions. Individual jobs can define their own loggers by setting them explicitly.
executor (Optional[Union[Executor, ExecutorDefinition]]) – Default executor for jobs in the loaded dagster.Definitions. Individual jobs can define their own executors by setting them explicitly.

Returns: The dagster.Definitions defined in the given modules.Return type: Definitions

dagster.load_definitions_from_package_module [source]

preview

This API is currently in preview, and may have breaking changes in patch version releases. This API is not considered ready for production use.

Constructs the dagster.Definitions from the given package module. Automatically discovers all objects defined at module scope that can be passed into the dagster.Definitions constructor.

Parameters:

package_module (ModuleType) – The package module to look for dagster.Definitions inside.
resources (Optional[Mapping[str, Any]]) – Dictionary of resources to bind to assets in the loaded dagster.Definitions.
loggers (Optional[Mapping[str, LoggerDefinition]]) – Default loggers for jobs in the loaded dagster.Definitions. Individual jobs can define their own loggers by setting them explicitly.
executor (Optional[Union[Executor, ExecutorDefinition]]) – Default executor for jobs in the loaded dagster.Definitions. Individual jobs can define their own executors by setting them explicitly.

Returns: The dagster.Definitions defined in the given package module.Return type: Definitions

dagster.load_definitions_from_package_name [source]

preview

This API is currently in preview, and may have breaking changes in patch version releases. This API is not considered ready for production use.

Constructs the dagster.Definitions from the package module for the given package name. Automatically discovers all objects defined at module scope that can be passed into the dagster.Definitions constructor.

Parameters:

package_name (str) – The name of the package module to look for dagster.Definitions inside.
resources (Optional[Mapping[str, Any]]) – Dictionary of resources to bind to assets in the loaded dagster.Definitions.
loggers (Optional[Mapping[str, LoggerDefinition]]) – Default loggers for jobs in the loaded dagster.Definitions. Individual jobs can define their own loggers by setting them explicitly.
executor (Optional[Union[Executor, ExecutorDefinition]]) – Default executor for jobs in the loaded dagster.Definitions. Individual jobs can define their own executors by setting them explicitly.

Returns: The dagster.Definitions defined in the package module for the given package name.Return type: Definitions