Configuring branch deployments with GitHub or GitLab

Dagster+ feature

This feature is only available in Dagster+.

This guide covers setting up branch deployments for a Dagster project in Dagster+ Serverless or Hybrid with GitHub or GitLab. Once you've set up branch deployments, any time you create or update a pull request (or merge request) in your project repository, an associated branch deployment will automatically be created or updated in Dagster+.

note

Output created from a branch deployment -- such as a database, table, etc. -- won't be automatically removed from storage once a branch is merged or closed. For more information on handling this output, see the best practices section.

Dagster+ Serverless

Branch deployments are automatically configured for Serverless deployments when you configure CI/CD. For more information, see the CI/CD configuration guide.

Dagster+ Hybrid

Prerequisites

To follow the steps in this section, you'll need:

Organization Admin permissions in Dagster+.
The ability to run a new agent in your infrastructure.

Step 1: Configure CI/CD

Follow the CI/CD configuration guide to set up CI/CD for Dagster+ Hybrid with your preferred Git provider.

Step 2: Create a branch deployment agent

While you can use your existing production agent for branch deployments on Dagster+ Hybrid, we recommend creating a dedicated branch deployment agent. This ensures that your production instance isn't negatively impacted by the workload associated with branch deployments.

Amazon ECS
Docker
Kubernetes

Set up a new Docker agent. For instructions, see the Docker agent setup guide.
After the agent is set up, modify the dagster.yaml file as follows:
- Set the dagster_cloud_api.branch_deployments field to true
- Remove any deployment field(s)
For example:

dagster.yaml

instance_class:
  module: dagster_cloud.instance
  class: DagsterCloudAgentInstance

dagster_cloud_api:
  agent_token:
    env: DAGSTER_AGENT_TOKEN
  branch_deployments: true ## true enables branch deployments

user_code_launcher:
  module: dagster_cloud.workspace.docker
  class: DockerUserCodeLauncher
  config:
    networks:
      - dagster_cloud_agent
    server_ttl:
      enabled: true
      ttl_seconds: 7200 #2 hours

Set up a new Kubernetes agent. For instructions, see the Kubernetes agent setup guide.
After the agent is set up, modify your Helm values file to include the following:

dagsterCloud:
  branchDeployments: true
  workspace:
    serverTTL:
      enabled: true
      ttlSeconds: 7200

Step 3: Update `build.yaml` with the branch deployment agent

In the build.yaml file, replace build.registry with the registry used by the agent you created in step 2.

For example:

build.yaml
locations:
  - location_name: example_location
    code_source:
      python_file: repo.py
    build:
      directory: ./example_location
      registry: 764506304434.dkr.ecr.us-east-1.amazonaws.com/branch-deployment-agent

note

In older deployments, you may have a dagster_cloud.yaml file instead of a build.yaml file.

Step 4: Add secrets to your Git provider

GitHub
GitLab

In your GitHub repository, click the Settings tab.
In the Security section of the sidebar, click Secrets > Actions.
Click New repository secret.
In the Name field, enter the name of the secret. For example, DAGSTER_CLOUD_URL
In the Value field, paste the value of the secret.
Click Add secret.

Repeat steps 3-6 for each of the secrets required for the registry used by the agent you created in step 1. See below for more details:

Docker
Amazon ECR
Google Container Registry (GCR)

DAGSTER_CLOUD_URL - Your Dagster+ base URL (https://my_org.dagster.cloud)
DOCKERHUB_USERNAME - Your DockerHub username
DOCKERHUB_TOKEN - A DockerHub access token

DAGSTER_CLOUD_URL - Your Dagster+ base URL (https://my_org.dagster.cloud)
AWS_ACCESS_KEY - The Access key ID of the AWS IAM user you created in step 3
AWS_SECRET_ACCESS_KEY - The Secret access key of the AWS IAM user you created in step 3
AWS_REGION - The AWS region where your ECR registry is located

DAGSTER_CLOUD_URL - Your Dagster+ base URL (https://my_org.dagster.cloud)
GCR_JSON_KEY - Your GCR JSON credentials

In your GitLab repository, click Settings > CI/CD.
On the settings page, click Variables.
Under Project variables, click Add variable.
In the Key field, enter the name of the secret. For example, DAGSTER_CLOUD_URL.
In the Value field, paste the value of the secret.
Update the type, environments, visibility, flags, and description fields as needed.
Click Add variable.

Repeat steps 3-6 for each of the secrets required for the registry used by the agent you created in step 1. See below for more details:

Docker
Amazon ECR
Google Container Registry (GCR)

DAGSTER_CLOUD_URL - Your Dagster+ base URL (https://my_org.dagster.cloud)
DOCKERHUB_USERNAME - Your DockerHub username
DOCKERHUB_TOKEN - A DockerHub access token

DAGSTER_CLOUD_URL - Your Dagster+ base URL (https://my_org.dagster.cloud)
AWS_ACCESS_KEY - The Access key ID of the AWS IAM user you created in step 3
AWS_SECRET_ACCESS_KEY - The Secret access key of the AWS IAM user you created in step 3
AWS_REGION - The AWS region where your ECR registry is located

DAGSTER_CLOUD_URL - Your Dagster+ base URL (https://my_org.dagster.cloud)
GCR_JSON_KEY - Your GCR JSON credentials

Accessing branch deployments

Once configured, branch deployments can be accessed:

From a GitHub pull request
In Dagster+

Every pull request in the repository contains a View in Cloud link, which will open a branch deployment - or a preview of the changes - in Dagster+.

View in Cloud preview link highlighted in a GitHub pull request

Changing the base deployment

The base deployment has two main purposes:

It sets which full deployment is used to propagate Dagster+ managed environment variables that are scoped for branch deployments.
It is used in the UI to track changes to the branch deployment from its parent full deployment.

The default base for branch deployments is prod. To configure a different full deployment as the base, create a branch deployment using the dagster-cloud CLI (see steps above) and specify the deployment with the optional --base-deployment-name parameter.

Best practices

To ensure the best experience when using branch deployments, we recommend:

Configuring jobs based on environment. Dagster automatically sets environment variables containing deployment metadata, allowing you to parameterize jobs based on the executing environment. Use these variables in your jobs to configure things like connection credentials, databases, and so on. This practice will allow you to use branch deployments without impacting production data.
Creating jobs to automate output cleanup. As branch deployments don't automatically remove the output they create, you may want to create an additional Dagster job to perform the cleanup.

Dagster+ Serverless​

Dagster+ Hybrid​

Step 1: Configure CI/CD​

Step 2: Create a branch deployment agent​

Step 3: Update build.yaml with the branch deployment agent​

Step 4: Add secrets to your Git provider​

Accessing branch deployments​

Changing the base deployment​

Best practices​