Skip to main content

Bluesky data analysis

note

To see video of this example

In this example, you'll build a pipeline with Dagster that:

Ingestion of data-related Bluesky posts
Modeling data using dbt
Representing data in a dashboard with PowerBI

Prerequisites

To follow the steps in this guide, you'll need:

Basic Python knowledge
Python 3.9+ installed on your system. Refer to the Installation guide for information.
Understanding of data pipelines and the extract, transform, and load process (ETL).
Familiar with dbt and data transformation.
Usage of BI tools for dashboards.

Step 1: Set up your Dagster environment

First, set up a new Dagster project.

Clone the Dagster repo and navigate to the project:

cd examples/docs_projects/project_atproto_dashboard

Install the required dependencies with uv:
```
uv sync
```
Activate the virtual environment:
- MacOS
- Windows
source .venv/bin/activate
Ensure the following environments have been populated in your .env file. Start by copying the template:
```
cp .env.example .env
```
And then populate the fields.

Step 2: Launch the Dagster webserver

To make sure Dagster and its dependencies were installed correctly, navigate to the project root directory and start the Dagster webserver:

dg dev

Next steps

Continue this example with ingestion

Prerequisites
Step 1: Set up your Dagster environment
Step 2: Launch the Dagster webserver
Next steps