#454: Data Pipelines with Dagster

Up next

#541: Monty - Python in Rust for AI

When LLMs write code to accomplish a task, that code has to actually run somewhere. And right now, the options aren't great. Spin up a sandboxed container and you're paying a full second of cold start overhead plus the complexity of another service. Let the LLM loose on your actu ...

#540: Modern Python monorepo with uv and prek

Monorepos -- you've heard the talks, you've read the blog posts, maybe you've seen a few tantalizing glimpses into how Google or Meta organize their massive codebases. But it's often in the abstract and behind closed doors. What if you could crack open a real, production monorepo ...

Recommended Episodes

Massively Parallel Data Processing In Python Without The Effort Using Bodo
Data Engineering Podcast

Summary

Python has become the de facto language for working with data. That has brought with it a number of challenges having to do with the speed and scalability of working with large volumes of information. There have been many ...


Shorten the distance between production data and insight
The Stack Overflow Podcast

Modern networked applications generate a lot of data, and every business wants to make the most of that data. Most of the time, that means moving production data through some transformation process to get it ready for the analytics process. But what if you could have in-app an ...


Ship Smarter Not Harder With Declarative And Collaborative Data Orchestration On Dagster+
Data Engineering Podcast

Summary

A core differentiator of Dagster in the ecosystem of data orchestration is its focus on software-defined assets as a means of building declarative workflows. With their launch of Dagster+ as the redesigned commercial companion to the open source project t ...


Establish A Single Source Of Truth For Your Data Consumers With A Semantic Layer
Data Engineering Podcast

Summary

Maintaining a single source of truth for your data is the biggest challenge in data engineering. Different roles and tasks in the business need their own ways to access and analyze the data in the organization. In order to enable this use case, while mainta ...
