Data Engineering

Data Engineering

Up next

Unfaithful Chain of Thought

What's actually happening when an LLM "thinks out loud"? Research on human decision-making suggests that much of the reasoning we believe drives our choices is actually post hoc rationalization — we decide first, explain later. Katie and Ben get curious about whether the same mig ...  Show more

Benchmark Bank Heist

What if an AI decided the smartest way to pass its test was to find the answer key? That's exactly what Anthropic's Claude Opus did when faced with a benchmark evaluation — reasoning that it was being tested, tracking down the encrypted eval dataset, decrypting it, and returning ...  Show more

Recommended Episodes

Moving Machine Learning Into The Data Pipeline at Cherre
Data Engineering Podcast

<div class="wp-block-jetpack-markdown"><h2>Summary</h2>

Most of the time when you think about a data pipeline or ETL job what comes to mind is a purely mechanistic progression of functions that move data from point A to point B. Sometimes, however, one of those transformation ...

  Show more

#162 Scaling Data Engineering in Retail with Mohammad Sabah, SVP of Engineering & Data at Thrive Market
DataFramed

Poor data engineering is like building a shaky foundation for a house—it leads to unreliable information, wasted time and money, and even legal problems, making everything less dependable and more troublesome in our digital world. In the retail industry specifically, data enginee ...  Show more

Unpacking The Seven Principles Of Modern Data Pipelines
Data Engineering Podcast

<h2>Summary</h2>

Data pipelines are the core of every data product, ML model, and business intelligence dashboard. If you're not careful you will end up spending all of your time on maintenance and fire-fighting. The folks at Rivery distilled the seven principles of mod ...

  Show more

Data Quality Starts At The Source
Data Engineering Podcast

<div class="wp-block-jetpack-markdown"><h2>Summary</h2>

The most important gauge of success for a data platform is the level of trust in the accuracy of the information that it provides. In order to build and maintain that trust it is necessary to invest in defining, monitori ...

  Show more