What data transformation library should I use? Pandas vs Dask vs Ray vs Modin vs Rapids (Ep. 112)


Up next

Europe, wake up! You Can't Be a Superpower on Someone Else's Servers (Ep. 304)

Rebuilding a defense industrial base takes 20 years and costs trillions. Tech sovereignty takes 3 years and political will. Europe is doing the hard thing and refusing the easy one. Here's why, and who's profiting from that refusal. Buy me a coffee https://ko-fi.com/datascience ...

Social media is an ant mill (Internet is a disaster) (Ep. 303)

Internet followed nature. Until it didn't. And became the disaster we know. Buy me a coffee https://ko-fi.com/datascience ✨ Connect with us! Personal newsletter: https://defragzone.substack.com 📩 Newsletter: https://datascienceathome.substack.com 🎙 Podcast: Available on Spotify ...

Recommended episodes

Massively Parallel Data Processing In Python Without The Effort Using Bodo
Data Engineering Podcast

Summary

Python has become the de facto language for working with data. That has brought with it a number of challenges having to do with the speed and scalability of working with large volumes of information. There have been many ...


#454: Data Pipelines with Dagster
Talk Python To Me

See the full show notes for this episode on the website at talkpython.fm/454 

Analyze Massive Data At Interactive Speeds With The Power Of Bitmaps Using FeatureBase
Data Engineering Podcast

Summary

The most expensive part of working with massive data sets is the work of retrieving and processing the files that contain the raw information. FeatureBase (formerly Pilosa) avoids that overhead by converting the data int ...


Building The Materialize Engine For Interactive Streaming Analytics In SQL
Data Engineering Podcast

Summary

Transactional databases used in applications are optimized for fast reads and writes with relatively simple queries on a small number of records. Data warehouses are optimized for batched writes and complex analytical qu ...
