Data Engineering Podcast

This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

An average episode of this podcast runs 48m. So far, 110 episode(s) have been published. This is a weekly podcast.



episode 107: Escaping Analysis Paralysis For Your Data Platform With Data Virtualization

An interview about data virtualization and data engineering automation with AtScale and the value of abstractions for your data platform architecture



episode 106: Designing For Data Protection

An interview about data protection regulations and how they can influence the design of your data platform



episode 105: Automating Your Production Dataflows On Spark

An interview about how Ascend provides an autonomous data orchestration platform to simplify your production dataflows



episode 104: Build Maintainable And Testable Data Applications With Dagster

An interview about the Dagster framework and how you can use it to build testable and maintainable data applications



episode 103: Data Orchestration For Hybrid Cloud Analytics

An interview about the emerging category of data orchestration platforms and how they can be used to bridge the gap between modern and legacy analytics systems



episode 102: Keeping Your Data Warehouse In Order

An interview about Dataform and how it helps you to keep your data warehouse in good working order


 15 October 2019  47m

episode 101: Fast Analytics On Semi-Structured And Structured Data In The Cloud

An interview about the architecture of Rockset and how they built a serverless platform for fast and flexible analytics on your semi-structured data


 08 October 2019  54m

episode 100: Ship Faster With An Opinionated Data Pipeline Framework

An interview about how the open source Kedro framework makes it faster and easier to build your end-to-end data pipeline for machine learning projects


 01 October 2019  35m

episode 99: Open Source Object Storage For All Of Your Data

An interview about MinIO, an open source platform that runs everywhere and provides fast, flexible object storage for data-intensive applications and analytics


 23 September 2019  1h8m

episode 98: Navigating Boundless Data Streams With The Swim Kernel

An interview about using stateful computation on data streams with the SwimOS kernel to improve your analytics


 18 September 2019  57m