Spotify shares how it scaled high-quality annotations for ML and GenAI across millions of tracks, moving from manual efforts to a scalable, efficient platform.
Curious if DuckDB is worth it? Alireza covers why this lightweight database is a solid bridge between SQL and Python for data pipelines.
Explore why structured data beats querying raw data, and unpack common governance pitfalls and best practices.
NVIDIA and Deloitte’s AI agents are transforming patient care at Ottawa Hospital by streamlining interactions and reducing admin tasks. See how their tech is changing healthcare.
Databricks shares key insights and a structured five-phase process for smoother, more effective data warehouse migrations, from initial assessment to complete execution.
Adrian Chodkowski covers strategies to optimize Databricks federated queries and manage pushdowns effectively. A must-read for those working with external data sources in Databricks.
Przemysław Baran offers a practical guide to implementing dbt’s Semantic Layer, from setup to production, to support centralized business logic in data projects.
Toby Mao examines dbt’s latest microbatching feature and argues that SQLMesh is a more robust alternative for time-based, incremental processing.
Debezium 3.0.1.Final is out, adding support for Cassandra 5, PostgreSQL 17, and MySQL 9.1, plus new YAML configuration options for Debezium Server.
Taylor Murphy explores AI’s impact on ETL, data analytics, and cost efficiency, with insights for anyone working in data.
Learn how Airflow, Atlan, and OpenLineage enable metadata management and column-level lineage across platforms like AWS and Google Cloud.
Discover strategies from Heineken and Van Oord for building a data-first culture in this leadership-focused webinar.
Join industry experts at the Annual MLOps World & Generative AI Summits! Enjoy FREE virtual workshops and hands-on sessions to boost your skills.