A sharp, funny critique of today’s bloated lakehouse stacks that questions whether complexity has become the product. If you’ve ever wrangled Iceberg and wondered “why?”, read this.
Get a look at how OpenAI uses Kafka and Flink to power real-time, event-driven GenAI systems. Great architecture snapshot for anyone scaling LLMs.
This post argues that diverse, high-quality third-party data is what truly powers responsible AI. It’s a strategic look at model inputs, not just algorithms.
Pinterest trained a new model on 16,000 user actions to improve feed ranking with massive gains in repins and fewer hides. One of the strongest personalization upgrades we’ve seen.
A hands-on breakdown of how to design autonomous AI agents that plan, act, and learn. If you’re building beyond chat interfaces, this is a practical architectural guide.
Agent Bricks is a new framework for building, monitoring, and deploying RAG agents at scale. Integrated with Unity Catalog and MLflow for production-readiness.
The deep tech behind Apache Iceberg. This spec defines how its table format handles snapshots, schema changes, and metadata. It's a foundational reference for engineers implementing or extending Iceberg in production systems.
Mai-Lan Tomsen Bukovec from AWS reflects on how S3 shaped the modern data stack. Covers consistency, metadata, and the rise of data lakes.
A full hands-on project using Kafka, Spark, and ML to detect fraud in real time. Covers architecture, model training, and scaling.
A focused event on applied ML, GenAI, and LLM systems in production. Use code AS40 for 40% off tickets.
Jack Ye dives into the new Iceberg spec supporting cross-table transactions with SQL isolation. Also: Gravitino integration and future plans for Hudi and Paimon.
A hands-on demo of dbt on Snowflake: visualize DAGs, run tests, and deploy pipelines with ease using the new native dbt integration.
With its Crunchy Data acquisition, Snowflake launches managed Postgres built for secure, AI-ready transactional workloads on the Data Cloud.