Pinterest unified its ML stack using Ray to enable scalable training, hyperparameter tuning, and modular end-to-end pipelines.
Canva connects experimentation with business outcomes by measuring impact at scale across its product ecosystem.
SeatGeek shares how it ensures reliable and fault-tolerant event publishing across microservices using the transactional outbox pattern.
Microsoft Fabric notebooks now support high-concurrency mode for faster and more efficient pipeline execution.
This opinionated take argues that RAG and prompt engineering are often more effective than fine-tuning large language models.
Netflix introduces its Unified Data Architecture to power batch, streaming, and ML pipelines across a scalable and modular platform.
Google BigQuery adds support for OBJECT data types, enabling native querying of unstructured formats like PDFs, images, and audio.
A new open-source Python notebook for building reactive dashboards with reproducible, modular code and minimal boilerplate.
Firebolt releases its high-speed query engine as a free open-source option for local or hybrid data environments.
A solution accelerator that connects Genie with Slack through n8n to trigger workflows and automate operations from chat.
In this podcast, the LlamaIndex CEO breaks down how to build GenAI systems that handle complex document workflows and scale in the enterprise.
How Meta’s initial approach caused problems, and how the company worked to fix them at organizational scale.
How Expedia uses real-time A/B test monitoring with Apache Flink to detect anomalies early, preventing revenue loss and improving experiment reliability.
A practical guide to dimensional data modeling in Databricks using Delta Lake, Unity Catalog, and Delta Live Tables to build scalable, BI-ready star schemas.