ARTICLES
Modernizing Sports Betting with Real-Time Data Streaming | 7 min | Streaming Architecture | Mitchell Gray | Ververica Blog
How sports betting platforms use Flink streams to power live odds, fraud detection, and personalization with exactly-once guarantees.
TUTORIAL
Behind Viewer Retention Analytics at Scale | 8 min | Data Engineering | Vimeo Engineering Blog
Vimeo details how they process billions of play events to measure second-by-second viewer engagement.
7 Drop-In Replacements to Instantly Speed Up Your Python Data Science Workflows | 6 min | Data Engineering | Jamil Semaan | NVIDIA Developer Blog
Swap Pandas, NumPy, and scikit-learn for GPU-optimized drop-ins like cuDF, CuPy, and cuML for instant speedups.
Machine Learning Data Preprocessing in DuckDB | 6 min | ML & Data Engineering | Petrica Leuca | DuckDB Blog
New SQL-native functions for one-hot encoding, normalization, and scaling let you prep ML data directly in DuckDB.
DATA LIBRARY
Data & AI Monitor Report 2025–2026 | AI & Data Strategy | Xebia
Industry benchmarks on GenAI adoption, MLOps maturity, and platform modernization for the year ahead.
NEWS
Introducing Streaming Agents | 7 min | AI & Data Streaming | Mayank | Confluent Blog
Confluent launches event-driven, LLM-powered agents that combine Kafka streams with real-time context.
Apache Ozone 2.0.0: Open Source Cloud Object Store for the Data Lake Era | 3 min | Data Infrastructure | The Apache Software Foundation Blog
The new version brings better scale, security, and performance to this S3-compatible object store for big data.
Tecton Joins Databricks to Power Real-Time Data for Personalized AI Agents | 6 min | AI & Data Infrastructure | Akhil Gupta, Mani Parkhe, Patrick Wendell | Databricks Blog
Databricks acquires Tecton to bring real-time feature pipelines and personalization into its Data Intelligence Platform.
PODCAST
LLM Deployment on Kubernetes with llm-d | 52 min | LLM | Abdel Sghiouar, Kaslin Fields | Kubernetes Podcast
How llm-d brings autoscaling, GPU scheduling, and native model serving to Kubernetes.
EVENTS, CONFS, AND MEETUPS
AWS Community Day Baltic | September 10 | Gdynia, Poland
The first Baltic AWS Community Day brings 20+ speakers, 300+ attendees, and keynotes from AWS advocates Gunnar Grosch and Viktor Vedmich.
PINNACLE PICKS
Your top picks from last week:
ML Observability: Bringing Transparency to Payments and Beyond | 9 min | Machine Learning | Tanya Tang, Andrew Mehrmann | Netflix Tech Blog
Netflix details how they monitor and explain ML models in payments, capturing metrics, tracing predictions, and detecting drift. Built for compliance, reliability, and scaling observability patterns into other domains.
Agentic Data Access at Meta: Warehouse Agents Balancing Productivity and Security | 8 min | Data Infrastructure | Can Lin, Uday Ramesh Savagaonkar, Iuliu Rus, Komal Mangtani | Meta Engineering Blog
Meta introduces a multi-agent framework where query agents and owner agents collaborate to ensure secure warehouse access. Balances speed for analysts with compliance and audit guarantees.
flink-mcp | 4 min | Streaming & AI
A Python package connecting Apache Flink with the Model Context Protocol, letting AI models stream predictions and decisions into real-time pipelines.
________________________
Have any interesting content to share in the DATA Pill newsletter?
