DATA Pill #194 – Agent Design, Spark Resilience, LLM Scaling & Semantic Layers

ARTICLES

Scaling LLM Post‑Training at Netflix | Netflix | Baolin Li, Lingyi Liu, Binh Tang, Shaojing Li | 7 min | LLM Infrastructure

Netflix shares how it scales post‑training of large language models (fine‑tuning and reward modelling) across hundreds of GPUs. Topics include distributed optimization, scheduling on heterogeneous clusters, evaluation pipelines and lessons learned from deploying domain‑specific LLMs for personalization and content creation.

Drastically Reducing Out‑of‑Memory Errors in Apache Spark at Pinterest | Pinterest | Felix Loesing | 12 min | Data Infrastructure

Pinterest processes over 90k Spark jobs daily. Felix explains how they cut OOM failures by detecting high‑memory tasks and retrying them on larger executors (“Auto Memory Retries”). The approach makes executor sizing elastic, launching larger profiles only when needed, leading to fewer job failures and lower on‑call load.
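The retry idea can be sketched in a few lines of Python. This is only an illustration of "start small, escalate on OOM", not Pinterest's implementation: the profile sizes, the `Task` class, and `run_with_memory_retries` are all hypothetical names invented for this example.

```python
from dataclasses import dataclass

# Hypothetical executor profiles in GB, ordered from smallest to largest.
PROFILES_GB = [4, 8, 16]


class TaskOutOfMemory(Exception):
    """Raised when a task needs more memory than its executor offers."""


@dataclass
class Task:
    name: str
    needed_gb: int  # real memory need, unknown to the scheduler up front

    def run(self, executor_gb: int) -> str:
        # Simulate an OOM failure on an executor that is too small.
        if self.needed_gb > executor_gb:
            raise TaskOutOfMemory(f"{self.name} exceeded {executor_gb} GB")
        return f"{self.name} ok on {executor_gb} GB"


def run_with_memory_retries(task: Task, profiles=PROFILES_GB):
    """Try the smallest profile first; escalate only after an OOM failure."""
    attempts = []
    for executor_gb in profiles:
        attempts.append(executor_gb)
        try:
            return task.run(executor_gb), attempts
        except TaskOutOfMemory:
            continue  # retry the same task on the next, larger profile
    raise TaskOutOfMemory(f"{task.name} failed on every profile: {attempts}")


result, attempts = run_with_memory_retries(Task("join_stage", needed_gb=6))
print(result, attempts)  # join_stage ok on 8 GB [4, 8]
```

Most tasks succeed on the smallest profile, so the larger and more expensive executors are only paid for by the few tasks that actually need them.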

Scaling Localization with AI at Lyft | Lyft | Stefan Zier | 6 min | AI in Localization

Lyft describes its AI‑powered localization platform that translates and adapts UI strings across dozens of languages. The system combines machine translation, LLM‑based context extraction, and human review loops to deliver high‑quality localized copy at scale, reducing turnaround times for product launches.

Tracking Feature Lineage in Feast with OpenLineage | Feast | Nikhil Kathole & Francisco Javier Arceo | 5 min | ML Ops

Feast integrates OpenLineage to automatically capture feature lineage. It records every step from data sources to materialized feature tables, giving unified visibility across systems and simplifying debugging and compliance checks.

How to Stub External Services in Integration Tests | Allegro Tech | Piotr Klimiec | 8 min | Testing & Microservices

Piotr illustrates how to improve integration tests by grouping stubs by service rather than using monolithic helpers. He proposes a stub‑builder pattern that centralises external service stubs, uses fluent APIs to express expectations (stub.userRegistry().willReturnUserPermissions()), and makes test intent obvious. This pattern reduces boilerplate and makes complex integration flows easier to read.
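A rough Python analogue of the fluent call quoted above might look like this. It is a sketch under assumptions, not the article's code: the class names, the `will_fail_with` method, the `/permissions` path, and the in-memory response map (standing in for whatever HTTP stubbing tool a test suite uses) are all hypothetical.

```python
class UserRegistryStub:
    """All stubs for one external service live in one builder (hypothetical)."""

    def __init__(self, responses: dict):
        self._responses = responses

    def will_return_user_permissions(self, permissions=("READ",)):
        # Register a canned success response for the permissions endpoint.
        self._responses[("GET", "/permissions")] = (200, list(permissions))
        return self  # returning self keeps the API fluent

    def will_fail_with(self, status: int):
        # Register a canned error response for the same endpoint.
        self._responses[("GET", "/permissions")] = (status, None)
        return self


class Stub:
    """Single entry point that exposes one builder per external service."""

    def __init__(self):
        self.responses = {}

    def user_registry(self) -> UserRegistryStub:
        return UserRegistryStub(self.responses)


stub = Stub()
stub.user_registry().will_return_user_permissions(["READ", "WRITE"])
print(stub.responses[("GET", "/permissions")])  # (200, ['READ', 'WRITE'])
```

A test then reads as a statement of intent ("the user registry will return these permissions") instead of a block of request-matching boilerplate.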

How to Get the Most Out of Your Agents, Part I | Xebia | Rogier van Geer | 5 min | Agent Engineering

This primer outlines why thoughtful agent design beats naive prompting. It covers setting clear goals, picking the right abstraction (single vs multi‑agent), measuring agent behaviour and iterating on prompts, policies and tools. A good starting point if you’re moving from chat demos to production agents.

Subagents and Web Search in Claude Code | Ollama | 4 min | Agent Tools

Ollama introduces subagents that run tasks in parallel (file search, code exploration, research) and built‑in web search for Claude Code. The post explains how to spawn subagents to audit security, find performance bottlenecks, or map database queries, and shows how web search integrates current information into coding sessions.

Semantic Layer Architectures Explained: Warehouse‑Native vs dbt vs Cube | Typedef | 10 min | Data Architecture

A comprehensive guide comparing three approaches to semantic layers: warehouse‑native (Snowflake/Databricks), transformation‑layer (dbt MetricFlow) and OLAP‑acceleration (Cube). It breaks down where to locate semantic logic, the trade‑offs in performance and governance, and how the different patterns evolved.

Make Your AI Better at Data Work with dbt’s Agent Skills | dbt | Joel Labes & Jason Ganz | 14 min | Developer Tools

dbt Labs introduces agent skills: bundles of prompts and scripts that embed dbt best practices into AI assistants. Skills cover analytics engineering (building models, writing tests), semantic modeling with MetricFlow, platform operations (troubleshooting, configuring MCP servers) and migration tasks. The post explains how to install and use these skills to turn general coding agents into competent data agents.

NEWS

Pandas 3.0 Released | Pandas | 5 min

Pandas 3.0 introduces a dedicated string dtype, copy‑on‑write semantics for predictable behavior, improved datetime resolution, and a new pd.col syntax. It also removes previously deprecated features, so existing code may need updates when upgrading.

DuckDB Vortex Extension Released | DuckDB | Guillermo Sanchez | 8 min | Database Engines

DuckDB announces native support for the Vortex columnar file format. Vortex offers late decompression and compute-in-storage capabilities, allowing DuckDB to filter and process data directly on compressed blocks. The extension is available as a core DuckDB plugin and supports heterogeneous data types and GPU acceleration.

TUTORIALS & BOOKS

Complete Guide to Building Skills for Claude | Anthropic | ~10 min | Skills & LLMs

A hands‑on tutorial covering how to build, test and distribute skills for Claude. It walks through creating skill files, using evaluation harnesses, and publishing skills for use in Claude Code or other AI assistants.

DATATube

Introducing Databricks AI Dev Kit - Skills, MCP server, Builder App | Dustin Vannoy | 15 min

AI-Driven Development (vibe coding) on Databricks just got a whole lot better! The Databricks AI Dev Kit gives your AI coding assistant (Claude Code, Cursor, etc.) the trusted sources it needs to build faster and smarter on Databricks.

AI SaaS explained in 7 min.. | Caleb Writes Code | 15 min

SaaS-based stocks are down more than 60%, and traditional software companies are facing the first wave of AI disruption. How will the AI industry and the broader economy value software companies as agents and agentic applications change pricing and unit economics?

TOOLS

dbt Agent Skills Repo

A curated collection of agent skills for dbt tasks (model building, testing, debugging, semantic layer creation) packaged for Claude Code and other agents. Use these to improve your agent’s data‑engineering abilities.

HashiCorp Agent Skills

An open library of skills for Terraform and Packer. Skills encode best practices and patterns, helping AI assistants generate, test, refactor and manage infrastructure code with consistent style and security.

Qwen 3.5 Model

Qwen’s new model version features improved reasoning, longer context and API endpoints for custom agent applications.

AI Dev Kit

Databricks’ open-source kit provides templates, connectors and orchestration scripts for building generative AI applications on the Databricks Lakehouse.

MCP Aggregator

A server and library that aggregates content for MCP agents. It supports indexing web pages, documents and codebases, exposing them through a unified MCP interface.

CONFS, EVENTS, WEBINARS & MEETUPS

Data & AI That Matter – From systems to people: building data and AI with real-world impact | Webinar Online | March 26, 2026

Not all data and AI initiatives deliver value. This webinar series focuses on impact over hype — how data platforms, AI systems, and teams can be designed to genuinely support people, decisions, and outcomes.

PINNACLE PICKS

Your top picks from the last edition:

Docker Sandboxes: Run Claude Code and Other Coding Agents Unsupervised—But Safely | Docker | Eric Jia, Srini Sekaran | 6 min | AI Tooling & Security

Docker explains how sandboxed environments let teams run autonomous coding agents without exposing host systems. A practical approach to balancing agent autonomy with isolation, permissions, and reproducibility.

10 strategies to reduce MCP token bloat | The New Stack | Bill Doerrfeld | 7 min | LLM Infrastructure

A practical guide to controlling token growth in Model Context Protocol (MCP) systems. Covers schema design, context pruning, and architectural patterns that keep agent costs predictable at scale.

AI in Practice: Claims Handling, Video Labeling & Scalable AI Use Cases | Diederick Greveling, Jeroen Overschie | 1h

In this video, we explore practical, real-world AI use cases that are already transforming organizations, including:

AI-powered claims handling
Video labeling and intelligence
Automated decision support and operational optimization

_____________________

Have any interesting content to share in the DATA Pill newsletter? Reach Out!

2026-02-19 23:02