DATA Pill feed

DATA Pill #138 - Parquet & AI = 🙅‍♂️⛔️? Archetypes of LLM apps

ARTICLES

Parquet & AI = 🙅‍♂️⛔️? | 4 min | Data Engineering | Julien Hurault | Personal Blog
A deep dive into Parquet’s limitations for AI workflows and emerging formats like Lance and Nimble, designed for the AI era.
Top 10 Data & AI Trends for 2025 | 12 min | Data & AI | Barr Moses | Towards Data Science
Explore 2025’s biggest trends, from small data models to operationalizing generative AI effectively.
Archetypes of LLM apps| 12 min | LLM | Philip I. Thomas | Contraption Blog
Unpack LLM application types—from code generation to advanced AI swarms—and transform workflows.
The 70% problem: Hard truths about AI-assisted coding | 7 min | AI | Addy Osmani | Personal Blog
AI coding tools boost productivity but need human expertise to ensure quality and maintainability.

TUTORIALS

Real-time Postgres CDC to Iceberg: Why it matters? | 5 min | Real-time data processing | RisingWave Labs | Dev Genius Blog
Streamline Postgres to Iceberg pipelines with CDC using RisingWave’s built-in connectors.
Talk to Airflow — Build an AI Agent Using PydanticAI and Gemini 2.0| 15 min | AI | Volker Janz | Data Engineer Things
Learn how PydanticAI simplifies building production-grade AI agents for Apache Airflow.
Airflow 3.0 promises us REAL event driven scheduling | 6 min | Data Engineering | Josip Bartulović | Data Engineer Things
Discover how Airflow 3.0 revolutionizes scheduling with real-time, event-driven pipelines.

PODCAST

How developers (really) used AI coding tools in 2024 | 39 min | AI | Ben Popper, Tariq Shaukat | Substack Overflow Podcast
Insights on AI in coding, pull requests, and what’s next for software development in 2025.
How Orchestration Impacts Data Platform Architecture | 1 h | Data Engineering | Tobias Macey, Hugo Lu | Data Engineering Podcast
Navigate data orchestration’s impact on modern platform design with expert insights.

DATA TUBE

Git Basics: 01 - Intro to source control | 40 min | Streaming | Piotr Tybulewicz | Tybul on Azure
Master Git in this beginner-friendly series, starting with why Git is essential for developers.
________________________
Have any interesting content to share in the DATA Pill newsletter?
➡ Join us on GitHub
2025-01-02 12:58