DATA Pill feed

DATA Pill #143 - ETL is Dead, The Golden Path Revolution

ARTICLES

ETL is Dead | 4 min | Data Engineering | Vincent Rainardi | Personal Blog
ETL has evolved into EL + in-platform transformation, making data modeling and optimization the focus of modern engineering.
The Golden Path Revolution | 4 min | Platform Engineering | Robert Sahlin | Personal Blog
Golden Paths simplify complex workflows, turning data engineering into a strategic enabler for innovation.

TUTORIALS

Developing RAG Systems with DeepSeek R1 & Ollama (Complete Code Included) | 4 min | RAG | Sebastian Petrus | Personal Blog
Learn to build RAG systems for precise, cost-effective AI-driven document search and responses.
Image replacement in Canva designs using reverse image search | 6 min | ML | Sam Jacobs | Canva Engineering Blog
Canva automates image replacement with vector databases, ensuring design consistency at scale.
How Meta discovers data flows via lineage at scale | 6 min | Data Management | Rishab Mangla, David Taieb, Wenlong Dong, Gabriela Jacques da Silva, Brani Stojkovic, Slobodan Predolac, Alex Lambert, Francesco Logozzo, Taha Bekir Eren | Meta Engineering Blog
Meta integrates data lineage into privacy infrastructure, enhancing compliance and user data protection.
Enabling advanced GPU features in PyTorch - Warp Specialization| 6 min | ML | Hongtao Yu, Manman Ren, Bert Maher, Shane Nay, Gustav Zhu, Shuhao Jiang | PyTorch Blog
Triton 3.2's warp specialization boosts PyTorch GPU performance by 15% on NVIDIA Hopper.

TOOLS

Opik | LLM
Build and optimize LLM systems with better performance, tracing, and evaluations.
Maestro | VLM
Streamline fine-tuning of vision-language models with prebuilt training recipes.

DATA TUBE

Flink AI Model Inference for GenAI and Real-time Analytics | Gen AI | 14 min | Kai Waehner | Confluent
Learn how Flink enables real-time AI inference for analytics, predictive maintenance, and more.

CONFS, EVENTS AND MEETUPS

Data & AI Summit Webinars | Online | 20th February & 18th March
Sessions on large-scale ML deployment and AI-driven model development at Truecaller and more.

PINNACLE PICKS

Your last week top picks:
How AI and Machine Learning are Fixing Data Quality Fast | 4 min | AI | Michał Kardach, Katarzyna Kusznierczuk | GetInData | Part of Xebia Blog
Discover how AI-driven tools like Monte Carlo and Talend Data Fabric improve data quality for faster insights.
Agents | AI | Julia Wiesinger, Patrick Marlow, Vladimir Vuskovic | Google
See how AI agents enhance decision-making with external tool access for real-time actions.
Explore RAG architecture and strategies for optimizing retrieval-augmented generation models.
________________________
Have any interesting content to share in the DATA Pill newsletter?
➡ Join us on GitHub
Made on
Tilda