DATA Pill feed

DATA Pill #144 - Train 400x faster Static Embedding Models, LLMs and Graphs Synergy

ARTICLES

7 Powerful Questions to Define and Execute Your Data Strategy | 3 min | Data Strategy | Steven Nooijen | Xebia Blog
Define a data strategy that delivers business impact. Seven key questions to align data efforts, balance short-term wins, and engage the right stakeholders.
Bridging the Data Divide: How Confluent and Databricks Are Unlocking Real-Time AI | 3 min | Real-Time AI | Jay Kreps, Ali Ghodsi | Confluent Blog
Confluent and Databricks integrate to enable real-time AI, combining governance and event-driven learning for fraud detection, personalization, and automation.

TUTORIALS

Train 400x faster Static Embedding Models with Sentence Transformers | 12 min | LLM | Tom Aarsen | HuggingFace Blog
Train ultra-fast static embedding models on CPUs while maintaining high accuracy. Explore two new models and open-source scripts for efficient AI applications.
Understanding Reasoning LLMs | 9 min | LLM | Sebastian Raschka | Personal Blog
Deep dive into techniques like inference scaling and reinforcement learning to enhance LLMs for complex problem-solving, using DeepSeek R1 as a case study.
How AI Agents & Data Products Work Together to Support Cross-Domain Queries & Decisions for Businesses | 14 min | LLM | Travis Thompson, Brij Mohan Singh, Ritwika Chowdhury | Modern Data 101 Blog
How knowledge graphs, multi-agent AI, and RAG workflows improve enterprise decision-making and governance.

PODCASTS

Why Legal Hurdles Are the Biggest Barrier to AI Adoption | AI | 39 min | Ben Lorica, Andrew Burt | The Data Exchange Podcast
Navigating AI legal risks, compliance hurdles, and the rise of AI regulation.
LLMs and Graphs Synergy | LLMs | 35 min | Kyle Polich, Garima Agrawal | Data Skeptic Podcast
Using knowledge graphs to refine LLMs, boost accuracy, minimize hallucinations, and enhance decision-making.

DATA TUBE

Step-by-step guide to building a local RAG-powered app for querying PDFs securely with LangChain and vector databases.
Deep Dive into LLMs like ChatGPT | LLMs | 3 h 31 min | Andrej Karpathy
A comprehensive breakdown of LLMs, their training stack, and best practices for real-world applications.

CONFS, EVENTS AND MEETUPS

In this panel, experts share best practices to help data leaders integrate governance into their strategies for better decision-making.

PINNACLE PICKS

Your top picks from last week:
ETL is Dead | 4 min | Data Engineering | Vincent Rainardi | Personal Blog
ETL has evolved into EL + in-platform transformation, making data modeling and optimization the focus of modern engineering.
The Golden Path Revolution | 4 min | Platform Engineering | Robert Sahlin | Personal Blog
Golden Paths simplify complex workflows, turning data engineering into a strategic enabler for innovation.
Data & AI Summit Webinars | Online | 20th February & 18th March
Sessions on large-scale ML deployment and AI-driven model development at Truecaller and more.
________________________
Have any interesting content to share in the DATA Pill newsletter?
➡ Join us on GitHub