ARTICLES
Top 7 Alternatives to Apache Flink | 6 min | Real-Time Analytics | Bobur Umurzokov | Personal Blog
This article covers the top 7 alternatives to Apache Flink—GlassFlow, Spark, KsqlDB, Arroyo, RisingWave, Quix, and Bytewax—highlighting comparable stream processing features and solutions to Flink’s challenges.
Stop Thinking in Data Pipelines, Think in Data Platforms: Introducing the Analytics Engineering Framework | 7 min | Data Platforms | Oscar Pulido | Google Cloud - Community
An introduction to the Analytics Engineering Framework (AEF), a scalable approach to building data platforms on Google Cloud. AEF solves inefficiencies by focusing on platforms over pipelines and helps teams maintain robust data workflows.
How to run data science projects | 6 min | ML | Dzidas Martinaitis | Personal Blog
Dzidas Martinaitis shares his approach to running practical data science projects, from defining problems to setting success metrics, which he has built from nine years of experience at Amazon and AWS.
TUTORIALS
Data Design Pattern: Medallion Architecture - is it really a new way of doing things? | 6 min | Data Lakehouse | Daniel Sahal | GetInData | Part of Xebia Blog
In this blog post, you will learn what medallion architecture is, the characteristics of each layer of this pattern and how it differs from the classic data warehouse layers.
Deploy Gemma2 with multiple LoRA adapters with TGI DLC on GKE | 10 min | LLM | Hugging Face Blog
A step-by-step guide to deploying the Gemma 2 model with LoRA adapters on Google Kubernetes Engine, allowing flexible, high-performance language model inference on GCP.
How we built ngrok's data platform | 15 min | Data Engineering | Christian Hollinger | ngrok Blog
Christian Hollinger shares the story of building ngrok’s data platform from scratch, covering the challenges and solutions for global-scale, real-time data processing.
PODCAST
Data-driven Excellence: AI and Analytics in Action | 33 min | AI | Q McCallum, Anna Anisin, Matthew Denesuk, Jaime Russ | Data Science Salon Podcast
Listen as industry experts discuss using a Center of Excellence model to drive AI and data strategies, with insights from Royal Caribbean and Ryder System leaders.
DATA TUBE
NVIDIA NIM: Deploying Generative AI at the Speed of Light | 43 min | Gen AI | Mateusz Szczęsny | MOPS Community
Learn how NVIDIA's NIM microservices optimize AI model deployment with cloud-native speed and efficiency, seamlessly integrating on-premises and cloud infrastructure.
Watch out! Security danger with Azure Databricks Delta Tables | 49 min | Data Engineering | Piotr Tybulewicz | Tybul on Azure
Accidental read access to Delta tables in ADLSg2 is a common risk when working with ACLs and Databricks permissions. This guide shows how to identify and fix unintended access so that your data remains secure.
CONFS EVENTS AND MEETUPS
FIRESIDE CHAT: How to Build a High-Performance Fraud Detection System | 20th November | Online
Explore how North built a scalable, real-time, in-house fraud detection system that replaced a third-party solution and adapts better to new fraud patterns. Join Ben Orkin, VP of Engineering - MLOps, for insights on North's technical decisions, business drivers, and the future of fraud detection innovation.