DATA Pill feed

DATA Pill #130 - Top 7 Alternatives to Apache Flink, How to run data science projects

ARTICLES

Top 7 Alternatives to Apache Flink | 6 min | Real-Time Analytics | Bobur Umurzokov | Personal Blog
This article covers the top 7 alternatives to Apache Flink—GlassFlow, Spark, KsqlDB, Arroyo, RisingWave, Quix, and Bytewax—highlighting comparable stream processing features and solutions to Flink’s challenges.
An introduction to the Analytics Engineering Framework (AEF), a scalable approach to building data platforms on Google Cloud. AEF solves inefficiencies by focusing on platforms over pipelines and helps teams maintain robust data workflows.
How to run data science projects | 6 min | ML | Dzidas Martinaitis | Personal Blog
Dzidas Martinaitis shares his approach to running practical data science projects, from defining problems to setting success metrics, which he has built from nine years of experience at Amazon and AWS.

TUTORIALS

Data Design Pattern: Medallion Architecture - is it really a new way of doing things? | 6 min | Data Lakehouse | Daniel Sahal | GetInData | Part of Xebia Blog
In this blog post, you will learn what medallion architecture is, the characteristics of each layer of this pattern and how it differs from the classic data warehouse layers.
A step-by-step guide to deploying the Gemma 2 model with LoRA adapters on Google Kubernetes Engine, allowing flexible, high-performance language model inference on GCP.
How we built ngrok's data platform | 15 min | Data Engineering | Christian Hollinger | ngrok Blog
Christian Hollinger shares the story of building ngrok’s data platform from scratch, covering the challenges and solutions for global-scale, real-time data processing.

PODCAST

Data-driven Excellence: AI and Analytics in Action | 33 min | AI | Q McCallum, Anna Anisin, Matthew Denesuk, Jaime Russ | Data Science Salon Podcast
Listen as industry experts discuss using a Center of Excellence model to drive AI and data strategies, with insights from Royal Caribbean and Ryder System leaders.

DATA TUBE

NVIDIA NIM: Deploying Generative AI at the Speed of Light | 43 min | Gen AI | Mateusz Szczęsny | MOPS Community
Learn how NVIDIA's NIM microservices optimize AI model deployment with cloud-native speed and efficiency, seamlessly integrating on-premises and cloud infrastructure.
Watch out! Security danger with Azure Databricks Delta Tables | 49 min | Data Engineering | Piotr Tybulewicz | Tybul on Azure
Accidental read access to Delta tables in ADLSg2 is a common risk when working with ACLs and Databricks permissions. This guide identifies and fixes unintended access and ensures your data remains secure.

CONFS, EVENTS AND MEETUPS

Explore how North built a scalable, real-time, in-house fraud detection system that replaced a third-party solution to better adapt to new fraud patterns. Join Ben Orkin, VP of Engineering - MLOps, for insights on North's technical decisions, business drivers, and the future of fraud detection innovation.
________________________
Have any interesting content to share in the DATA Pill newsletter?
➡ Join us on GitHub
➡ Dig into previous editions of DATA Pill
2024-11-05 12:32