How Lyft predicts minute-level demand across millions of locations, balancing latency, noise, and deep vs. time series models.
This blog introduces Shallow Snapshot V2 in Amazon OpenSearch Service, enabling faster, more efficient, and scalable backups using a timestamp-based system with reduced overhead.
A versioned schema evolution strategy for Lakehouse stacks that avoids Spark’s mergeSchema pitfalls while staying engine-flexible.
Five foundational areas—networking, IAM, IaC, CI/CD, and conventions—that will make or break your Azure data platform build.
Build an AI-powered data pipeline with Airflow 3 and Gemini, using LLMs to generate and rank tier lists from LoL match data.
Unify stream and batch ETL with Flink’s declarative Materialized Table—define logic once, let Flink choose the mode.
Step-by-step guide to training a custom chatbot using LoRA fine-tuning on five years of personal Telegram history.
Neon, the serverless Postgres startup with an AI-first dev experience, joins Databricks to power the next-gen AI-native database platform.
Best practices for migrating from Hadoop to an object storage-based Iceberg lakehouse, with performance and sustainability in focus.
Join Luca Bianchi, Field CTO at Neosperience, for a hands-on code-along session exploring key AWS tools like SageMaker and Bedrock to streamline machine learning workflows and boost your AWS skills.
Uber enhanced Kubernetes with custom schedulers and GPU-aware logic to scale multi-tenant Ray workloads for ML efficiently and reliably.
Learn how ClickHouse replaced Spark and ElasticSearch to power real-time analytics at massive scale for Microsoft Clarity.
A real-world example of replacing Kafka pipelines with Rust, cutting CO₂ emissions and cloud costs by 99%.