Find out how the semantic layer changes modern data systems by adding context to raw data, making it more accessible across tools. This post examines how this layer solves challenges like "semantic mistrust" and powers scalable, purpose-driven data applications.
Curious about the new kid on the block, Apache Polaris? This article shows how Snowflake’s latest open-source project is shaking up the table format landscape, making it easier to manage metadata and work across multiple engines.
BlaBlaCar leveled up their SQL transformations by adopting dbt™, which helped them manage over 4,000 tables with ease. Learn how dbt™ improved collaboration, automation, and the overall developer experience for their team.
Change Data Capture (CDC) extracts incremental data changes for analytics and system synchronization without affecting operational databases. This tutorial breaks down the benefits of log-based CDC, highlighting its efficiency and low impact on performance.
Explore the shift toward declarative data stacks, which make data processes simpler and more flexible. This approach lets engineers focus on what must be done while the system handles the details.
Airflow Datasets bring automation to trigger workflows based on specific events. This tutorial walks you through using Google Cloud Pub/Sub with Airflow Datasets to create dynamic, event-driven pipelines.
Microsoft announced the public preview of Azure Databricks Mirrored Catalog, enabling direct access to Databricks Unity Catalog tables from Fabric. Users can now create a read-only, replicated copy in OneLake via the UI and explore data with SQL Endpoint or Power BI. This blog covers setting up the mirrored database and its pros and cons.
The Apache Flink community is preparing for Flink 2.0, the first major release in 8 years, bringing new features and compatibility-breaking changes. A preview release is now available to help users and partners adapt early and provide feedback.
Heineken shares how it uses generative AI to drive consumer insights and streamline operations. This session offers practical insights into how AI is reshaping large-scale businesses.
The Big Data Technology Warsaw Summit returns on April 9-10, 2025! Submit your speaking proposal and join over 500 professionals as they dive into the latest in data engineering and big data technology.
We are a community partner of Infoshare Katowice, an event for developers and architects, as well as for IT team leaders, managers, and entrepreneurs from tech companies and the GameDev industry.
3 stages:
- GROWTH – Business development, growth strategies, case studies from leaders, insight into people & culture
- DEV ARCHITECTURE – System architecture, programming, and software engineering
- DEV CODE STAGE – Coding techniques, programming languages and developer tools
Promo code for our community: ISK24-DP10
The event for the Ml/Gen AI community comprised over 20,000 ML researchers, engineers, scientists, and entrepreneurs across several disciplines.
Taken from the real-life experiences of practitioners, the Steering Committee has selected the top applications, achievements, and knowledge areas to highlight across the event.
You can use code DataPill for 20% off all tickets.