DATA Pill feed

DATA Pill #137 - Your Top Picks of 2024!

ARTICLES

How Pfizer Achieved Self-Service Data Mesh with Snowflake and Azure | 20 min | Data Engineering | Samia Rahman, Marty Hall, Gary Kretzschmar, Christopher Witcher, Jennifer Yoakum, Matthew Massey | Snowflake Blog
This article delves into the strategic deployment of data mesh on platforms like Snowflake and Azure, offering insights from those who have successfully navigated the journey. Through the lens of Pfizer's Data Strategy, Science, and Solutions team, let's explore the pivotal shifts necessary to achieve a robust self-service data ecosystem, illustrating the challenges and triumphs along the way.
What Is a Streaming Database? | 6 min | Real-time analytics | RisingWave Labs | Towards Dev
Streaming databases are designed to process and store large volumes of real-time data, enabling immediate analysis and insights. Unlike traditional batch-processing databases, they handle continuous data flow and are ideal for time-sensitive applications like fraud detection and IoT. These databases support real-time analytics by combining immediate data processing with persistent storage.
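To make the contrast with batch processing concrete, here is a minimal sketch in plain Python, not taken from the article and not tied to any real streaming database. The class name `RunningSpendView`, the threshold value, and the sample events are all hypothetical. It only illustrates the idea of updating stored state on every incoming event so that a result (here, a simple fraud-style alert) is available immediately rather than after a scheduled batch job.

```python
from collections import defaultdict


class RunningSpendView:
    """Toy incrementally maintained view: total spend per card.

    Hypothetical illustration only; a real streaming database would also
    persist this state durably and expose it through SQL.
    """

    def __init__(self, alert_threshold):
        self.alert_threshold = alert_threshold
        self.totals = defaultdict(float)  # kept state: card_id -> running total

    def ingest(self, card_id, amount):
        """Apply one event and report whether the card is now over the threshold."""
        self.totals[card_id] += amount
        return self.totals[card_id] > self.alert_threshold


view = RunningSpendView(alert_threshold=1000.0)

# Events are handled one at a time as they arrive, not collected into a batch.
events = [("card-1", 400.0), ("card-2", 50.0), ("card-1", 700.0)]
alerts = [card for card, amount in events if view.ingest(card, amount)]
print(alerts)
```

The running totals stay queryable between events, which is roughly what "immediate data processing with persistent storage" means in the summary above.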
Dbt vs. Dataform: Which one should you choose? | 11 min | Data Engineering | Na Nguyen (Anna) | Joon Solutions Global Blog
The launch of Dataform has made choosing between it and dbt harder, especially for clients integrated with the GCP ecosystem. To determine the better option, Anna took Dataform through an entire code lifecycle, aiming to identify any pitfalls and compare it fairly with dbt. The review is based on six main aspects:

  • Development
  • Collaboration
  • Deployment
  • Governance
  • Integration
  • Platform Cost
AWS Lambda vs. Cloudflare Workers Detailed Comparison | 7 min | Data Engineering | Kiryl Anoshka | Fively Blog
This article compares AWS Lambda and Cloudflare Workers, focusing on their theoretical capabilities and practical differences across key categories such as performance, runtime, and pricing. It also includes insights on which platform excels and a cold start comparison to highlight their distinctions, particularly for smaller tasks.
Explore common data strategy pitfalls and learn how to avoid them. This article covers key organizational gaps and offers practical steps to help build a more aligned and effective data strategy.
Unlocking Insights with High-Quality Dashboards at Scale | 13 min | Data Science | Skyler Johnson | Spotify Engineering Blog
Discover how Spotify uses Tableau and Looker Studio, guided by a Dashboard Quality Framework, to create thousands of dashboards with consistently high standards. The centralized Dashboard Portal makes it easy to access curated, high-quality insights.
Questions we’re tired of hearing: Why can’t I just query raw data? | 6 min | Data Governance | Bo Lemmers | Xebia Blog
Explore why structured data beats querying raw data, and unpack common governance pitfalls and best practices.

TUTORIALS

Real-time Analytics: architecture, technologies and example implementation in e-commerce | 6 min | Real-time analytics | Piotr Pękala | GetInData | Part of Xebia Blog
This blog post delves into how real-time analytics can reshape data collection, transformation, and analysis to deliver immediate, actionable insights, focusing on an e-commerce implementation.

PODCAST

No Priors Ep. 80 | With Andrej Karpathy from OpenAI and Tesla | 44 min | Gen AI | Andrej Karpathy, Sarah Guo, Elad Gil | No Priors Podcast
Andrej Karpathy, former Tesla Autopilot leader and OpenAI founding member, joins to discuss self-driving cars, Tesla's Optimus robot, and AI's future. He also shares insights on AI education and his new mission, Eureka Labs.

DATA TUBE

How to design, implement, and maintain secure, scalable, and cost-effective lakehouse architectures leveraging Apache Spark, Apache Kafka, Apache Flink, Delta Lake, AWS, and open-source tools.
________________________
Have any interesting content to share in the DATA Pill newsletter?
➡ Join us on GitHub
➡ Dig into previous editions of DataPill