DATA Pill feed

DATA Pill #139 - PySpark Fixes, Netflix Insights & BigQuery Tools!

ARTICLES

The wall that wasn’t | 13 min | AI | Duncan Anderson | Barnacle Labs
Explore how AI models like OpenAI’s o3 and Google’s Gemini use Chain of Thought to push reasoning capabilities forward.
Title Launch Observability at Netflix Scale | 6 min | Data Observability | Varun Khaitan | Netflix TechBlog
Netflix shares how their "Title Health" framework improves content launches and builds robust systems.
Compare top data quality tools to find the right solution for governance, observability, and no-code prep.

TUTORIALS

Building a SQL Bot with LangChain, Azure OpenAI, and Microsoft Fabric | 8 min | Data Science | Mariusz Kujawski | Personal Blog
AI coding tools boost productivity but need human expertise to ensure quality and maintainability.
Serverless, Location-Aware Search for web and mobile apps with Agent Builder & BigQuery | 16 min | AI | Łukasz Olejniczak | Google Cloud - Community
Learn to build smarter, location-aware search engines using BigQuery and Agent Builder.
Mastering Spark: Session vs. DataFrameWriter vs. Table Configs | 10 min | Data Lakehouse | Miles Cole | Personal Blog
Getting ISO year right in PySpark | 3 min | Data Processing | Morten Gammelgaard Hannibalsen | Personal Blog
Fix PySpark's ISO 8601 issues with UDFs or custom logic for accurate year and week alignment.
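The mismatch behind this fix can be seen without Spark. Below is a minimal sketch in plain Python, not the linked article's code: it uses the standard library's `date.isocalendar()` to show that the ISO year of a date near New Year can differ from its calendar year. The helper name `iso_year_week` is hypothetical; in PySpark, logic like this could be wrapped in a UDF, as the blurb suggests.

```python
from datetime import date


def iso_year_week(d: date) -> tuple:
    """Return (ISO year, ISO week number) for a calendar date.

    Hypothetical helper for illustration; relies only on the standard
    library's date.isocalendar(), which follows ISO 8601 week rules.
    """
    iso = d.isocalendar()
    return iso[0], iso[1]


# Dates close to New Year are where calendar year and ISO year diverge.
samples = [date(2024, 12, 30), date(2021, 1, 3), date(2024, 6, 15)]
for d in samples:
    iso_year, iso_week = iso_year_week(d)
    print(f"{d.isoformat()}  calendar year {d.year}  ISO {iso_year}-W{iso_week:02d}")
```

Pairing the calendar year with an ISO week number would label 2024-12-30 as week 1 of 2024 instead of week 1 of 2025, which is the kind of misalignment the article addresses.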

TOOL

A Pythonic DataFrame and ML API powered by BigQuery for analytics and machine learning.

DATA TUBE

Building Large Language Models (LLMs) | 1 h 44 min | LLM | Yann Dubois | Stanford Online
Learn how to build ChatGPT-like models with this comprehensive Stanford lecture.

DATA TUBE

Machine Learning with AWS | Webinar | 31st January
Master Git in this beginner-friendly series, starting with why Git is essential for developers.
________________________
Have any interesting content to share in the DATA Pill newsletter?
➡ Join us on GitHub