ARTICLES
Hermes: A Text-to-SQL solution at Swiggy | 9 min | Data Science | Amaresh M, Rutvik Reddy | Swiggy Bytes — Tech Blog
At Swiggy, the team developed Hermes, an AI tool that lets users ask questions in natural language and get SQL queries and results in Slack. This blog explores Hermes' development, from initial challenges to the refined V2, enhancing productivity and data access across teams.
Bridging the Gap Between Business Stakeholders and Data Modelers | 5 min | Data Management | Camila Birocchi | Xebia Blog
Effective collaboration between data modelers and business stakeholders is critical to successful data projects. This blog covers roles, challenges, and strategies to ensure clear communication and strategic alignment for optimal outcomes.
TUTORIALS
Real-time Analytics: architecture, technologies and example implementation in e-commerce | 6 min | Real-time analytics | Piotr Pękala | GetInData | Part of Xebia Blog
This blog delves into how real-time analytics can transform data collection, transformation, and analysis to provide immediate insights and actionable information, focusing on e-commerce implementation.
Netflix Maestro and Apache Airflow — Competitors or Companions in Workflow Orchestration? | 20 min | Data Engineering | Volker Janz | Data Engineer Things
How Netflix Maestro and Apache Airflow can complement each other. Delve into their features, strengths, and use cases to uncover whether they are companions or competitors.
No, Data Engineers Don’t NEED dbt. | 11 min | Data Engineering | Leo Godin | Data Engineer Things
The need to learn dbt depends on your company's use of it. Dbt transforms data using SQL and is valuable in SQL-centric environments. This blog explains how dbt addresses issues like dependency management, dynamic SQL, automatic materialization, and testing. It discusses when it may or may not be the right fit for your data challenges.
Elevate your Databricks development workflow with SHALLOW CLONE | 5 min | Data Engineering | Tomasz Kostyrka | Seequality Blog
In this blog, Tomasz demonstrates how to use the SHALLOW CLONE functionality to streamline development. This feature accelerates the creation of dedicated development environments and enables comprehensive data testing within the CI/CD pipeline.
PODCAST
CEO of LlamaIndex, Jerry Liu on Generative AI, LLMs, ChatGPT, RAG, Entrepreneurship, with Raja Iqbal | 1 h 20 min | Gen AI | Jerry Liu, Raja Iqbal | Data Science Dojo
Jerry Liu, Co-founder and CEO of LlamaIndex, and Raja Iqbal, CEO of Data Science Dojo, delve into generative AI, LLMs, and entrepreneurship. They discuss the applications of LlamaIndex, Retrieval Augmented Generation (RAG), fine-tuning for enterprise use, and the potential risks and societal implications of AI, providing valuable insights for enthusiasts and entrepreneurs alike.
DATA TUBE
Use real time eventstreams in Microsoft Fabric | 26 min | Real-time processing | Kamil Nowinski | Kamil Data Geek - Azure Explained
Eventstream lets you easily capture, transform, and route real-time events with a no-code experience. Explore EventStore for robust monitoring and deep insights into your cluster's state, querying events at various levels and correlating them to enhance diagnostics.
Accelerate your productivity with the Kedro extension for VS Code | 5 min | Data Engineering | Kedro
In this video we show our Kedro extension for VS Code, which integrates Kedro projects with Visual Studio Code. It's designed to streamline your workflow and enhance your productivity with features such as enhanced code navigation and autocompletion.
CONFS EVENTS AND MEETUPS
Azure Fest | Nieuwegein | 11th September
Azure Fest NL is a free, one-day community event featuring world-class speakers who share real-world experiences, best practices, and the latest developments on the Azure platform.
________________________
Have any interesting content to share in the DATA Pill newsletter?
➡ Dig previous editions of DataPill