Building Resilient Data Systems for Modern Enterprises at Astrafy with Andrea Bombino
Efficient data orchestration is the backbone of modern analytics and AI-driven workflows. Without the right tools, even the best data can fall short of its potential. In this episode, Andrea Bombino, Co-Founder and Head of Analytics Engineering at Astrafy, shares insights into his team’s approach to optimizing data transformation and orchestration using tools like datasets and Pub/Sub to drive real-time processing. Andrea explains how they leverage Apache Airflow and Google Cloud to power dynamic data workflows.
Key Takeaways:
(01:55) Astrafy helps companies manage data using Google Cloud.
(04:36) Airflow is central to Astrafy’s data engineering efforts.
(07:17) Datasets and Pub/Sub are used for real-time workflows.
(09:59) Pub/Sub links multiple Airflow environments.
(12:40) Datasets eliminate the need for constant monitoring.
(15:22) Airflow updates have improved large-scale data operations.
(18:03) New Airflow API features make dataset updates easier.
(20:45) Real-time orchestration speeds up data processing for clients.
(23:26) Pub/Sub enhances flexibility across cloud environments.
(26:08) Future Airflow features will offer more control over data workflows.
Resources Mentioned:
Andrea Bombino - https://www.linkedin.com/in/andrea-bombino/
Astrafy - https://www.linkedin.com/company/astrafy/
Apache Airflow - https://airflow.apache.org/
Google Cloud - https://cloud.google.com/
dbt - https://www.getdbt.com/
Apache Airflow Survey - https://astronomer.typeform.com/airflowsurvey24
Thanks for listening to “The Data Flowcast: Mastering Airflow for Data Engineering & AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.
#AI #Automation #Airflow #MachineLearning
The Data Flowcast: Mastering Airflow for Data Engineering & AI