DataKRK #42 | ML with Flink + lessons learned on Azure Data Platform

Join us for the 42nd DataKRK: Tuesday, 20 Feb, 6PM @ HEVRE, Krakow
Agenda:
Handling online model training with clickstream data using Flink
Batch processing has long been a cornerstone of machine learning deployments. While good-enough for most applications, the list of specialised use-cases that call for a tighter feedback loop is ever growing. Platforms like Spark Streaming deliver near/approximate real-time processing capabilities but they come with inherent constraints. Flink has been growing in popularity in recent years with its uncompromised approach to data streaming challenges. We will present how proposed architecture enhances our ability to rapidly adjust and refine online content recommendations, effectively shortening the feedback loop and making our machine learning models more responsive to current user interactions.
Azure Data Platform as Code
The aim of this session is to demonstrate how an enterprise-ready Azure Data Platform can be set up from scratch in days instead of months. I will present the most important lessons I've learned over the last year while working on such an automation framework. I'll discuss failures, dead ends, drawn conclusions, and the approach we ultimately developed and successfully implemented.
During this one-hour session, among other topics:
- Landing Zones, Cloud Adoption & Cloud Scale Analytics Frameworks - why should the 'Data people' also understand this stuff?
- Automation from day one & Everything as Code.
- Networking, Security, Monitoring.
- Why a bunch of accelerators work better than an out-of-the-box solution.
- Project timeline, proper analysis and collaboration with the client - the keys to success.
Speakers:
Zbigniew Królikowski | Senior ML Engineer & Team Lead at VirtusLab
Focus of my work is applying software engineering principles to reliably solve complex business problems and repeatably bring value out of Machine Learning and Data Science projects.
Tomasz Kostyrka | Data Platform Architect, GetInData - Part of Xebia
For 10 years in Data projects based mostly on the Microsoft platform. In recent years eagerly escaping into topics related to Cloud Architecture and DevOps.
Remember to mark your attendance, and see you on Feb 20th!