Time Series Analysis… using an Event Streaming Platform
6:00pm-6:10pm Online Networking (feel free to BYOB!!)
6:10pm-7:40pm: Time Series Analysis… using an Event Streaming Platform, Mirko Kämpf, Solutions Architect, Confluent
6:40pm-7:00pm: Q&A
Hello Kafkateers!
In order to do our part to help flatten the curve of the spread of COVID-19, we are moving all of our meetups online for the time being. Please find the details to join this fun and informative meetup below.
Joining our slack space is not instant, so ensure that you are in, in time for the event, follow the steps within this link before the day of the event if you can! https://launchpass.com/confluentcommunity
Find information about upcoming meetups and tons of content from past Kafka Meetups all over the world:
cnfl.io/meetup-hub
Speaker:
Mirko Kämpf, Solutions Architect, Confluent
Title:
Time Series Analysis… using an Event Streaming Platform
Abstract:
Advanced time series analysis (TSA) requires very special data preparation procedures to convert raw data into compatible formats.
In this presentation you will see typical processing patterns for TSA, from simple statistics to reconstruction of correlation networks and interaction graphs.
The first case is relevant for anomaly detection and to protect safety.
Reconstruction of graphs from time series data is a very useful technique to better understand complex systems like supply chains, material flows in factories, information flows within organizations, and especially in medical research.
With this motivation we will look at typical data aggregation patterns, how to apply analysis algorithms in the cloud, and into a reference architecture for TSA on top of the Confluent Platform, which is backed by Apache Kafka.
You will see how we use a common data model across multiple components: starting with custom logic to produce data (samples), Kafka Streams applications, and also ksqlDB applications with user defined functions.
-----
Bio:
Mirko Kämpf is a Solutions Architect at Confluent. Previously he worked as a solutions architect for Cloudera, and as a technical trainer at Cloudera University, where he gained vast experience on the Hadoop and Spark ecosystem, as well as in cloud-based data management with modern architectures to support the data driven businesses.
Most recently, Mirko has dedicated his time to improving a methodology to generate time series episodes and correlation graphs from event-streams. This work is related to the emerging data economy which has to adopt its data consumption patterns to highly dynamic data providers which allow access to more and more sources in a meaningful way.
Mirko has authored several technology related blogs about his time series and graph processing at scale at the Cloudera engineering blog. He has given talks at GraphConnect London, Strata London and Strata New York in the last two years. He also organized and delivered hands-on workshops for developers and data scientists covering event based data analysis at scale for the GridKA-Summerschool at KIT Karlsruhe over 5 years.
In his spare time, he enjoys cooking for his family and friends – especially smoked food – and sailing.
Contact:
LinkedIn: www.linkedin.com/in/kamir
Twitter: @semanpix
------
Online Meetup Etiquette:
•Please unmute yourself when you have a question.
•Please hold your questions until the end of the presentation or use the zoomchat!
•Please arrive on time as zoom meetings can become locked for many reasons.