What, How and When to Delta Lake - Live Coding Session
This live coding session is a gentle introduction to the latest and greatest of Delta Lake (https://delta.io/).
You will learn what Delta Lake is and what challenges it aims to solve. You will hear how Delta Lake builds upon the features of Apache Spark 3 and how it can complement your data processing workloads.
During this talk, I'm going to touch upon the slogan from the main page of Delta Lake: "Delta Lake is an open-source storage layer that brings ACID transactions to Apache Spark™ and big data workloads."
You will also learn about time travel and data versioning using Spark tables in Spark SQL and Spark Structured Streaming.
Target audience: Anybody who's willing to learn
Short Bio: Jacek is an IT freelancer specializing in Apache Spark, Delta Lake, and Apache Kafka, with brief forays into the wider data engineering space (e.g. Trino and ksqlDB, mostly during Warsaw Data Engineering meetups). He is best known for "The Internals Of" online books, available free of charge at https://books.japila.pl/.
More information: https://www.meetup.com/Trojmiasto-Spark-Meetup/events/276419023/