All videos
All videos
Probabilistic data structures for Kafka Streams
October 9, 2019
Apache Kafka has emerged as the world's most popular data streaming platform backbone. Only recently, a new flavour of a stream processing has been introduced - Kafka Streams. However, while simple and lightweight, it lacks a high-level API for probabilistic data structures, which could be a way to provide a considerable solution for the bottlenecks.
In this talk, we will briefly describe the foundational concepts behind both Kafka Streams and probabilistic data structures and present our efforts in creating a library, which connects powerfully Kafka Streams pipeline builders to a variety of modern big data tools for approximate result computation.
About the speakers

Mateusz Owczarek
Scala Developer at AVSystem
Mateusz is Scala developers at AVSystem with 4 years of experience in commercial software development. Mateusz is a Kafka enthusiast, currently working on a generic…
Read more

Miron Ficak
Scala Developer at AVSystem
A Scala developer at AVSystem with 4 years of experience in commercial software development. Miron is a Linkyfi developer, creating data analytics services for stream…
Read more
Other videos that you might like

How to recommend a personalised product to thousands of clients starting with raw data to real-time serving
Hubert Pomorski

Descriptive statistics – the mighty dwarf of data science
Paweł Rzeszuciński

Big and smart data in the development of autonomous vehicles
Grzegorz Wyszyński

Stream processing in telco – case study based on Apache Flink & TouK Nussknacker
Maciek Próchniak