Apache Flink's Edge in Stream Processing
Shekhar Prasad Rajak (~shekhar_prasad) |
Description:
Ever wondered how businesses handle the deluge of real-time data to make instantaneous decisions? As data streams continue to grow exponentially, the need for efficient, low-latency processing becomes paramount. Apache Flink stands out as a robust solution for real-time stream processing, especially in handling out-of-order events and providing exactly-once semantics. This talk will delve into:
* Introduction to Apache Flink and its capabilities: Discover how Flink excels in processing endless streams of data with minimal latency, enabling immediate insights and actions.
* Handling out-of-order data and ensuring exactly-once semantics: Learn how Flink's advanced time and windowing capabilities provide accurate and reliable data processing.
* Comparing Flink with Kafka for complex use cases: Explore scenarios where Kafka's capabilities may fall short, and Flink's strengths in complex event processing and dynamic windowing shine through.
* Real-world examples: From financial services detecting fraud in real-time to ride-sharing platforms optimizing routes on-the-fly, see how Flink outperforms in handling out-of-order events and ensuring low-latency, high-throughput processing.
* Building efficient ETL pipelines: Understand how Flink simplifies ETL processes, making data transformations faster and more efficient compared to traditional batch processing.
Join us to uncover how Apache Flink is redefining real-time stream processing and why it's a crucial tool for modern data-driven solutions.
Prerequisites:
- Familiarity with the concept of stream processing and its importance in handling real-time data.
- Knowledge of how stream processing differs from batch processing.
Speaker Info:
Shekhar is passionate about Open Source Softwares and active in various Open Source Projects. He has contributed SymPy, Ruby gems like: daru, daru-view (author), Bundler, NumPy & SciPy. He has successfully completed Google Summer of Code 2016, 17, also worked as Admin for SciRuby & mentored. Shekhar was speaker at RubyConf 2018, PyCon 2017, ApacheCon 2020 on “Running ML algorithms with ML tools available in Apache Ecosystem” & “Cluster Management in Apache Ecosystem & Kubernetes”.
Speaker Links:
https://shekharrajak.github.io/