This talk presents Apache Flink’s approach to address this challenge. We discuss Flink’s active memory management, its custom data serialization framework, and its techniques to efficiently operate on binary data. The talk concludes with benchmark results that compare the straight-forward object-on-heap and Flink’s approach.
About the speaker
Fabian Hueske is a PMC member of Apache Flink. He started working on this project as part of his PhD studies at TU Berlin in 2009. Fabian did internships with IBM Research, SAP Research, and Microsoft Research and is a co-founder of data Artisans, a Berlin-based start-up devoted to foster Apache Flink. He is frequently giving talks on Apache Flink at conferences and meetups. Fabian is interested in distributed data processing and query optimization.