Ecosystem Track  

Apache Beam: Portability in the times of Real Time Streaming

Apache Beam was open sourced by the big data team at Google in 2016 and has since grown into an active community with contributors from around the world. Beam is a framework for defining data processing workflows and running them on various runners, Flink included. In this talk, I will show some of the things you can do with Beam and Flink, such as running pipelines written in Go and Python, and then highlight some useful tools in the Beam ecosystem. Finally, we'll wrap up with capabilities we expect to have soon, and how you can get involved.

Authors

Pablo Estrada
Google

Pablo Estrada

Pablo is a Software Engineer from Mexico City. He lives in Seattle and works on Apache Beam and Google Cloud Dataflow. He's worked all across the Beam stack, mainly in the Python and Java SDKs. Before working at Google, he earned a Master's degree at Seoul National University, doing computational linguistics research. His favorite activities are traveling and drinking tea with the locals.