Getting started with Apache Beam
The Apache Beam project is in the process of bootstrapping. This includes the creation of project resources, the refactoring of the initial code submission, and the formulation of project documentation, planning, and design documents. Until the project is fully initialized, this page contains useful resources to learn more about the model and tools which comprise Apache Beam.
Articles & slides
- The world beyond batch: Streaming 101
- The world beyong batch: Streaming 102
- Dataflow/Beam & Spark: A Programming Model Comparison
- Introducing Apache Beam
- Dataflow and open source - proposal to join the Apache Incubator
Current code
The following GitHub repositories contain code which will be incorporated into Apache Beam.
These code repositories will be refactored and managed together (along with other code and new contributions) into a single repository.
Documentation
- Apache Beam incubation proposal
- Apache Beam technical vision
- Apache Beam technical documentation