Arroyo Stream Processing Engine
Arroyo is a distributed stream processing engine written in Rust, designed to efficiently perform stateful computations on streams of data. Unlike traditional batch processors, streaming engines can operate on both bounded and unbounded sources, emitting results as soon as they are available.
In short: Arroyo lets you ask complex questions of high-volume real-time data with sub-second results.
Arroyo can be self-hosted, or used via the Arroyo Cloud service managed by Arroyo Systems.
Some example use cases include:
There are already a number of existing streaming engines out there, including Apache Flink, Spark Streaming, and Kafka Streams. Why create a new one?
Arroyo ships as a single binary, which can be easily installed locally or run in a container.
See the getting started guide to get up and running with a local Arroyo deployment.
Arroyo supports several deployment targets for production use, including native support for Kubernetes. See the deployment docs for more information.
Arroyo is fully open-source under the Apache 2.0 license.
Commercial support is offered by Arroyo Systems, the creators of Arroyo. Reach out to support@arroyo.systems to get in touch.
Community support is offered via the Arroyo Discord where the Arroyo development team and community are actively engaged in helping users get started and solve their probelms with Arroyo.
By default, Arroyo collects limited and anonymous usage data to help us understand how the system is being used and to help prioritize future development.
You can opt out of telemetry by setting DISABLE_TELEMETRY=true
when running
Arroyo services.