Getting started
Setting up Arroyo locally
Arroyo ships as a single, self-contained binary for Linux and MacoS or as a docker container. It can be run in two modes: as a persistent multi-job session cluster, or a single-job pipeline cluster. When run locally or in Docker, the default sqlite database is used to store configuration data.
Starting a cluster
Locally
The easiest way to try out Arroyo is to run it locally. Currently Linux and MacOS are well supported.
For MacOS, we provide a Homebrew tap that can be used to install Arroyo:
For MacOS and Linux, you can also install the binary with the following script:
Alternatively, you can download the binary for your OS and architecture from the releases page.
Once you’ve installed Arroyo, you can run it with the arroyo
command:
A local cluster can be started with
Then, open the Web UI in your browser at http://localhost:5115.
With Docker
Arroyo can also run in Docker. Note that by default, a docker cluster will not persist the set of pipelines and tables.
If you get an error like
Then you have another service running on that port. Either stop that service, or rebind to a different port with
-p 5215:5115
for example.
Then, open the Web UI at http://localhost:5115.
Running a single pipeline
In addition to the multi-tenant session cluster mode, Arroyo can also be configured to run a single pipeline
via the CLI as a pipeline cluster via the arroyo run
subcommand:
By default, arroyo run
will read a SQL query from STDIN, or the query can be provided
as an argument.
See the pipeline cluster docs for more details.
Next steps
Now you’re ready to create your first pipeline! Continue on to the tutorial to learn how.