Arroyo currently supports a limited set of primitive types, shown in the table below. Because every Arroyo pipeline is a compiled Rust crate, each type has a corresponding Rust type.

Arroyo leverages Apache DataFusion for SQL support. DataFusion is in turn a part of the Apache Arrow project, a cross-language data format for columnar in-memory computation.

Depending on which part of the system you’re working with one or more of these will be relevant. For basic SQL users you should only need to deal with the SQL types.

These types can also be inserted as literals in sql. For types like INT vs BIGINT it will infer the type based on context.

ArroyoRustPythonArrowSql TypesExample Literals
BooleanboolboolBooleanBOOLEANTRUE, FALSE
Int32i32intInt32INT, INTEGER0, 1, -2
Int64i64intInt64BIGINT0, 1, -2
Uint32u32intUInt32INT UNSIGNED, INTEGER UNSIGNED0, 1, 2
Uint64u64intUInt64BIGINT UNSIGNED0, 1, 2
Float32f32floatFloat32FLOAT, REAL0.0, -2.4, 1E-3
Float64f64floatFloat64DOUBLE0.0, -2.4, 1E-35
StringStringstrUtf8VARCHAR, CHAR, TEXT, STRING"hello", "world"
TimestampSystemTimeTimestampTIMESTAMP'2020-01-01', '2023-05-17T22:16:00.648662+00:00'
BytesVec<u8>BinaryBYTEAX'A123' (hex)

Composite types

In addition to the primitive types above, Arroyo SQL supports two forms of composite types: arrays and structs.

Array types

Arrays group together zero or more elements of the same type. Arrays are declared by suffixing another type with [], for example an INT array can be declared with INT[]. Arrays values can be indexed using 1-indexed subscript notation (v[1] is the first element of v).

Arrays can be constructed via [] literals, like [1, 2, 3].

Arroyo provides a set of array functions for manipulating array values; arrays may also be unnested using the UNNEST operator.

Struct types

Structs allow combining related fields into a single value. Structs are declared using this syntax: struct<field_name field_type, ..>, and may contain any other type, including arrays and other structs.

For example:

CREATE TABLE events (
    properties STRUCT <
      user_id TEXT,
      timings INT[],
      name STRUCT <
        first TEXT,
        last TEXT
      >
    >
)

Structs fields can be accessed via . notation, for example properties.name.first.