Quickstart
Installation
Xorq can be installed using pip:
Or using nix to drop into an IPython shell:
Quick Start
Write a Simple Pandas UDF
Save this file as example.py
.
Create a Sentiment Analysis Pipeline
You can also build more complex pipelines that process data and expose endpoints via Arrow Flight:
Save this file as sentiment_pipeline.py
.
Serve Your Pipeline
CLI Commands
Build (experimental)
Xorq makes it easy to serialize the pipeline in a diffable and human-readable format, including YAML for expressions, compiled SQL, and deferred reads. Once these artifacts are checked into git, we can build validation, lineage, and documentation tools in the CI/CD process.
You can also build more complex pipelines:
The build artifacts are serialized to disk in the builds
directory by default:
Run
Execute the serialized expressions by using xorq run
:
Serve (coming soon)
Deploy your pipeline as a service:
Advanced Features
Using Arrow Flight for Microservices
Create data microservices with Arrow Flight:
Expose your data pipeline as a service using Arrow Flight:
And now, we can connect and test our service: