ParquetSnapshotStorage

ParquetSnapshotStorage()

Storage that caches expressions as Parquet files using a snapshot invalidation strategy.

This storage class saves intermediate results as Parquet files in a specified directory and uses a snapshot-based approach for cache invalidation. The snapshot strategy ensures cached data is only invalidated when the expression’s definition changes, making it suitable for stable datasets.

Parameters

Name Type Description Default
source ibis.backends.BaseBackend The backend to use for execution. Defaults to xorq’s default backend. required
path Path The directory where Parquet files will be stored. Defaults to xorq.options.cache.default_path. required

Methods

Name Description
exists Check if the expression has been cached.

exists

exists(expr)

Check if the expression has been cached.

Parameters

Name Type Description Default
expr ir.Expr The expression to check required

Returns

Name Type Description
bool True if the expression is cached, False otherwise