Cache Management
Alluxio sits between your compute and persistent storage as a caching layer. Understanding how to manage that cache — what goes in, what stays, and what gets removed — is the key to consistent, high-performance workloads.
How the Cache Lifecycle Works
1. Data Enters the Cache
Two paths:
Passive (read-through): On a cache miss, the worker fetches from UFS and caches automatically. Zero configuration — but the first read is slow.
Active preloading: Use job load to push data into the cache before any read happens. This eliminates cold-start latency for scheduled jobs and model serving.
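The difference between the two paths can be illustrated with a toy model. This is a minimal sketch, not Alluxio's implementation: the ReadThroughCache class, its preload method, and the ufs_reads counter are hypothetical names introduced only for illustration.

```python
from typing import Callable, Dict, Iterable


class ReadThroughCache:
    """Toy model of a worker cache sitting in front of a slow store (the UFS)."""

    def __init__(self, fetch_from_ufs: Callable[[str], bytes]) -> None:
        self._fetch = fetch_from_ufs
        self._data: Dict[str, bytes] = {}
        self.ufs_reads = 0  # how many times we had to go to the UFS

    def _load_one(self, path: str) -> bytes:
        # Fetch from the UFS and keep a copy in the cache.
        self.ufs_reads += 1
        value = self._fetch(path)
        self._data[path] = value
        return value

    def read(self, path: str) -> bytes:
        # Passive path: a miss falls through to the UFS and populates the cache.
        if path in self._data:
            return self._data[path]
        return self._load_one(path)

    def preload(self, paths: Iterable[str]) -> None:
        # Active path: populate the cache before any client read arrives.
        for path in paths:
            if path not in self._data:
                self._load_one(path)
```

In this model, the first read of a path pays one UFS fetch, while a path passed to preload is already cached when the first read arrives.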
2. Policies Control What Stays and How Long
Once data is cached, four mechanisms govern its behavior:
Cache Priority (Pinning): marks data as HIGH priority so LRU never evicts it before lower-priority data
TTL: sets a hard expiry; data is evicted when its lifetime expires, regardless of access
Cache Filter: controls admission (immutable, skip cache, or max-age revalidation)
Quota: caps how much cache space a directory tree can consume
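To make the interaction between priority and LRU concrete, here is a minimal sketch of priority-aware LRU eviction. It is an illustration under assumptions, not Alluxio's eviction code: the PriorityLruCache class is hypothetical, capacity is counted in entries rather than bytes, and the MEDIUM level is an assumed default (the text above names only HIGH and LOW).

```python
from collections import OrderedDict
from enum import IntEnum
from typing import Any, List, Optional


class Priority(IntEnum):
    LOW = 0
    MEDIUM = 1  # assumed default level, for illustration only
    HIGH = 2


class PriorityLruCache:
    def __init__(self, capacity: int) -> None:
        self.capacity = capacity
        # One LRU queue per priority; the oldest entry is first in each queue.
        self._queues = {p: OrderedDict() for p in Priority}

    def __len__(self) -> int:
        return sum(len(q) for q in self._queues.values())

    def _find(self, key: str) -> Optional[OrderedDict]:
        for queue in self._queues.values():
            if key in queue:
                return queue
        return None

    def get(self, key: str) -> Any:
        queue = self._find(key)
        if queue is None:
            return None
        queue.move_to_end(key)  # mark as most recently used
        return queue[key]

    def put(self, key: str, value: Any,
            priority: Priority = Priority.MEDIUM) -> List[str]:
        old = self._find(key)
        if old is not None:
            del old[key]
        self._queues[priority][key] = value
        evicted = []
        while len(self) > self.capacity:
            evicted.append(self._evict_one())
        return evicted

    def _evict_one(self) -> str:
        # Drain the lowest non-empty priority first, oldest entry first.
        for priority in sorted(Priority):
            queue = self._queues[priority]
            if queue:
                key, _ = queue.popitem(last=False)
                return key
        raise RuntimeError("cache is empty")
```

With this ordering, a HIGH entry is evicted only when no LOW or MEDIUM entry remains, which mirrors the pinning behavior described above.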
3. Data Leaves the Cache
Three ways data exits:
Automatic eviction: LRU (default), LFU, or FIFO removes data when workers fill up
TTL expiry: background scan removes data whose lifetime has elapsed
Manual removal: job free explicitly purges a path. Use this when invalidating a model version or freeing space on demand.
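The three exits above can be sketched as three small functions over the same bookkeeping. This is a simplified model, not Alluxio's code: Entry, pick_victim, ttl_scan, and free are hypothetical names, and the TTL is assumed to be measured from the time the data was cached, consistent with "regardless of access".

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Entry:
    cached_at: float    # when the data entered the cache
    last_access: float  # most recent read
    hits: int           # number of reads so far


def pick_victim(entries: Dict[str, Entry], policy: str = "LRU") -> str:
    """Return the path an automatic eviction would remove first."""
    if policy == "LRU":
        return min(entries, key=lambda p: entries[p].last_access)
    if policy == "LFU":
        return min(entries, key=lambda p: entries[p].hits)
    if policy == "FIFO":
        return min(entries, key=lambda p: entries[p].cached_at)
    raise ValueError(f"unknown policy: {policy}")


def ttl_scan(entries: Dict[str, Entry], ttl_seconds: float, now: float) -> List[str]:
    """Remove every entry whose lifetime has elapsed, regardless of access."""
    expired = [p for p, e in entries.items() if now - e.cached_at >= ttl_seconds]
    for path in expired:
        del entries[path]
    return expired


def free(entries: Dict[str, Entry], prefix: str) -> List[str]:
    """Manually purge everything under a path prefix."""
    freed = [p for p in entries if p.startswith(prefix)]
    for path in freed:
        del entries[path]
    return freed
```

Note that the same entry can be the victim under one policy and safe under another: a frequently read but old entry survives LFU yet is first out under FIFO.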
Strategy Guide
LLM / ML Model Serving
Examples: fine-tuned models, base model weights, embedding indexes.
Goal: Zero cold-start, critical models never evicted.
Preload before serving: job load --path s3://models/llama3/ --submit --verify
Pin against LRU: alluxio priority add --path s3://models/llama3/ --priority high
On model update: job free --path s3://models/llama3-v1/ --submit, then reload v2
Periodically Updated Data
Examples: daily ETL output, retraining datasets.
Goal: Fresh data with good hit rate.
Use a maxAge filter (e.g. 1d) so Alluxio revalidates on the next access after expiry
Run job load --skip-if-exists after the upstream update to pre-warm the new version
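The max-age behavior can be pictured as a freshness check on each read. The sketch below is an assumption about the general technique, not a description of how Alluxio revalidates internally: CachedObject, read_with_max_age, and the idea of comparing a version token from the UFS are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass
class CachedObject:
    data: bytes
    version: str         # hypothetical token, e.g. an ETag or modification time
    validated_at: float  # last time the cached copy was confirmed fresh


def read_with_max_age(
    entry: CachedObject,
    now: float,
    max_age_seconds: float,
    ufs_version: Callable[[], str],
    ufs_fetch: Callable[[], bytes],
) -> Tuple[CachedObject, str]:
    # Within max age: serve from cache without contacting the UFS.
    if now - entry.validated_at < max_age_seconds:
        return entry, "hit"
    # Past max age: ask the UFS whether the object changed.
    current = ufs_version()
    if current == entry.version:
        # Unchanged: keep the cached bytes and restart the freshness window.
        return CachedObject(entry.data, entry.version, now), "revalidated"
    # Changed: replace the cached copy with the new version.
    return CachedObject(ufs_fetch(), current, now), "refetched"
```

Under this model a 1d window costs at most one metadata check per object per day when the data is unchanged, and stale data is never served for longer than the window.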
Temporary or Checkpoint Data
Examples: training checkpoints, temp query results.
Goal: Avoid filling cache with short-lived data.
Use a skipCache filter to bypass caching entirely, or
Set a short TTL (e.g. 1h) plus LOW priority so eviction hits this data first
Compliance / Sensitive Data
Examples: PII logs, GDPR-scoped directories.
Goal: Hard limit on data lifetime in cache.
TTL with a compliance-aligned window (e.g. 90d)
job free immediately after processing completes