UFS Fallback
If a client tries to read data from a worker and that worker is unavailable, the client can automatically fallback to reading the data directly from the Under File System. This ensures the application's read request succeeds without interruption, even if the Alluxio cluster becomes unresponsive.
When an Alluxio client fails to read data from its preferred worker, it will automatically attempt to recover by trying different sources in the following order:
Retry with Local Replicas: If data replication is enabled, the client will first try to read from other workers in the same cluster that have a replica of the requested data.
Failover to Remote Clusters (Multi-AZ): If all local replicas are unavailable and multi-AZ support is enabled, the client will then try to read from workers in other federated clusters, typically located in different Availability Zones.
Fallback to UFS: If all Alluxio workers (both local and remote) are unavailable, the client will fall back to reading the data directly from the Under File System (UFS) as a last resort.
This layered approach maximizes the chance of a successful read while prioritizing the fastest available data source.
The UFS fallback feature is enabled by default. In some cases, such as to prevent a "thundering herd" of requests to your UFS, you may want to disable it.
alluxio.dora.client.ufs.fallback.enabled
If true
, the client will fall back to reading from the UFS if it cannot read from an Alluxio worker.
true
Last updated