System Health Check & Quick Recovery
Health Check
Metrics and Dashboard
Liveliness
UFS Data Flow
Cache Hit Rate
sum by (instance) (irate(alluxio_cached_data_read_bytes_total{job="worker"}[5m])) / (sum by (instance) (irate(alluxio_cached_data_read_bytes_total{job="worker"}[5m])) + sum by (instance) (irate(alluxio_ufs_data_access_bytes_total{job="worker"}[5m])))sum(irate(alluxio_cached_data_read_bytes_total{job="worker"}[5m])) / (sum(irate(alluxio_cached_data_read_bytes_total{job="worker"}[5m])) + sum(irate(alluxio_ufs_data_access_bytes_total{job="worker"}[5m])))
General Status
Alluxio Process Readiness
ETCD Readiness
Logs
Alluxio Processes
Kubernetes CSI Driver
Quick Recovery
ETCD Failure
Worker Failure
FUSE Failure
Coordinator Failure
Last updated