Alluxio
ProductsLanguageHome
AI-3.5 (stable)
AI-3.5 (stable)
  • Overview
  • Getting Started with K8s
    • Resource Prerequisites and Compatibility
    • Install on Kubernetes
    • Monitoring and Metrics
    • Cluster Administration
    • System Health Check & Quick Recovery
    • Collecting Cluster Information
  • Architecture
    • Alluxio Namespace and Under File System Namespaces
    • I/O Resiliency
    • Worker Management and Consistent Hashing
  • Storage Integrations
    • Amazon AWS S3
    • HDFS
    • Aliyun OSS
    • Tencent COS
    • Volcengine TOS
    • Google Cloud GCS
  • Client APIs
    • Alluxio Python Filesystem API based on FSSpec
    • FUSE based POSIX API
      • Client Writeback
    • S3 API
  • Caching Operations
    • Cache Preloading
    • Cache Filter Policy
    • Cache Eviction
      • TTL Policy
      • Priority Policy
      • Free CLI Command
  • Resource Management
    • Directory-Based Cluster Quota
    • UFS Bandwidth Limiting
  • Performance Optimizations
    • Read Throughput Via Replicas
    • Reading Large Files
    • Metadata Listing
    • Data Prefetch
    • Writing Temporary Files
  • Security
    • TLS Support
  • Performance Benchmarks
    • Fio (POSIX) Benchmark
    • COSBench (S3) Benchmark
    • MLPerf Storage Benchmark
  • Reference
    • User CLI
    • Metrics
    • S3 API Usage
    • Third Party Licenses
  • Release Notes
Powered by GitBook
On this page
  • Alluxio Enterprise AI Overview
  • Epic Performance
  • Seamless Data Access
  • High Scalability
  • Cost Efficiency
  • Integration with AI Frameworks
  • Deploying with Kubernetes Operator

Overview

Last updated 7 months ago

Welcome to Alluxio Documentation! You will find resources regarding deploying Alluxio, integrations with various tech stacks, API references, and more! If you have any questions, join our Alluxio Community Slack →

Alluxio Enterprise AI Overview

Alluxio Enterprise AI is a high-performance data platform designed to significantly enhance machine learning training and data access through advanced software architecture and intelligent caching capabilities. It bridges the gap between compute and storage, offering a high performance and cost efficient solution for seamless data access with billions objects scalability. Our platform redefines the way AI training and inference accesses data, providing users with a streamlined and efficient method to leverage data wherever it resides.

Epic Performance

Model Training: Achieve up to 20x I/O performance on top of your data lake by leveraging enhanced distributed caching tailored to AI workloads. Alluxio improves IO performance throughout various stages of the training workflow, from reading dataset to writing checkpoints, eliminating GPU idle time.

Model Serving: Up to 10x acceleration for serving models from offline training clusters to offline and online inference nodes compared to serving models directly from object stores. The architecture can be easily scaled to serve thousands of inference nodes without worrying about untimely model updates.

Seamless Data Access

Quickly deploy Alluxio alongside your GPU cluster with Kubernetes and connect it to your storage clusters. Immediately start training jobs with increased performance without needing to explicitly migrate data. Minimize the time to production for machine learning platforms across different cloud and on premise clusters.

High Scalability

Our distributed system architecture handles up to 100 billion objects using commodity hardware on the cloud without sacrificing latency.

Cost Efficiency

Rather than utilizing specialized storage hardware to increase performance, our solution eliminates the need to purchase additional hardware. Alluxio integrates seamlessly with existing data lakes and storage solutions and is deployed on the same hardware as the GPU cluster to self-serve the data with high I/O throughput.

Integration with AI Frameworks

Alluxio Enterprise AI supports various APIs, including POSIX (FUSE-based), S3, and FSSpec, to run workloads via PyTorch, TensorFlow, Apache Ray, or Spark.

Alluxio Enterprise AI is a comprehensive solution designed to meet the demands of modern AI and ML workloads. It delivers exceptional performance, seamless data access, and scalability, making it an essential tool for enterprises aiming to scale their AI operations efficiently.

Deploying with Kubernetes Operator

See on how to install Alluxio on Kubernetes via , a Kubernetes package manager, and , a Kubernetes extension for managing applications.

Install Alluxio on Kubernetes
Helm
Operator
alluxio.io/slack