Alluxio
Edge-1.1 (stable)

Configuring Cache Filter

Last updated 7 months ago

What is Cache Filter

Sometimes not all data should go into the cache, and a customized admission policy is necessary. For example:

  • Cache storage space is limited, and the "hotness" of the data (or tables) is known, so only the hot data should be cached.

  • Certain data is critical and must always be fresh, so it should not be cached, e.g. data lake manifest files.

Cache Filter is an advanced feature that lets the administrator define customized logic to control what data can be admitted into the cache.

Filter Types

Currently, the supported filter types are:

  • CACHE_ALL: Admit all data into the cache.

  • ALLOW_LIST: Admit only data that matches one of the regular expressions in the list.

  • BLOCK_LIST: Admit all data except data that matches one of the regular expressions in the list.
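
The three filter types can be pictured as a small decision function. The sketch below is illustrative only and is not Alluxio's implementation: the function name `admits` is hypothetical, the type identifiers are taken from the sample policies on this page, and it assumes that a regex must match the whole path, which this page does not state.

```python
import re


def admits(filter_type: str, patterns: list[str], path: str) -> bool:
    """Return True if `path` would be admitted into the cache (illustrative model)."""
    # Assumption: a pattern must match the entire path, not just a substring.
    matched = any(re.fullmatch(p, path) for p in patterns)
    if filter_type == "CACHE_ALL":
        return True          # everything is admitted
    if filter_type == "ALLOW_LIST":
        return matched       # only listed data is admitted
    if filter_type == "BLOCK_LIST":
        return not matched   # listed data is kept out
    raise ValueError(f"unknown filter type: {filter_type}")
```

Under these assumptions, an allow list and a block list built from the same patterns make exactly opposite decisions for any given path.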

Configuring Cache Filter

Enable Cache Filter

Set the following configuration in alluxio-site.properties. Please refer to the installation guide for instructions on how to update alluxio-site.properties.

alluxio.user.client.cache.filter.enabled=true
alluxio.user.client.cache.filter.class=alluxio.client.file.cache.filter.PatternBasedCacheFilter

Set Policy

Below are some sample policies for each filter type.

Example - Cache All

{
  "filterType": "CACHE_ALL"
}

Example - Allow List

{
  "filterType": "ALLOW_LIST",
  "regxPatternStrList": ["s3://some-test-bucket/.*test.*", "s3://some-test-bucket/.*NOTICE.*"]
}

Example - Block List

{
  "filterType": "BLOCK_LIST",
  "regxPatternStrList": ["s3://some-test-bucket/.*test.*", "s3://some-test-bucket/.*NOTICE.*"]
}
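
A typo in a policy can be easy to miss, so it may help to sanity-check a policy before deploying it. The following is a minimal sketch, not an Alluxio tool: `check_policy` is a hypothetical helper, it assumes the policy is a single JSON object with the keys shown in the examples above, and it compiles patterns with Python's regex engine, which may differ from the engine Alluxio uses in edge cases.

```python
import json
import re

# Filter type identifiers as they appear in the sample policies.
FILTER_TYPES = {"CACHE_ALL", "ALLOW_LIST", "BLOCK_LIST"}


def check_policy(text: str) -> list[str]:
    """Return a list of problems found in a policy; an empty list means none found."""
    try:
        policy = json.loads(text)
    except json.JSONDecodeError as e:
        return [f"invalid JSON: {e}"]
    if not isinstance(policy, dict):
        return ["policy must be a JSON object"]

    problems = []
    ftype = policy.get("filterType")
    if not isinstance(ftype, str) or ftype not in FILTER_TYPES:
        problems.append(f"unknown filterType: {ftype!r}")

    patterns = policy.get("regxPatternStrList", [])
    if not isinstance(patterns, list):
        problems.append("regxPatternStrList must be a list")
        patterns = []
    # Assumption: a list-based filter with no patterns is probably a mistake.
    if ftype in ("ALLOW_LIST", "BLOCK_LIST") and not patterns:
        problems.append("regxPatternStrList is empty or missing")

    for p in patterns:
        if not isinstance(p, str):
            problems.append(f"pattern is not a string: {p!r}")
            continue
        try:
            re.compile(p)
        except re.error as e:
            problems.append(f"bad regex {p!r}: {e}")
    return problems
```

Calling `check_policy` on the text of any of the three sample policies above should return an empty list.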

The policies are set in cache_filter.properties. Please refer to the installation guide for instructions on how to use this file.