Let's look at some of the potential caching technologies for the application.
GlobalKTable as Cache
A GlobalKTable replicates the full source topic to every application instance, so each instance holds the complete, continuously updated dataset.
Because the GlobalKTable is materialized in a local state store, reads are low-latency local lookups.
Using a GlobalKTable, we can store the policy data and, after transforming it into a policy module, apply it to incoming events.
With far too many policies, though, the table can strain memory, and rebuilding it with all the policies on an application restart can take time. Moreover, since we don't need to execute every policy against each event, we must fetch specific policies on demand, and retrieving a particular policy module from a GlobalKTable isn't straightforward.
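The mechanism behind a GlobalKTable can be sketched without the Kafka Streams API. This is a conceptual illustration, not Kafka's actual classes: each instance replays the full topic into a local map, keeping the latest value per key, and reads become plain local lookups. The record type and policy names are made up for the example.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class GlobalTableSketch {
    // One changelog record; a later record for the same key overrides an
    // earlier one, mirroring the semantics of a compacted Kafka topic.
    record PolicyRecord(String key, String value) {}

    // Replay the full topic into a local map, keeping the latest value per key.
    static Map<String, String> materialize(List<PolicyRecord> topic) {
        Map<String, String> localStore = new HashMap<>();
        for (PolicyRecord r : topic) {
            localStore.put(r.key(), r.value());
        }
        return localStore;
    }

    public static void main(String[] args) {
        // Every instance replays the *entire* topic, so each one ends up
        // holding the complete policy dataset locally.
        Map<String, String> store = materialize(List.of(
                new PolicyRecord("policy-a", "rego source v1"),
                new PolicyRecord("policy-b", "rego source v1"),
                new PolicyRecord("policy-a", "rego source v2")));

        // Reads are plain local lookups: no network hop, low latency.
        System.out.println(store.get("policy-a")); // rego source v2
    }
}
```

This also makes the trade-off visible: the map holds every policy, whether or not the current event needs it.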
KCache
KCache is an in-memory cache that gains persistence by backing up its data to a log-compacted Kafka topic. The data from the topic is read into an in-memory cache that implements the java.util.SortedMap interface, and log compaction guarantees that at least the last known value for each message key is retained.
Because persistence is delegated entirely to Kafka, we need not worry about managing it ourselves, and a remote copy of the data always exists: after an application restart or in-memory data loss, the cache can be rebuilt from the topic.
We could use KCache to store the policy data, but we can't store a policy module directly, as there isn't a built-in Serde for OPA policy modules.
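The recovery behaviour can be sketched in plain Java. This is not KCache's API, only the underlying mechanism: the in-memory sorted map is rebuilt by replaying the compacted backing topic, where a record with a null value (a tombstone) deletes its key.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class KCacheSketch {
    // Rebuild the in-memory map by replaying the backing topic from the
    // beginning; a null value is a tombstone that removes the key.
    static TreeMap<String, String> restore(List<? extends Map.Entry<String, String>> topic) {
        TreeMap<String, String> cache = new TreeMap<>(); // exposed as a SortedMap
        for (Map.Entry<String, String> record : topic) {
            if (record.getValue() == null) {
                cache.remove(record.getKey());
            } else {
                cache.put(record.getKey(), record.getValue());
            }
        }
        return cache;
    }

    public static void main(String[] args) {
        List<Map.Entry<String, String>> topic = List.of(
                new SimpleEntry<>("policy-b", "rego source"),
                new SimpleEntry<>("policy-a", "rego source"),
                new SimpleEntry<>("policy-b", null)); // policy-b was deleted

        TreeMap<String, String> cache = restore(topic);
        System.out.println(cache.keySet()); // [policy-a], in sorted order
    }
}
```

Note that the values here are plain strings; this is exactly why a compiled policy module cannot go into KCache directly without a Serde to turn it into bytes and back.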
Redis
Redis (Remote Dictionary Server) is an open-source, in-memory database that stores data as key-value pairs, offering high-speed reads and writes with low response times.
Redis also provides a few options for persisting data:
RDB (Redis Database): Redis captures a snapshot of the dataset at specified intervals and persists it to disk. RDB is not a good fit if the application cannot tolerate losing the writes made since the last snapshot.
AOF (Append Only File): Redis logs every write operation the application performs to the AOF. On restart, all the logged operations are replayed to reconstruct the dataset.
RDB + AOF: You can also combine the two, keeping the durability of the AOF while allowing faster restarts.
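As a sketch, the three persistence modes map onto redis.conf directives like these (the specific values are illustrative, not recommendations):

```conf
# RDB: snapshot to disk if at least 100 keys changed in the last 60 seconds
save 60 100

# AOF: log every write; the fsync policy trades durability against latency
appendonly yes
appendfsync everysec

# RDB + AOF: let AOF rewrites start with an RDB preamble for faster loading
aof-use-rdb-preamble yes
```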
For the policy engine application, Redis could hold the policy data instead of the file system. But again, Redis doesn't provide a way to store the compiled policy modules directly.
Caffeine
Caffeine is a high-performance, in-memory caching library for Java. It is inspired by Google's Guava cache but adds various improvements, most notably the Window TinyLFU eviction policy, which tracks access frequency to estimate how useful an entry has been.
Based on that access history, TinyLFU decides whether a newly accessed item should be admitted to the cache in place of the current eviction candidate.
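The admission decision can be sketched in plain Java. This is a deliberate simplification of Window TinyLFU, with an exact counter table standing in for Caffeine's compact frequency sketch and insertion order standing in for its LRU victim selection: when the cache is full, a newcomer is admitted only if its estimated frequency beats the eviction candidate's.

```java
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.Map;

public class TinyLfuSketch {
    // Access-frequency estimates; real TinyLFU uses a compact
    // count-min-style sketch with periodic aging, not exact counters.
    private final Map<String, Integer> freq = new HashMap<>();
    // Insertion-ordered map; the eldest entry is the eviction candidate.
    private final LinkedHashMap<String, String> cache = new LinkedHashMap<>();
    private final int maxSize;

    TinyLfuSketch(int maxSize) { this.maxSize = maxSize; }

    private void recordAccess(String key) { freq.merge(key, 1, Integer::sum); }

    String get(String key) {
        recordAccess(key);
        return cache.get(key);
    }

    void put(String key, String value) {
        recordAccess(key);
        if (cache.size() < maxSize || cache.containsKey(key)) {
            cache.put(key, value);
            return;
        }
        // Cache is full: compare the newcomer against the eviction candidate.
        String victim = cache.keySet().iterator().next();
        if (freq.getOrDefault(key, 0) > freq.getOrDefault(victim, 0)) {
            cache.remove(victim);   // admit the newcomer, evict the candidate
            cache.put(key, value);
        }                           // otherwise the newcomer is rejected
    }

    boolean contains(String key) { return cache.containsKey(key); }
}
```

The useful property is visible in the sketch: a key touched once cannot push out a key that has been accessed many times, which is what makes the policy resistant to one-off scans.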
Caffeine is purely in-memory and does not provide any persistence.
We can use Caffeine to store our policy modules. Although memory limits mean we cannot cache every policy module, we can let Caffeine's near-optimal eviction policy decide which ones to keep. On a cache hit, the policy module is available immediately, since modules are stored directly in the cache.
After weighing the volume of incoming events against the number of policies, Caffeine proved the best fit for the application.
Decisions, decisions. With these options you may feel spoilt for choice, but remember that the right cache depends on your use case: what you will be caching, your performance requirements, and how you need to scale.