ART#214 - What are repository cache modes in ATG?

Back

Next

Welcome to the last, but very important topic of ATG Repositories.
Before we start, we must understand the term "cache".

What is cache?
From a very general perspective in computing(not limited to ATG), cache is a place where some frequently accessed data is stored for "quick access".
You might have a RAM in your computer, where all the programs which are in execution are loaded. Well, RAM is a very big place and has a lot of programs loaded. There might be some program statements which are very frequently accessed. Now, our CPU has to do a lot of work to find and execute those statements in RAM (typically tough to find a few statements in a 4GB/8GB RAM). To overcome this, these frequently accessed statements are stored in "Cache", which is of very low storage capacity (and hence faster). Now, if this cache is of 10MB, it is easier to find some statements in this Cache rather than in an 8GB of RAM.
Therefore, CPU chooses to store frequently accessed statements in Cache for quick and easy access.

This concept is also used in storage and ATG is no different.
For any web-application (ATG or non-ATG), all the data is stored in database tables. A data base hit is ALWAYS an expensive operation. Imagine a website where thousands (or millions) of users access a site which fetches data from the database. It can have a very high performance impact.
Here's when Repository Cache comes into play.
There are various ways to configure cache for repositories, which we will learn soon.

What can be cached?
A simple answer to this question would be, we can cache, repository-items and even queries.

Item Caches
Item cache is the cache for Repository-Items. Repository-Items are indexed on the basis of their repository-ids and the cache is invalidated, whenever an item is updated. The invalidation also depends on the cache-mode you configure.

Query Caches
As the name suggests, it caches the repository-ids corresponding to the queries.
Also, the corresponding repository-items are cached in the Item-Cache separately.

If some query is executed on a repository and query-cache is enabled, the repository will check the cached repository-ids corresponding to this cached query. [from query-cache]
Next, the items corresponding to these repository-ids are fetched from the item-cache.
Items not present in the cache, are fetched from the database.

1. Query-caching is disabled by default.
2. Do NOT use query-caching if the repository incurs frequent updates OR repeated queries are not frequent. It might do more harm than good in this case in terms of performance.
3. Query-cache entry can be invalidated in following cases:-

If any property of a cached-item, which is specified in the query is modified.
Items of queried Repository-Item are added/removed from repository.

Below diagram show the concept of item and query-caches.

What are different cache-modes in ATG?
Cache mode is basically the type of caching you configure. Cache modes mainly deal with when any item in cache should be invalidated, so that user sees consistent data.
Each cache mode has its own pros and cons, so let us understand each of them.

1. disabled
No item is stored in cache. Data is directly fetched from database.
You should use this in cases when the items are very frequently updated and you simply cannot afford to show stale data on your website.
For example, in the case of InventoryRepository's "inventory" item-descriptor, you'll always want latest inventory to be shown on your site. Also, if you enable any cache, then because of frequent updates in inventory, there are very good chances that your data will become stale. Also, maintaining the cache for very-frequently changing items can have a high performance impact. Even more than fetching data directly from the database.

WHEN TO USE?: Use when data is required in real time. E.g. Inventory.
ADVANTAGE: You will always see consistent data.
DISADVANTAGE: Performance. Every-time you fetch data, there will be a hit on the database.

2. simple
In simple cache-mode, each server maintains its OWN COPY of cache. Cache invalidated on one server will not invalidate the cache on any other server.
You should use this mode only in case of items which are rarely updated. For example, most of the item-descriptors in ProductCatalog are configured as "simple".
Also, ProductCatalog is a versioned repository, and data can ONLY be changed via BCC deployment (which automatically clears the cache on all servers for the repository-item being deployed), so you wont have to worry on this one.

We also have to define a parameter, which contains the time before an item's cache is refreshed. You can set the attribute "item-cache-timeout" in milliseconds. The item will be present in the cache for "item-cache-timeout" seconds, after which, the cached item will be refreshed (on next access). If this value is set to zero, the item will be cached forever in the cache, unless manually invalidated.

WHEN TO USE?: Use for data which is hardly modified. [Product description, name etc.]
ADVANTAGE: Simple and fast.
DISADVANTAGE: This may lead to user seeing stale data on some servers. However, if used correctly, this could be a good option.

3. distributed
Distributed cache mode is a bit advanced cache-mode, which is better than simple cache mode but comes at a disadvantage.
Distributed cache mode maintains the cache across all the servers of the application by the use of networking.
This cache-mode is also divided into 3 sub-categories:-

3.1 Distributed TCP
1. Whenever an item is changed across any server, an invalidation event is broadcast across all the servers (which use TCP cache).
2. The message carries some data, e.g. repository id, type etc. to other TCP enabled servers.
3. Other servers receive this data and invalidate this item.

WHEN TO USE?: Use for data which is less frequently modified, but very frequently read. Items which are frequently changed should not use this cache-mode.
ADVANTAGE: More consistent than simple cache mode.
DISADVANTAGE: Network overhead in sending messages. If a server is down (on which the invalidation message is sent), there is no means of knowing whether the server received the invalidation event or not.

3.2 Distributed JMS
1. When an item is changed across any server, a JMS message is fired to invalidate the cache across other JMS enabled servers.
2. A JMS message delivery status is also stored in OOTB databse, hence ensuring the message delivery.

WHEN TO USE?: Use for data which is very frequently modified, but a consistent view is always required.
ADVANTAGE: Ensures better consistent view of data than Simple/Distributed TCP modes.
Delivery of invalidation message is ensured.
DISADVANTAGE: Performance is much slower than Distributed TCP.

3.2 Distributed Hybrid
This cache mode is so far the best among Distributed cache modes.
The invalidation event is sent ONLY to servers on which a particular item is cached.

WHEN TO USE?: Real-time data access.
ADVANTAGE: Better performance, with reduced network traffic.
DISADVANTAGE: Slight network overhead.

4. locked-caching
Locked caching is used, when you want an item to be modified by only 1 server at a time. For example, an order-item [commerceItem] can be modified by both; a user facing the ATG site and a customer service representative on another server.
In this case, you'd want only one person to modify that commerceItem at an instance of time. Here's when locked-caching comes into play.

WHEN TO USE?: For items, which can be modified by multiple servers.
ADVANTAGE: Consistent data view.
DISADVANTAGE: This cache-mode is write-based rather than read-based [like simple, distributed etc], therefore, it cannot be compared with other cache-modes.

If you want some in-depth detail on caching, you can refer to the ATG-Docs HERE.

Now that we have covered the last topic of repositories, we will be moving forward to the much awaited commerce articles..!!

Back

Next

14 comments:

Unknown25 June, 2015 23:29
Hi,it is nice explanation and can u explain differances b/w modes.wt is use of inherited cache mode?
I want to apply the cache in item-descrptor level bt that cache mode dont want to apply in property level.how to write.Explain?
Monis Yousuf26 June, 2015 10:37
Hi Lakshman,

1. DIFFERENCE BETWEEN CACHE MODES: You can find the differences between cache-modes from the above article by the points "When to use", "Advantages" and "Disadvantages" i have written for every cache mode.
Typically an interviewer would expect you to describe when to use which cache-mode as a difference.

2. APPLY CACHE AT ITEM-DESCRIPTOR LEVEL BUT NOT PROPERTY-LEVEL: the default cache mode is always "simple" cache mode.
The "cache-mode" attribute can be used in both;< item-descriptor > tag and a < property > tag.
Yo can define your cache mode you want at item-descriptor level and then you can apply
< property name="your property name" cache-mode="simple" />.
This way, your property will be cached by "simple" cache mode, nomatter what your item-descriptor cache mode is.
Unknown02 July, 2015 05:07
can you post the installation process steps of entire application in video format. I am unable to follw the installation steps provided in commerce documentation.
Unknown31 July, 2015 05:12
Hi Monis ,

U have any idea on code building and deploying by using ant tool in atg application.
Anonymous04 August, 2015 10:25
Hello Monis,

Can you please provide tutorial on using ATG REST Module? A brief documentation on how to configure and use OOTB Web service will be a great help
Sudhakar19 August, 2015 01:22
Hi Monis,
Grasping ATG from Oracle documentation is challenging even for experienced engineer and you made it possible. Thank you.

Couple of questions:
1. What is the best caching strategy for OrderRepository?
We have enabled(simple mode) caching with timeout of 15 min and we're seeing tons of ConcurrentUpdateExceptions. My answer is we should disable caching for OrderHistory and if needed use profile locks around OrderUpdates so that updates from multiple apps can be coordinated
2. Can you elaborate on cache-mode inherit at property level?

Can you please share your experience

Anonymous12 February, 2016 15:40
Excellent...
Unknown03 April, 2016 19:22
Hi Monis,

What caching-mode should be used on profile repository?

Oracle ATG Tutorials

ART#214 - What are repository cache modes in ATG?

14 comments:

About Me

Like Us

Popular Posts

Categories

Subscribe

Labels

Flickr