Erlang in-memory cache
Go to file
2016-03-23 22:49:08 +02:00
priv new async api, returns reference to operation 2015-05-10 14:02:58 +03:00
src update stats on cache eviction 2016-03-23 22:49:08 +02:00
test add unit test for async i/o 2016-03-23 22:15:39 +02:00
.gitignore use CT for unit-testing (mirgate eunit to _SUITE(s)) 2016-03-23 19:28:04 +02:00
distributed.png release 0.8.0 - re-factored cache bucket structure. The application uses segmented cache approach vs B-tree index for TTL handling 2013-07-27 18:28:26 +03:00
Emakefile new cache design (based on multi-page idea) 2013-04-16 12:15:14 +03:00
LICENSE release 0.8.0 - re-factored cache bucket structure. The application uses segmented cache approach vs B-tree index for TTL handling 2013-07-27 18:28:26 +03:00
local.png release 0.8.0 - re-factored cache bucket structure. The application uses segmented cache approach vs B-tree index for TTL handling 2013-07-27 18:28:26 +03:00
Makefile update Makefile to support rebar3 2016-03-20 22:15:12 +02:00
README.md new async api, returns reference to operation 2015-05-10 14:02:58 +03:00
rebar.config extra function calls to lookup data and meta-data 2013-04-26 23:24:05 +03:00

Segmented in-memory cache

Cache uses N disposable ETS tables instead of single one. The cache applies eviction and quota policies at segment level. The oldest ETS table is destroyed and new one is created when quota or TTL criteria are exceeded.

The write operation always uses youngest segment. The read operation lookup key from youngest to oldest table until it is found same time key is moved to youngest segment to prolong TTL. If none of ETS table contains key then cache-miss occurs.

The downside is inability to assign precise TTL per single cache entry. TTL is always approximated to nearest segment. (e.g. cache with 60 sec TTL and 10 segments has 6 sec accuracy on TTL)

Change log

  • 2.0.0 - various changes on asynchronous api, not compatible with version 1.x
  • 1.0.1 - production release

Usage

   application:start(cache).
   {ok, _} = cache:start_link(my_cache, [{n, 10}, {ttl, 60}]).
   
   %% synchronous i/o
   ok  = cache:put(my_cache, <<"my key">>, <<"my value">>).
   Val = cache:get(my_cache, <<"my key">>).

   %% asynchronous i/o
   Ref = cache:get_(my_cache, <<"my key">>).
   receive {Ref, Val} -> Val end.

configuration via Erlang sys.config

The cache instances are configurable via sys.config. Theses cache instances are supervised by application supervisor.

{cache, [
	{my_cache, [{n, 10}, {ttl, 60}]}
]}

distributed environment

The cache application uses standard Erlang distribution model. Please node that Erlang distribution uses single tcp/ip connection for message passing between nodes. Therefore, frequent read/write of large entries might impact on overall Erlang performance.

The global cache instance is visible to all Erlang nodes in the cluster.

   %% at a@example.com
   {ok, _} = cache:start_link({global, my_cache}, [{n, 10}, {ttl, 60}]).
   Val = cache:get({global, my_cache}, <<"my key">>).
   
   %% at b@example.com
   ok  = cache:put({global, my_cache}, <<"my key">>, <<"my value">>).
   Val = cache:get({global, my_cache}, <<"my key">>).

The local cache instance is accessible for any Erlang nodes in the cluster.

	%% a@example.com
   {ok, _} = cache:start_link(my_cache, [{n, 10}, {ttl, 60}]).
   Val = cache:get(my_cache, <<"my key">>).
   
   %% b@example.com
   ok  = cache:put({my_cache, 'a@example.com'}, <<"my key">>, <<"my value">>).
   Val = cache:get({my_cache, 'a@example.com'}, <<"my key">>).

Performance

MacBook Pro, Intel Core i5, 2.5GHz, 8GB 1600 MHz DDR3, 256 SSD

LRU Cache, 10 segments, 20 sec ttl (~2 sec per segment)

Local cache (application and cache within same VM) Local cache (application and cache within same VM)

Distributed cache (application and cache runs in different VMs) Distributed cache (application  and cache runs in different VMs)

Contributors