Go Distributed Cache

Go Distributed Cache is a highly scalable, fault-tolerant distributed caching system written in Go. It supports sharded caches, gRPC-based communication, and strong consistency using quorum-based replication. The system features dynamic membership with nodes joining and leaving the cluster without disruption, making it horizontally scalable.

Disclaimer: This project is not intended to be used in production.

Overview

This project provides a distributed in-memory cache system designed for high availability, strong consistency, and scalability.

By leveraging sharded caches and a consistent hash ring, the system ensures that data is distributed efficiently across all nodes in the cluster.

The project uses gRPC for inter-node communication and membership protocols to manage dynamic cluster membership.

Data replication across nodes is done using a quorum-based approach to ensure strong consistency during read and write operations.

How It Works

  • Sharded Cache: The cache is divided into multiple shards to reduce contention and improve performance.
  • gRPC Communication: Nodes communicate with each other over gRPC, providing fast and reliable inter-node communication.
  • Quorum-Based Replication: Each key-value pair is replicated to a majority (quorum) of nodes, which ensures strong consistency even in the event of node failures (see the sketch after this list).
  • Dynamic Membership: Nodes can join and leave the cluster dynamically, and the system adjusts the distribution of keys accordingly using consistent hashing.
  • Graceful Shutdown: The system ensures that nodes gracefully leave the cluster, completing in-progress operations before exiting.
  • Structured Logging: zerolog is used for fast, structured logging.
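
As a rough illustration of the quorum rule, here is a minimal Go sketch (an assumption written for this README, not the project's actual code): a write is fanned out to the replicas and treated as successful once a majority has acknowledged it.

package main

import (
	"errors"
	"fmt"
)

// replica stands in for a remote node; in the real system this would be
// a gRPC call to a peer.
type replica func(key, value string) error

// quorumWrite sends the write to every replica and succeeds as soon as a
// majority (the quorum) has acknowledged it.
func quorumWrite(key, value string, replicas []replica) error {
	quorum := len(replicas)/2 + 1
	acks := 0
	for _, r := range replicas {
		if err := r(key, value); err == nil {
			acks++
		}
		if acks >= quorum {
			return nil
		}
	}
	return errors.New("quorum not reached")
}

func main() {
	healthy := replica(func(key, value string) error { return nil })
	down := replica(func(key, value string) error { return errors.New("node unreachable") })

	// Two of three replicas acknowledge the write, so the quorum (2) is met.
	err := quorumWrite("foo", "bar", []replica{healthy, down, healthy})
	fmt.Println("quorum write error:", err)
}

With three replicas the quorum is two, so the example write still succeeds even though one replica is unreachable.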

Environment Variables

The following environment variables can be configured to customize the system; a sketch of how they might be loaded with defaults follows the list:

  • ADDR: The address (host:port) on which the node will listen for gRPC requests (default: localhost:8080).
  • PEERS: Comma-separated list of peer node addresses to join the cluster.
  • NUM_SHARDS: Number of cache shards (default: 1).
  • CAPACITY: Total cache capacity across all shards (default: 1000).
  • TTL: Time-to-live for cache entries, in seconds (default: 3600).
  • MAX_RECV_MSG_SIZE: Maximum size (in bytes) for incoming gRPC messages (default: 4194304).
  • MAX_SEND_MSG_SIZE: Maximum size (in bytes) for outgoing gRPC messages (default: 4194304).
  • RPC_TIMEOUT: Timeout duration (in seconds) for inter-node gRPC calls (default: 5).
  • RATE_LIMIT: Maximum number of incoming requests per second (default: 10).
  • RATE_LIMIT_BURST: Maximum burst size for rate-limited requests (default: 100).
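
The snippet below is a minimal sketch of how these variables might be read with defaults; the helper names (getEnv, getEnvInt) are assumptions for illustration, not the project's actual configuration code.

package main

import (
	"fmt"
	"os"
	"strconv"
	"strings"
	"time"
)

// getEnv returns the value of an environment variable or a default.
func getEnv(key, def string) string {
	if v, ok := os.LookupEnv(key); ok {
		return v
	}
	return def
}

// getEnvInt parses an integer environment variable or falls back to a default.
func getEnvInt(key string, def int) int {
	if v, ok := os.LookupEnv(key); ok {
		if n, err := strconv.Atoi(v); err == nil {
			return n
		}
	}
	return def
}

func main() {
	addr := getEnv("ADDR", "localhost:8080")

	var peers []string
	if p := getEnv("PEERS", ""); p != "" {
		peers = strings.Split(p, ",")
	}

	numShards := getEnvInt("NUM_SHARDS", 1)
	capacity := getEnvInt("CAPACITY", 1000)
	ttl := time.Duration(getEnvInt("TTL", 3600)) * time.Second
	rpcTimeout := time.Duration(getEnvInt("RPC_TIMEOUT", 5)) * time.Second

	fmt.Println(addr, peers, numShards, capacity, ttl, rpcTimeout)
}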

Installation

Binary

  1. Clone the repository
git clone https://github.com/your-username/go-distributed-cache.git
cd go-distributed-cache
  2. Build the project
go build -o distributed-cache .
  3. Run the binary
./distributed-cache

Docker

  1. Build the Docker image
docker build -t distributed-cache .
  2. Run the container
docker run -d -p 8080:8080 -e ADDR=localhost:8080 distributed-cache

Basic Examples

Starting a Single Node

You can start a single node by running the following command:

./distributed-cache

Starting Multiple Nodes

To run multiple nodes, start each node with its own address and its list of peers set as environment variables:

ADDR=node1:8080 PEERS=node2:8080,node3:8080 ./distributed-cache
ADDR=node2:8080 PEERS=node1:8080,node3:8080 ./distributed-cache
ADDR=node3:8080 PEERS=node1:8080,node2:8080 ./distributed-cache

The system will automatically adjust and distribute cache entries across the nodes using consistent hashing.
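
The sketch below illustrates the consistent-hashing idea in Go (a simplified assumption without virtual nodes, not the project's implementation): node addresses are hashed onto a ring, and a key is routed to the first node at or after the key's hash, wrapping around at the end of the ring.

package main

import (
	"fmt"
	"hash/fnv"
	"sort"
)

// ring is a minimal consistent hash ring without virtual nodes.
type ring struct {
	hashes []uint32          // sorted node hashes
	nodes  map[uint32]string // hash -> node address
}

func hashKey(s string) uint32 {
	h := fnv.New32a()
	h.Write([]byte(s))
	return h.Sum32()
}

func newRing(addrs []string) *ring {
	r := &ring{nodes: make(map[uint32]string)}
	for _, a := range addrs {
		h := hashKey(a)
		r.hashes = append(r.hashes, h)
		r.nodes[h] = a
	}
	sort.Slice(r.hashes, func(i, j int) bool { return r.hashes[i] < r.hashes[j] })
	return r
}

// lookup returns the node responsible for a key: the first node hash
// at or after the key hash, wrapping around to the start of the ring.
func (r *ring) lookup(key string) string {
	h := hashKey(key)
	i := sort.Search(len(r.hashes), func(i int) bool { return r.hashes[i] >= h })
	if i == len(r.hashes) {
		i = 0
	}
	return r.nodes[r.hashes[i]]
}

func main() {
	r := newRing([]string{"node1:8080", "node2:8080", "node3:8080"})
	fmt.Println("foo is served by", r.lookup("foo"))
}

When a node joins or leaves, only the keys that fall between the changed node and its neighbor on the ring need to move, which is what keeps rebalancing cheap.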

Set and Get Example

To set a value:

grpcurl -plaintext -d '{"key":"foo", "value":"bar"}' localhost:8080 pb.CacheService/Set

To get the value:

grpcurl -plaintext -d '{"key":"foo"}' localhost:8080 pb.CacheService/Get
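
The same calls can be made from a Go client. The sketch below assumes the generated package lives at github.com/marvinlanhenke/go-distributed-cache/pb and exposes NewCacheServiceClient with SetRequest/GetRequest messages; those names are assumptions, so adjust them to the actual generated code in this repository.

package main

import (
	"context"
	"fmt"
	"log"
	"time"

	"google.golang.org/grpc"
	"google.golang.org/grpc/credentials/insecure"

	// Hypothetical import path and message names; adjust to the
	// generated package in this repository.
	pb "github.com/marvinlanhenke/go-distributed-cache/pb"
)

func main() {
	conn, err := grpc.Dial("localhost:8080", grpc.WithTransportCredentials(insecure.NewCredentials()))
	if err != nil {
		log.Fatalf("dial: %v", err)
	}
	defer conn.Close()

	client := pb.NewCacheServiceClient(conn)
	ctx, cancel := context.WithTimeout(context.Background(), 5*time.Second)
	defer cancel()

	// Set "foo" -> "bar", then read it back.
	if _, err := client.Set(ctx, &pb.SetRequest{Key: "foo", Value: "bar"}); err != nil {
		log.Fatalf("set: %v", err)
	}
	resp, err := client.Get(ctx, &pb.GetRequest{Key: "foo"})
	if err != nil {
		log.Fatalf("get: %v", err)
	}
	fmt.Println("value:", resp.GetValue())
}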

Missing Features / Trade-Offs

  • Anti-Entropy Mechanism: Re-distribution and re-replication are currently not handled when the hash ring changes (e.g., when a node has left). This could be addressed with a Merkle tree to detect differences between nodes quickly and efficiently.
  • Last-Write-Wins: Currently, the last-write-wins strategy is used for conflict resolution, implemented via a monotonically increasing ID (version). While this approach is simple and easy to understand, it is vulnerable to data loss (see the sketch below).
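
As a rough illustration of last-write-wins (a sketch, not the project's code), each value carries a version, and an incoming write only replaces the stored value if its version is higher; a concurrent write with a lower version is silently dropped, which is where the data-loss risk comes from.

package main

import "fmt"

// entry pairs a value with a monotonically increasing version.
type entry struct {
	value   string
	version uint64
}

// applyWrite keeps whichever write has the higher version (last write wins).
// The losing write is discarded, which is the data-loss trade-off.
func applyWrite(store map[string]entry, key string, incoming entry) {
	if current, ok := store[key]; ok && current.version >= incoming.version {
		return // incoming write is older (or a duplicate): drop it
	}
	store[key] = incoming
}

func main() {
	store := map[string]entry{}
	applyWrite(store, "foo", entry{value: "bar", version: 2})
	applyWrite(store, "foo", entry{value: "stale", version: 1}) // dropped
	fmt.Println(store["foo"].value)                             // prints "bar"
}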

License

This project is licensed under the Apache License, Version 2.0.
