Application caching & consistency
At SDE3, “we added Redis” is insufficient. You need topology (process-local vs cluster), eviction math, failure amplification (herd, avalanche, penetration), and consistency (invalidation vs dual-write races)—especially on AWS (ElastiCache / Redis OSS) behind Node.js services in Docker.
Core details
The database (or event log) is authoritative; cache is derived unless product and compliance accept staleness.
Patterns (application-level)
| Pattern | Read path | Write path | Gotcha |
|---|---|---|---|
| Cache-aside (lazy) | App: cache miss → load DB → SET cache | App writes DB; delete or update cache async | Cold start miss storms |
| Read-through | Cache library loads on miss | Same as your write policy | Cache module must share your key schema |
| Write-through | Reads from cache after warm | Write DB + cache together | Slower writes; still not atomic across nodes without care |
| Write-behind (write-back) | Fast reads from cache | ACK after cache; async flush to DB | Durability risk—loss on crash unless durable queue |
AWS mapping: with ElastiCache for Redis you typically implement cache-aside or read-through in your own app code; DynamoDB DAX is a managed read-through cache in front of DynamoDB with explicit consistency semantics. The same dual-write cautions apply if you also roll your own invalidation on top.
Sequence — cache-aside (lazy loading)
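The flow above can be sketched in TypeScript; `cache` and `db` are illustrative in-memory stand-ins for a Redis client and a database, not real clients:

```typescript
// Cache-aside (lazy loading): the read path checks the cache first,
// falls back to the DB on a miss, then populates the cache.
const cache = new Map<string, string>();
const db = new Map<string, string>([["user:1", '{"name":"Ada"}']]);

async function getUser(id: string): Promise<string | undefined> {
  const key = `user:${id}`;
  const hit = cache.get(key);
  if (hit !== undefined) return hit;          // cache hit: fast path
  const row = db.get(key);                    // miss: authoritative read
  if (row !== undefined) cache.set(key, row); // SET with a TTL in real Redis
  return row;
}
```

Note the "cold start miss storms" gotcha from the table: until the cache is warm, every read takes the slow DB branch.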
Sequence — write-through (sketch)
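A minimal write-through sketch, again with in-memory stand-ins: the write path updates DB and cache together, and reads are served from the cache once warm. As the table warns, the two writes are still not atomic in a real deployment, so order and failure handling matter.

```typescript
// Write-through: every write updates the authoritative store first,
// then refreshes the derived cache copy.
const cache = new Map<string, string>();
const db = new Map<string, string>();

async function putUser(id: string, json: string): Promise<void> {
  const key = `user:${id}`;
  db.set(key, json);    // authoritative write first
  cache.set(key, json); // then the derived copy
}

async function getUser(id: string): Promise<string | undefined> {
  return cache.get(`user:${id}`) ?? db.get(`user:${id}`);
}
```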
Deployment topologies
| Tier | Pros | Cons |
|---|---|---|
| Embedded / in-process (Node heap, lru-cache, etc.) | Sub-ms; no network | No sharing across Docker replicas; cold on every deploy |
| Distributed (Redis cluster, ElastiCache) | Shared state; TTL centralized | Network hop; failure = thundering herd to DB |
| CDN / edge | Great for static, public content | Purging; personalization traps |
Node.js note: each container has its own heap, so an in-process LRU acts as an L1 tier in front of Redis for hot keys, with short TTLs and tolerance for inconsistency.
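A sketch of that two-tier read path, with a Map standing in for the shared Redis tier (illustrative names, no TTLs for brevity):

```typescript
// Two-tier read: a small per-process LRU (L1) in front of a shared
// distributed cache (L2, e.g. Redis).
class Lru<V> {
  private map = new Map<string, V>();
  constructor(private max: number) {}
  get(k: string): V | undefined {
    const v = this.map.get(k);
    if (v !== undefined) { this.map.delete(k); this.map.set(k, v); } // refresh recency
    return v;
  }
  set(k: string, v: V): void {
    this.map.delete(k);
    this.map.set(k, v);
    if (this.map.size > this.max)
      this.map.delete(this.map.keys().next().value!); // evict least recently used
  }
}

const l1 = new Lru<string>(2);                       // per-process, tiny
const l2 = new Map<string, string>([["k", "shared"]]); // stand-in for Redis

function read(key: string): string | undefined {
  const hit = l1.get(key);
  if (hit !== undefined) return hit;
  const v = l2.get(key);
  if (v !== undefined) l1.set(key, v); // promote hot key into L1
  return v;
}
```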
Eviction & memory
| Policy | Behavior | Where seen |
|---|---|---|
| TTL | Key expires after wall clock | Everywhere; combine with jitter |
| LRU | Evict least recently used | Classic Redis volatile-lru; in-proc caches |
| LFU | Evict least frequently used | Resists one-off scans |
| W-TinyLFU | Window + TinyLFU (approx frequency) | Caffeine (JVM); high hit rate; concept appears in advanced local caches |
Redis / ElastiCache: understand `maxmemory-policy`; with `noeviction`, a full instance starts rejecting writes with OOM errors, so you need an operational playbook before memory fills.
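The LRU/LFU distinction in the table can be made concrete with a minimal exact-count LFU (illustrative only; Redis uses an approximated, sampled LFU with decaying counters):

```typescript
// Minimal LFU sketch: evict the key with the fewest accesses.
// Unlike LRU, a one-off scan of cold keys cannot push out a
// frequently used key that was not touched very recently.
class Lfu<V> {
  private vals = new Map<string, V>();
  private hits = new Map<string, number>();
  constructor(private max: number) {}
  get(k: string): V | undefined {
    if (this.vals.has(k)) this.hits.set(k, (this.hits.get(k) ?? 0) + 1);
    return this.vals.get(k);
  }
  set(k: string, v: V): void {
    if (!this.vals.has(k) && this.vals.size >= this.max) {
      // evict the least frequently used key
      const victim = [...this.hits.entries()].sort((a, b) => a[1] - b[1])[0][0];
      this.vals.delete(victim);
      this.hits.delete(victim);
    }
    this.vals.set(k, v);
    this.hits.set(k, this.hits.get(k) ?? 0);
  }
}
```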
Senior pitfalls (“gotchas”)
Cache penetration
Attack / bug: repeated misses for non-existent keys, so every request falls through to the DB.
Mitigations: cache negative results with short TTL; Bloom filter (or set membership) in front; rate limit suspicious keys.
Cache breakdown (thundering herd)
Hot key TTL expires → many concurrent misses → DB spike.
Mitigations: single-flight / mutex per key (only one loader); probabilistic early expiration (refresh before TTL); jitter; prefetch on schedule.
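The single-flight idea in one sketch: concurrent misses for the same key join one in-flight loader promise, so a hot-key expiry triggers one DB read rather than N. `loadFromDb` is an illustrative stand-in:

```typescript
const inflight = new Map<string, Promise<string>>();
let dbLoads = 0;

async function loadFromDb(key: string): Promise<string> {
  dbLoads++;
  await new Promise((r) => setTimeout(r, 10)); // simulate DB latency
  return `value-of-${key}`;
}

function singleFlight(key: string): Promise<string> {
  const existing = inflight.get(key);
  if (existing) return existing; // join the load already in flight
  const p = loadFromDb(key).finally(() => inflight.delete(key));
  inflight.set(key, p);
  return p;
}
```

In a single Node process this works because requests share the event loop; across replicas you still need a distributed lock or have to accept one load per process.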
Cache avalanche
Many keys share TTL (e.g. same absolute expiry after deploy) or cluster reboot → mass miss.
Mitigations: randomized TTL (TTL ± jitter); layered TTLs; circuit breaker to DB; graceful degradation (stale-while-revalidate semantics where safe).
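Randomized TTL is a one-liner; the base TTL and the ±20% jitter band below are illustrative numbers:

```typescript
// Spread expirations so keys warmed together don't all expire together.
function ttlWithJitter(baseMs: number, rand: () => number = Math.random): number {
  const jitter = (rand() * 2 - 1) * 0.2; // uniform in [-20%, +20%]
  return Math.round(baseMs * (1 + jitter));
}
```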
Dual-write race (DB vs cache order)
Classic bug: write DB then delete cache, but a concurrent reader loads the old DB value and repopulates the cache after the delete, so readers see stale data after the "update".
Better patterns: cache-aside on write (invalidate, not blind update); version in cache key; outbox + async invalidator; short TTL for contested keys.
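A versioned-key sketch: writers bump a per-entity version and readers build the cache key from it, so stale entries become unreachable instead of needing a racy delete. Version storage here is an in-memory Map for illustration; in practice it lives in Redis or on the row itself:

```typescript
const versions = new Map<string, number>();
const cache = new Map<string, string>();
const db = new Map<string, string>();

function cacheKey(id: string): string {
  return `user:${id}:v${versions.get(id) ?? 0}`; // e.g. user:1:v3
}

function write(id: string, json: string): void {
  db.set(id, json);
  versions.set(id, (versions.get(id) ?? 0) + 1); // old cache key is now dead
}

function read(id: string): string | undefined {
  const key = cacheKey(id);
  const hit = cache.get(key);
  if (hit !== undefined) return hit;
  const row = db.get(id);
  if (row !== undefined) cache.set(key, row);
  return row;
}
```

The cost is garbage entries under old versions, which a TTL cleans up; the benefit is that readers can never repopulate a key the writer already retired.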
Invalidation strategies
| Approach | When |
|---|---|
| TTL only | OK for low-risk data; define max staleness |
| Write path DELETE key | Common; watch race above |
| Pub/sub channel `invalidate:{id}` | Good for fan-out across Node processes |
| Version bump `key:v42` | Cheap consistency check for personalized reads |
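The pub/sub row can be sketched with an `EventEmitter` standing in for a Redis pub/sub client; each "process" is a local Map here, and the channel name mirrors the `invalidate:{id}` convention above:

```typescript
import { EventEmitter } from "node:events";

const bus = new EventEmitter(); // stand-in for Redis SUBSCRIBE/PUBLISH

// Each Node replica subscribes and drops its local copy on announcement.
function makeProcessCache(): Map<string, string> {
  const local = new Map<string, string>();
  bus.on("invalidate", (id: string) => local.delete(`user:${id}`));
  return local;
}

function publishInvalidate(id: string): void {
  bus.emit("invalidate", id); // PUBLISH invalidate:{id} in real Redis
}
```

Remember this delivery is fire-and-forget and eventual: a replica that is down or reconnecting misses the message, which is why a backstop TTL still matters.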
Understanding
Wrong layer: personalized HTML cached at the CDN without surrogate keys; account balances served from a long TTL with no staleness disclosure in the UI.
Distributed invalidation is eventual; for money reads, use the primary DB or a strongly consistent read path plus a tight TTL.
Senior understanding
| Tension | Staff move |
|---|---|
| Stale business decisions | key versioning, shorter TTL for hot keys, feature-flag cache off |
| Multi-tenant poisoning | namespace keys; never global flush |
| ElastiCache failover | clients retry; expect short spike → breaker + jitter |
Diagram (write invalidates cache-aside)