Predictable compute for an energy-constrained world.
EMBER is a hierarchical control plane that governs high-variance Python inference tasks across a distributed regional fabric. It replaces ad-hoc offloading with deterministic routing, balancing local thermal budgets against regional energy costs so the system behaves predictably even when demand doesn't.
Performance is a side effect. Predictability is the goal.
Baseline validation proving each architectural pillar under test.
| Ticket | Status |
|---|---|
| **Skeleton gRPC & Baseline Network Latency**: 1,000-ping test, p99 = 1.96ms within 20ms budget | PASS |
| **Admission Controller & Hard Concurrency Limit**: Concurrency wall, 100ms dead letter TTL, 90%/75% hysteresis | PASS · 7/7 |
| **Circuit Breaker & Failure Injection**: Abort-early at 22ms, 3-failure trip, half-open recovery | PASS · 8/8 |
| **Local Execution Substrate**: 43-worker boot, 400MB shared memory, 450ms task leasing | PASS · 6/6 |
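
The 90%/75% hysteresis in the admission controller row can be illustrated with a small sketch. It assumes 90% is the occupancy at which the gate starts rejecting and 75% is the occupancy at which it resumes admitting, and it borrows the 22-slot figure from the topology diagram. The `AdmissionGate` class and its method names are hypothetical, not EMBER's actual API.

```python
from dataclasses import dataclass


@dataclass
class AdmissionGate:
    """Hypothetical hysteresis gate: closes at the high mark, reopens at the low mark."""

    capacity: int = 22     # queue slots, taken from the topology diagram
    high: float = 0.90     # assumed: start rejecting at or above 90% occupancy
    low: float = 0.75      # assumed: resume admitting at or below 75% occupancy
    in_flight: int = 0
    shedding: bool = False

    def _update(self) -> None:
        # Flip state only when a watermark is crossed; between the two
        # marks the previous state is kept, which prevents flapping.
        load = self.in_flight / self.capacity
        if not self.shedding and load >= self.high:
            self.shedding = True
        elif self.shedding and load <= self.low:
            self.shedding = False

    def try_admit(self) -> bool:
        self._update()
        if self.shedding or self.in_flight >= self.capacity:
            return False
        self.in_flight += 1
        self._update()
        return True

    def release(self) -> None:
        # Called when a task finishes or is dropped.
        self.in_flight = max(0, self.in_flight - 1)
        self._update()
```

With these assumed values the gate closes once 20 of 22 slots are occupied and stays closed until occupancy falls to 16, so a single completed task does not immediately reopen it.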
Edge + Regional topology. Chicago is an optimization layer, not a dependency.

    Client
      │
      ▼
    Edge Controller (Columbus)
      │
      ├── Admission Gate        90%/75% hysteresis, 22-slot queue
      │
      ├── Routing Client ─────── RouteRequest ──► Chicago Regional
      │                         20ms timeout / circuit breaker
      │
      ├── Bounded Queue         Dead letter TTL: 100ms
      │
      ├── Worker Scheduler      Task leasing: 450ms reclaim
      │
      └── Worker Pool (43)      Shared model weights via mmap
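
To make "an optimization layer, not a dependency" concrete, the following is a minimal sketch of a circuit breaker with the 3-failure trip and half-open recovery listed in the validation table, wrapped in a routing helper that falls back to local execution. The `CircuitBreaker` class, the `route` function, and the `cooldown_s` value are assumptions for illustration; the source does not state the recovery delay. The `regional` callable is assumed to enforce its own deadline and raise on expiry.

```python
import time
from typing import Any, Callable, Optional


class CircuitBreaker:
    """Hypothetical breaker: opens after N consecutive failures, probes after a cooldown."""

    def __init__(self, trip_after: int = 3, cooldown_s: float = 1.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.trip_after = trip_after   # 3-failure trip, per the validation table
        self.cooldown_s = cooldown_s   # assumed value, not given in the source
        self.clock = clock             # injectable so tests can control time
        self.failures = 0
        self.opened_at: Optional[float] = None

    @property
    def state(self) -> str:
        if self.opened_at is None:
            return "closed"
        if self.clock() - self.opened_at >= self.cooldown_s:
            return "half-open"
        return "open"

    def allow(self) -> bool:
        # Closed and half-open both let a request through; open does not.
        return self.state != "open"

    def record_success(self) -> None:
        self.failures = 0
        self.opened_at = None

    def record_failure(self) -> None:
        if self.state == "half-open":
            self.opened_at = self.clock()  # failed probe: reopen, restart cooldown
            return
        self.failures += 1
        if self.failures >= self.trip_after:
            self.opened_at = self.clock()


def route(task: Any, breaker: CircuitBreaker,
          regional: Callable[[Any], Any], local: Callable[[Any], Any]) -> Any:
    """Try the regional tier when the breaker allows it; otherwise run locally."""
    if breaker.allow():
        try:
            result = regional(task)    # expected to raise on timeout or error
        except Exception:
            breaker.record_failure()
        else:
            breaker.record_success()
            return result
    return local(task)
```

Every path through `route` ends in either a regional result or a local one, so a regional outage costs at most one timed-out attempt per request until the breaker opens. This sketch lets any number of callers probe while half-open; a production breaker would usually limit that to one.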
The steel thread is complete. Every architectural pillar is validated. Upcoming work focuses on the pieces the design report flagged as still missing.