Page Hero Background

EXO Labs profile image
#92

EXO Labs

Open-source distributed inference project. Builds "exo," a tool that turns Macs and consumer devices into a local LLM cluster using topology-aware model sharding, MLX as the inference backend, and RDMA over Thunderbolt 5 for a claimed 99% latency reduction between nodes. Supports LLaMA, DeepSeek, and Qwen via OpenAI-compatible APIs.
Subcategories
QUANTIZATIONEDGE DEPLOYMENT
Links