#92
EXO Labs
Open-source distributed inference project. Builds "exo," a tool that turns Macs and consumer devices into a local LLM cluster using topology-aware model sharding, MLX as the inference backend, and RDMA over Thunderbolt 5 for a claimed 99% latency reduction between nodes. Supports LLaMA, DeepSeek, and Qwen via OpenAI-compatible APIs.
Categories
Subcategories
QUANTIZATION
EDGE DEPLOYMENT