/


#86
Tinker
Managed post-training API from Thinking Machines Lab (Mira Murati's startup). Targets researchers and developers who want full control over fine-tuning loops, SFT, GRPO/PPO RL, and DPO without managing GPU clusters. Uses LoRA to batch multiple training runs across shared compute, covering models from Qwen3-4B to large MoEs like Kimi-K2.
Topics
Subtopics
DISTILLATIONFINE TUNINGDATASETS
Links
