#86

Tinker

Managed post-training API from Thinking Machines Lab (Mira Murati's startup). Targets researchers and developers who want full control over their fine-tuning loops (SFT, GRPO/PPO RL, and DPO) without managing GPU clusters. Uses LoRA to batch multiple training runs across shared compute, and supports models ranging from Qwen3-4B to large MoEs like Kimi-K2.

Topics

POST_TRAINING

DATA

Subtopics

DISTILLATION

FINE TUNING

DATASETS

Links

LAST 30 DAYS

Bridgewater's Fine-Tuned Model Beats GPT, Claude, and Gemini at 13.8x Lower Cost

Tinker

post_training

14 hrs ago