The Last Job of Humankind is Alignment
More than a year ago, our CEO asked me a straightforward question: “Do we have a plan for fine-tuning our models when we grow bigger?”
When we started, we hadn’t planned for it for obvious reasons - we were small, and the tech was nascent. I told her that people were already moving away from the term “fine-tuning.” They were calling it “alignment.” That conversation happened way before DeepSeek-R1. Of course, in the AI era, “forever” is a concept measured in months. And then came the RL (Reinforcement Learning) rampage, and now everyone is talking about alignment.