@baldur Model tuning is AI hell: the models get applied across such a broad spectrum of use cases (most of them rather ill-suited to a language model), while vendors mostly care about scoring well on the benchmark-du-jour. And why wouldn't they, given that under their business model the actual product is the PowerPoint deck they show at the next investment committee or at the IPO?