"Evaluating genuine reasoning in large language models via esoteric programming languages."

https://arxiv.org/abs/2603.09678