Is this a gotcha? Not sure where you got the "visual" from, but yes, it is best human performance vs. best LLM performance.