News
As per the official statement, both models also perform strongly on tool use, few-shot function calling, CoT reasoning (as seen in results on the Tau-Bench agentic evaluation suite) and HealthBench ...
OpenAI's much-hyped GPT-5 AI model is now available as part of Microsoft Copilot. Here's what you need to know.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results