News

Another measure, 'HumanEval,' tests coding capabilities in the Python language, and the performance chart shows Grok-1 pulling ahead of GPT 3.5.