News
As per the official statement, both models also perform strongly on tool use, few-shot function calling, CoT reasoning (as seen in results on the Tau-Bench agentic evaluation suite) and HealthBench ...
Deferred module evaluation imports a module without immediately executing the module and its dependencies, avoiding ...
AI-native working allows Genspark to work at “gen speed” and release new products and features in nearly every week.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results