News

Benchmarks drive many areas of research forward, and this is indeed the case for two areas of research that I engage with: ...
For the first time, large language models performed on a par with gold medallists in the International Mathematical Olympiad.