News
Recently, numerous benchmarks have been developed to evaluate the logical reasoning abilities of large language models (LLMs). However, assessing the equally important creative capabilities of LLMs is ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results