THUDM/AgentBench

About

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
