-
Notifications
You must be signed in to change notification settings - Fork 189
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
License
THUDM/AgentBench
About
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Topics
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published