[Feature] Adding Large Reasoning Models Results #186

@Aaron617

Hi AgentBench Team,

Thanks for your awesome effort in constructing this benchmark.
I would like to ask whether you have added, or plan to add, experimental results for large reasoning models such as deepseek-r1 and o3-mini on AgentBench.

Best,
Mengkang

Labels: enhancement (New feature or request)
