[Feature] Adding Large Reasoning Models Results

Hi AgentBench Team,

Thanks for your awesome effort in constructing this benchmark. 
I would like to ask have you or plan to add the experimental results of large reasoning models like deepseek-r1, o3-mini, etc on AgentBench?

Best,
Mengkang