Agentic datasets support #3

@markpeterfejes

Feature Request: Agentic datasets

Description

Add dataset features to evaluatorq for evaluating agent-based interactions, so that an agent's behavior during a run can be evaluated alongside its final output.

Features

  • Track tools used during agent execution
  • Compare actual tool usage against expected tool usage
  • Evaluate intermediate agent responses
  • Support for agent reasoning traces
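
To make the request more concrete, here is a minimal sketch of what an agentic data point and a tool-usage comparison might look like. Every name in it (`AgenticDataPoint`, `AgentRun`, `ToolCall`, `scoreToolUsage`) is hypothetical and chosen for illustration; none of them is an existing `@orq-ai/evaluatorq` API, and the scoring rule (set-based precision and recall over tool names) is just one possible choice.

```typescript
// Hypothetical shapes for illustration only; NOT part of the evaluatorq API.
interface ToolCall {
  name: string;
  args?: Record<string, unknown>;
}

// A dataset row that also states which tools the agent is expected to use.
interface AgenticDataPoint {
  input: string;
  expectedTools: string[];
}

// What a single agent execution might record.
interface AgentRun {
  toolCalls: ToolCall[];            // tools actually called, in call order
  intermediateResponses: string[];  // responses produced before the final one
  reasoningTrace?: string[];        // optional reasoning steps
  finalOutput: string;
}

interface ToolUsageScore {
  precision: number;    // share of distinct tools used that were expected
  recall: number;       // share of expected tools that were actually used
  missing: string[];    // expected but never called
  unexpected: string[]; // called but not expected
}

// Compare the distinct tool names used in a run against the expected ones.
// Call order and repeated calls are ignored in this sketch.
function scoreToolUsage(expected: string[], run: AgentRun): ToolUsageScore {
  const expectedSet = new Set(expected);
  const usedSet = new Set(run.toolCalls.map((call) => call.name));

  const used = Array.from(usedSet);
  const matched = used.filter((name) => expectedSet.has(name));
  const missing = Array.from(expectedSet).filter((name) => !usedSet.has(name));
  const unexpected = used.filter((name) => !expectedSet.has(name));

  return {
    // Defined as 1 when no tools were called (nothing incorrect was called).
    precision: usedSet.size === 0 ? 1 : matched.length / usedSet.size,
    // Defined as 1 when no tools were expected (nothing could be missed).
    recall: expectedSet.size === 0 ? 1 : matched.length / expectedSet.size,
    missing,
    unexpected,
  };
}

// Usage with made-up data.
const dataPoint: AgenticDataPoint = {
  input: "What is 15% of the population of France?",
  expectedTools: ["search", "calculator"],
};

const run: AgentRun = {
  toolCalls: [{ name: "search" }, { name: "search" }, { name: "weather" }],
  intermediateResponses: ["Looking up the population of France."],
  finalOutput: "I could not compute the result.",
};

const score = scoreToolUsage(dataPoint.expectedTools, run);
console.log(score);
```

In this example the agent used one of the two expected tools and one unexpected tool, so both precision and recall come out at 0.5. An order-sensitive comparison, or a check on tool arguments, would need a different scoring function.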

Use Cases

  • Evaluating AI agents and their tool usage
  • Testing agent decision-making processes
  • Validating agent workflows

Package

@orq-ai/evaluatorq

This feature is part of the evaluatorq roadmap.
