Get Evaluation Details

curl --request GET \
  --url https://{api_endpoint}/api/v1/orchestrate/agent/{agent_id}/evaluations/{evaluation_id} \
  --header 'Authorization: Bearer <token>'

{
  "executed_date": "<string>",
  "evaluation_status": "PENDING",
  "number_of_test_cases": 123,
  "executed_by": "<string>",
  "id": "<string>",
  "aggregate_metrices": {
    "tool_quality": {
      "average": {
        "value": 123,
        "status": "<string>"
      },
      "accuracy": {
        "value": 123,
        "status": "<string>"
      },
      "relevance": {
        "value": 123,
        "status": "<string>"
      }
    },
    "answer_quality": {
      "average": {
        "value": 123,
        "status": "<string>"
      },
      "relevance": {
        "value": 123,
        "status": "<string>"
      },
      "correctness": {
        "value": 123,
        "status": "<string>"
      },
      "faithfulness": {
        "value": 123,
        "status": "<string>"
      }
    },
    "transaction_completion": {
      "average": {
        "value": 123,
        "status": "<string>"
      },
      "failed": 123,
      "success": 123,
      "total": 123
    }
  },
  "agent_name": "<string>",
  "agent_description": "<string>",
  "agent_environment": []
}

Authorizations

Authorization

string

header

required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Path Parameters

agent_id

string<uuid>

required

evaluation_id

string<uuid>

required
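As a rough illustration of how the bearer header and the two UUID path parameters fit together, the sketch below builds the request with Python's standard library. The function name `build_evaluation_request` and the placeholder host are hypothetical and not part of the API. The request is only constructed, not sent.

```python
import urllib.request
import uuid


def build_evaluation_request(
    api_endpoint: str, agent_id: str, evaluation_id: str, token: str
) -> urllib.request.Request:
    """Build (but do not send) the GET request for one evaluation.

    Hypothetical helper for illustration; not part of the API itself.
    """
    # Both path parameters are documented as string<uuid>.
    # uuid.UUID raises ValueError on malformed input, so bad IDs fail early.
    agent = uuid.UUID(agent_id)
    evaluation = uuid.UUID(evaluation_id)

    url = (
        f"https://{api_endpoint}/api/v1/orchestrate/agent/"
        f"{agent}/evaluations/{evaluation}"
    )
    # The Authorization header uses the form "Bearer <token>".
    return urllib.request.Request(
        url, headers={"Authorization": f"Bearer {token}"}, method="GET"
    )
```

Passing the result to `urllib.request.urlopen` would send the request. Validating the IDs locally is a design choice of this sketch, not a documented requirement.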

Response

Successful Response

executed_date

string

required

Date evaluated

evaluation_status

enum<string> | null

required

Evaluation status

Available options: PENDING, IN_PROGRESS, COMPLETED, CANCELLED, FAILED
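A client that polls this endpoint usually needs to know when to stop. The sketch below models the listed status values as an enum and checks whether a status is final. Treating COMPLETED, CANCELLED, and FAILED as terminal is an assumption based on their names, since the reference does not state it. `EvaluationStatus`, `TERMINAL_STATUSES`, and `is_finished` are hypothetical names.

```python
from enum import Enum
from typing import Optional


class EvaluationStatus(str, Enum):
    """The evaluation_status values listed in the reference."""

    PENDING = "PENDING"
    IN_PROGRESS = "IN_PROGRESS"
    COMPLETED = "COMPLETED"
    CANCELLED = "CANCELLED"
    FAILED = "FAILED"


# Assumption: these three statuses are final and will not change again.
TERMINAL_STATUSES = frozenset(
    {
        EvaluationStatus.COMPLETED,
        EvaluationStatus.CANCELLED,
        EvaluationStatus.FAILED,
    }
)


def is_finished(raw_status: Optional[str]) -> bool:
    """Return True if the status looks final.

    evaluation_status is nullable, so None is treated as "not finished".
    An unrecognized string raises ValueError rather than being guessed at.
    """
    if raw_status is None:
        return False
    return EvaluationStatus(raw_status) in TERMINAL_STATUSES
```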

number_of_test_cases

integer

required

Number of test cases

executed_by

string

required

User who ran the evaluation

id

string | null

Evaluation ID

aggregate_metrices

EvaluationMetrics · object

Aggregated metrics
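The nested shape of `aggregate_metrices` in the sample response can be awkward to log or compare. The sketch below flattens it into dotted keys and keeps only each score's `value`. It assumes the structure shown in the sample (score objects with `value` and `status`, plus plain integer counters). `flatten_metrics` is a hypothetical helper, not part of the API.

```python
from typing import Any, Dict


def flatten_metrics(metrics: Dict[str, Any], prefix: str = "") -> Dict[str, Any]:
    """Flatten nested metrics into {"group.metric": value} pairs."""
    flat: Dict[str, Any] = {}
    for key, node in metrics.items():
        path = f"{prefix}.{key}" if prefix else key
        if isinstance(node, dict) and "value" in node:
            # Score object such as {"value": 123, "status": "<string>"}.
            flat[path] = node["value"]
        elif isinstance(node, dict):
            # Metric group such as "tool_quality": recurse into it.
            flat.update(flatten_metrics(node, path))
        else:
            # Plain counter such as "failed", "success", or "total".
            flat[path] = node
    return flat
```

The field is not marked as required, so a caller would likely check for a missing or null value before calling the helper.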

agent_name

string | null

Name of the agent

agent_description

string | null

Description of the agent

agent_environment

EnvironmentSchema · object[] | null

List of environments associated with the agent
