
evaluation_runs

Creates, updates, deletes, gets or lists an evaluation_runs resource.

Overview

Name | evaluation_runs
Type | Resource
Id | digitalocean.genai.evaluation_runs

Fields

The following fields are returned by SELECT queries:

A successful response.

Name | Datatype | Description
created_by_user_id | string (uint64) | (example: 12345)
agent_name | string | Agent name (example: example name)
run_name | string | Run name. (example: example name)
test_case_name | string | Test case name. (example: example name)
agent_deleted | boolean | Whether agent is deleted
agent_uuid | string | Agent UUID. (example: 123e4567-e89b-12d3-a456-426614174000)
agent_version_hash | string | Version hash (example: example string)
agent_workspace_uuid | string | Agent workspace uuid (example: 123e4567-e89b-12d3-a456-426614174000)
created_by_user_email | string | (example: example@example.com)
error_description | string | The error description (example: example string)
evaluation_run_uuid | string | Evaluation run UUID. (example: 123e4567-e89b-12d3-a456-426614174000)
evaluation_test_case_workspace_uuid | string | Evaluation test case workspace uuid (example: 123e4567-e89b-12d3-a456-426614174000)
finished_at | string (date-time) | Run end time. (example: 2023-01-01T00:00:00Z)
pass_status | boolean | The pass status of the evaluation run based on the star metric.
queued_at | string (date-time) | Run queued time. (example: 2023-01-01T00:00:00Z)
run_level_metric_results | array |
star_metric_result | object |
started_at | string (date-time) | Run start time. (example: 2023-01-01T00:00:00Z)
status | string | Evaluation Run Statuses (default: EVALUATION_RUN_STATUS_UNSPECIFIED, example: EVALUATION_RUN_STATUS_UNSPECIFIED)
test_case_description | string | Test case description. (example: example string)
test_case_uuid | string | Test-case UUID. (example: 123e4567-e89b-12d3-a456-426614174000)
test_case_version | integer (int64) | Test-case-version.
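
The three date-time fields above (queued_at, started_at, finished_at) can be combined to see how long a run waited and how long it took. The following is a minimal sketch, not part of the provider: it assumes a returned row is available as a Python dict keyed by the field names in the table, and that timestamps use the format shown in the examples. The helper names parse_ts and run_timings are hypothetical.

```python
from datetime import datetime
from typing import Optional


def parse_ts(value: Optional[str]) -> Optional[datetime]:
    """Parse a date-time string such as 2023-01-01T00:00:00Z, or return None."""
    if not value:
        return None
    # Older Python versions reject a trailing "Z" in fromisoformat(),
    # so rewrite it as an explicit UTC offset first.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def run_timings(row: dict) -> dict:
    """Return queue wait and run duration in seconds (None when unknown)."""
    queued = parse_ts(row.get("queued_at"))
    started = parse_ts(row.get("started_at"))
    finished = parse_ts(row.get("finished_at"))
    wait = (started - queued).total_seconds() if queued and started else None
    duration = (finished - started).total_seconds() if started and finished else None
    return {"queue_wait_s": wait, "run_duration_s": duration}


# Illustrative row; the values are made up for this example.
row = {
    "queued_at": "2023-01-01T00:00:00Z",
    "started_at": "2023-01-01T00:00:30Z",
    "finished_at": "2023-01-01T00:05:30Z",
}
print(run_timings(row))
```

A run that is still queued or in progress may have no started_at or finished_at yet, which is why the sketch returns None instead of raising.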

Methods

The following methods are available for this resource:

Name | Accessible by | Required Params | Optional Params | Description
genai_get_evaluation_run | select | evaluation_run_uuid | | To retrieve information about an existing evaluation run, send a GET request to /v2/gen-ai/evaluation_runs/{evaluation_run_uuid}.
genai_run_evaluation_test_case | insert | | | To run an evaluation test case, send a POST request to /v2/gen-ai/evaluation_runs.

Parameters

Parameters can be passed in the WHERE clause of a query. Check the Methods section to see which parameters are required or optional for each operation.

Name | Datatype | Description
evaluation_run_uuid | string | Evaluation run UUID. (example: "123e4567-e89b-12d3-a456-426614174000")

SELECT examples

To retrieve information about an existing evaluation run, send a GET request to /v2/gen-ai/evaluation_runs/{evaluation_run_uuid}.

SELECT
created_by_user_id,
agent_name,
run_name,
test_case_name,
agent_deleted,
agent_uuid,
agent_version_hash,
agent_workspace_uuid,
created_by_user_email,
error_description,
evaluation_run_uuid,
evaluation_test_case_workspace_uuid,
finished_at,
pass_status,
queued_at,
run_level_metric_results,
star_metric_result,
started_at,
status,
test_case_description,
test_case_uuid,
test_case_version
FROM digitalocean.genai.evaluation_runs
WHERE evaluation_run_uuid = '{{ evaluation_run_uuid }}'; -- required
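
The {{ evaluation_run_uuid }} placeholder has to be filled in before the query is run. One way to do that from a script is sketched below. It assumes the query is rendered as a plain string and passed to whatever client executes it. The render_select helper and the shortened column list are illustrative choices, not part of the provider.

```python
import uuid

# A narrower variant of the query above: only the columns needed to
# check how a run ended. The column names come from the Fields table.
SELECT_TEMPLATE = """SELECT
evaluation_run_uuid,
run_name,
status,
pass_status,
error_description
FROM digitalocean.genai.evaluation_runs
WHERE evaluation_run_uuid = '{{ evaluation_run_uuid }}';"""


def render_select(evaluation_run_uuid: str) -> str:
    """Substitute a validated UUID into the query template."""
    # uuid.UUID raises ValueError for malformed input, which also keeps
    # quotes and other SQL metacharacters out of the rendered query.
    canonical = str(uuid.UUID(evaluation_run_uuid))
    return SELECT_TEMPLATE.replace("{{ evaluation_run_uuid }}", canonical)


print(render_select("123e4567-e89b-12d3-a456-426614174000"))
```

Validating the value as a UUID before substitution is a design choice: the parameter is required, so failing early on a malformed value is cheaper than sending a request that cannot match any run.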

INSERT examples

To run an evaluation test case, send a POST request to /v2/gen-ai/evaluation_runs.

INSERT INTO digitalocean.genai.evaluation_runs (
data__agent_uuids,
data__run_name,
data__test_case_uuid
)
SELECT
'{{ agent_uuids }}',
'{{ run_name }}',
'{{ test_case_uuid }}'
RETURNING
evaluation_run_uuids
;
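
For comparison, the POST request that this INSERT maps to can be sketched directly. This is a rough illustration under several assumptions that the page does not state: the API host is https://api.digitalocean.com, authentication uses a bearer token, the JSON body keys match the data__ column names without the prefix, and agent_uuids is a list of UUID strings. The build_run_request helper is hypothetical, and it only constructs the request without sending it.

```python
import json
import urllib.request
from typing import List

API_BASE = "https://api.digitalocean.com"  # assumed API host


def build_run_request(agent_uuids: List[str], run_name: str,
                      test_case_uuid: str, token: str) -> urllib.request.Request:
    """Build (but do not send) the POST request for a new evaluation run."""
    # Body keys are assumed to mirror the data__ columns of the INSERT.
    payload = {
        "agent_uuids": agent_uuids,
        "run_name": run_name,
        "test_case_uuid": test_case_uuid,
    }
    return urllib.request.Request(
        url=f"{API_BASE}/v2/gen-ai/evaluation_runs",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_run_request(
    ["123e4567-e89b-12d3-a456-426614174000"],
    "example name",
    "123e4567-e89b-12d3-a456-426614174000",
    token="placeholder-token",
)
print(req.get_method(), req.full_url)
```

Passing the resulting object to urllib.request.urlopen would send it. The response is expected to carry the evaluation_run_uuids named in the RETURNING clause above, though its exact shape is not shown on this page.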