Skip to content

AgentCore Evaluations

AWS-managed evaluations provider backed by Bedrock AgentCore for LLM-as-a-judge evaluation.

Configuration

Server-Side (Gateway Config)

providers:
  evaluations:
    backend: "agentic_primitives_gateway.primitives.evaluations.agentcore.AgentCoreEvaluationsProvider"
    config:
      region: "us-east-1"
Parameter Default Description
region us-east-1 AWS region for AgentCore

Using the Evaluations API

Create an Evaluator

curl -X POST http://localhost:8000/api/v1/evaluations/evaluators \
  -H "Content-Type: application/json" \
  -d '{
    "name": "helpfulness",
    "description": "Evaluates response helpfulness",
    "scoring_rubric": "1-5 scale: 1=unhelpful, 5=very helpful"
  }'

Run an Evaluation

curl -X POST http://localhost:8000/api/v1/evaluations/evaluate \
  -H "Content-Type: application/json" \
  -d '{
    "evaluator_id": "eval-123",
    "input": "What is the capital of France?",
    "output": "The capital of France is Paris.",
    "expected": "Paris"
  }'

List Evaluators

curl http://localhost:8000/api/v1/evaluations/evaluators

Prerequisites

  • pip install agentic-primitives-gateway[agentcore]
  • AWS credentials with AgentCore access