Creative Writing Agent
agent.py
Running Evaluations
Validation Configuration
evals.yaml
Evaluation Results
Successful Validation
When all validators pass:
summary.json
Failed Validation
When any validator fails:
summary.json
Key Features
- Content Validation: Verify required keywords (`contains`) and exclude unwanted content (`not_contains`)
- Format Validation: Ensure responses follow the expected structure with `regex` patterns
- Time Validation: Monitor execution time with the top-level `time` validator
- Usage Validation: Track resource consumption with the top-level `usage` validator
- Combined Validators: Use per-key validators (`content.validators`) and top-level validators (`output.validators`) together
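
To make the validator types listed above concrete, here is a minimal Python sketch of how such checks could behave. It is not the evaluation framework's actual implementation: the helper names (`validate_content`, `validate_run`, `summarize`), the `Result` class, the use of a token count as the "usage" measure, and the shape of the summary dictionary are all assumptions made for illustration.

```python
import re
from dataclasses import dataclass


@dataclass
class Result:
    name: str
    passed: bool


def validate_content(text, contains=(), not_contains=(), regex=None):
    """Per-key checks on one output field (hypothetical helper)."""
    results = []
    for kw in contains:
        # Required keyword must appear somewhere in the text.
        results.append(Result(f"contains:{kw}", kw in text))
    for kw in not_contains:
        # Unwanted keyword must be absent from the text.
        results.append(Result(f"not_contains:{kw}", kw not in text))
    if regex is not None:
        # Format check: the pattern must match somewhere in the text.
        results.append(Result(f"regex:{regex}", re.search(regex, text) is not None))
    return results


def validate_run(elapsed_s, tokens_used, max_seconds, max_tokens):
    """Top-level checks on the whole run (hypothetical helper).

    Assumption: 'usage' is measured as a token count.
    """
    return [
        Result("time", elapsed_s <= max_seconds),
        Result("usage", tokens_used <= max_tokens),
    ]


def summarize(results):
    """Collapse individual results into a pass/fail summary (assumed shape)."""
    failed = [r.name for r in results if not r.passed]
    return {"passed": not failed, "failed": failed}


story = "Once upon a time, a dragon guarded the library."
results = validate_content(
    story,
    contains=["dragon"],
    not_contains=["TODO"],
    regex=r"^Once upon a time",
)
results += validate_run(elapsed_s=1.2, tokens_used=350, max_seconds=5.0, max_tokens=1000)
summary = summarize(results)
print(summary)  # {'passed': True, 'failed': []}
```

Combining the per-key and top-level results into a single list mirrors the "Combined Validators" idea: the run passes only when every check passes, and any failing check is reported by name.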