Tag: evaluation

skill-forge

Automate AI skill and codebase improvement by iteratively testing, updating, and refining files using objective metrics ...

skill-quality-eval

Skill Quality Evaluator - Assess and score AI agent skill output quality. Trigger on: 'evaluate', 'quality check', 'scor...

openclaw-coding-skills

Production-grade coding workflow, execution scaffolding, and tuning skills for OpenClaw agents, part of the MyClaw.ai ec...

deep-research-skill

面向 OpenClaw / Codex Agent 的决策导向 deep research skill,强调任务路由、证据可追溯、current-state verification、反证约束与可审计交付。