Conversation

@hashwnath

Summary

This PR adds a new agentic-eval skill to the skills collection, focused on patterns for evaluating and improving AI agent outputs.

Skill Contents

  • Reflection Pattern: Self-critique and iterative improvement loops
  • Evaluator-Optimizer Pattern: Separate generation/evaluation components (a minimal sketch follows this list)
  • Code-Specific Reflection: Test-driven refinement workflows
  • Evaluation Strategies: Outcome-based, LLM-as-Judge, Rubric-based
  • Best Practices: Clear criteria, iteration limits, convergence checks
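
To make the patterns above concrete, here is a minimal, hypothetical sketch of an evaluator-optimizer loop with reflection-style feedback, an iteration limit, and a convergence check. The `generate`, `evaluate`, and `refine` functions and the `Evaluation` type are illustrative placeholders, not part of the skill's actual contents.

```python
from dataclasses import dataclass

@dataclass
class Evaluation:
    score: float   # 0.0 to 1.0, fraction of criteria judged as met
    feedback: str  # critique fed back into the next generation pass

def generate(task: str, feedback: str | None = None) -> str:
    """Placeholder generator; in a real system this would call an LLM."""
    suffix = f" (revised per: {feedback})" if feedback else ""
    return f"draft for: {task}{suffix}"

def evaluate(output: str, criteria: list[str]) -> Evaluation:
    """Placeholder evaluator; in practice an LLM-as-judge or rubric scorer."""
    met = sum(1 for c in criteria if c.lower() in output.lower())
    return Evaluation(score=met / len(criteria), feedback="address the missing criteria")

def refine(task: str, criteria: list[str], max_iters: int = 3, threshold: float = 0.9) -> str:
    """Evaluator-optimizer loop: generate, evaluate, regenerate with feedback."""
    output = generate(task)
    for _ in range(max_iters):                # iteration limit (best practice)
        result = evaluate(output, criteria)
        if result.score >= threshold:         # convergence check (best practice)
            break
        output = generate(task, feedback=result.feedback)
    return output
```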

Use Cases

  • Implementing self-critique and reflection loops
  • Building evaluator-optimizer pipelines for quality-critical generation
  • Creating test-driven code refinement workflows
  • Designing rubric-based or LLM-as-judge evaluation systems (see the rubric sketch at the end of this description)
  • Measuring and improving agent response quality

This skill is domain-agnostic and can be applied to any AI agent system requiring output quality improvement.
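
For the rubric-based and LLM-as-judge use cases, a minimal sketch might look like the following. The `RUBRIC` criteria and the `judge_llm` helper are hypothetical assumptions for illustration; a real implementation would call an actual model and parse its score.

```python
# Hypothetical rubric; criteria and judge_llm are illustrative only.
RUBRIC = {
    "correctness": "Does the response answer the question accurately?",
    "completeness": "Does the response cover every part of the task?",
    "clarity": "Is the response easy to follow?",
}

def judge_llm(prompt: str) -> int:
    """Placeholder judge; a real implementation calls an LLM and parses a 1-5 score."""
    return 4

def score_response(task: str, response: str) -> dict[str, int]:
    """Score the response against each rubric criterion independently."""
    scores = {}
    for name, question in RUBRIC.items():
        prompt = (
            f"Task: {task}\n"
            f"Response: {response}\n"
            f"Criterion: {question}\n"
            "Score from 1 (poor) to 5 (excellent):"
        )
        scores[name] = judge_llm(prompt)
    return scores

def overall(scores: dict[str, int]) -> float:
    """Aggregate per-criterion scores into a single quality signal (simple mean)."""
    return sum(scores.values()) / len(scores)
```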

@aaronpowell (Contributor) left a comment

Please ensure you run the update script so that the README is updated with the changes.

Ran the update script as requested by the reviewer to regenerate the skills table.
@hashwnath (Author) commented Jan 22, 2026

Hi Aaron, #105c0f5 - ran the update script and pushed the changes. Could you please re-review? Thanks.

@aaronpowell (Contributor) commented

Looks like multi-line descriptions are going to break our formatting. I'll have to get that fixed before I can merge this PR.

@aaronpowell merged commit 45ad6d8 into github:main Jan 22, 2026
2 checks passed