feat: add evaluation framework and CI/CD integration for csv-processor sample #1179

Chibionos · 2026-01-22T17:03:42Z

Summary

Add an evaluation framework for testing file input scenarios in the csv-processor sample, and wire it into CI/CD.

Changes

Sample Updates (`samples/csv-processor/`)

✅ Add Output model to main.py for proper evaluation data access
✅ Create test data (sales_data.csv, large_dataset.csv, minimal.csv)
✅ Add custom evaluators:
- CSVShapeEvaluator: validates CSV dimensions (rows × columns)
- CSVColumnsEvaluator: verifies expected column names
- AttachmentCreatedEvaluator: checks output attachment creation
✅ Add evaluation set (file-input-tests-local.json) with 3 test cases
✅ Update uipath.json with Output model schema
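
For readers unfamiliar with the checks, the shape and column validations described above could be sketched as follows. This is a minimal illustration using only the standard library, not the evaluator classes added in this PR: the function names `csv_shape`, `score_shape`, and `score_columns` and the sample data are hypothetical, and the real evaluators plug into the UiPath evaluation framework, whose interfaces are not reproduced here.

```python
import csv
import io


def csv_shape(text: str) -> tuple[int, int]:
    """Return (data_rows, columns) for CSV text whose first row is a header."""
    rows = list(csv.reader(io.StringIO(text)))
    if not rows:
        return (0, 0)
    return (len(rows) - 1, len(rows[0]))


def score_shape(text: str, expected_rows: int, expected_columns: int) -> float:
    """Score 1.0 when the CSV has the expected dimensions, else 0.0."""
    return 1.0 if csv_shape(text) == (expected_rows, expected_columns) else 0.0


def score_columns(text: str, expected: list[str]) -> float:
    """Score 1.0 when the header row matches the expected column names."""
    rows = list(csv.reader(io.StringIO(text)))
    header = rows[0] if rows else []
    return 1.0 if header == expected else 0.0


# Hypothetical sample data, not the contents of sales_data.csv.
SAMPLE = "date,product,amount\n2026-01-01,widget,10\n2026-01-02,gadget,20\n"
```

The binary 1.0/0.0 scoring mirrors the pass scores reported in this PR; whether the actual evaluators award partial credit is not stated here.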

CI/CD Integration (`testcases/csv-processor-evals/`)

✅ Add testcase directory structure following existing patterns
✅ Create run.sh script for automated evaluation runs
✅ Add assert.py for validation of evaluation results
✅ Configure pyproject.toml and uipath.json for testcase execution
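
As a rough idea of what a result-validation step like assert.py might do, the sketch below collects every evaluator score that falls below a threshold. The result layout (`cases`, `name`, `scores`) and the function name `failing_scores` are assumptions made for illustration; the actual output schema of `uipath eval` and the contents of assert.py may differ.

```python
import json


def failing_scores(results: dict, threshold: float = 1.0) -> list[str]:
    """Return 'test / evaluator' labels whose score is below the threshold."""
    failures = []
    for case in results["cases"]:  # hypothetical key
        for evaluator, score in case["scores"].items():  # hypothetical key
            if score < threshold:
                failures.append(f"{case['name']} / {evaluator}")
    return failures


# Hypothetical results payload; the real output schema may differ.
EXAMPLE = json.loads("""
{"cases": [
  {"name": "Test Minimal CSV Processing",
   "scores": {"CSVShapeEvaluator": 1.0, "CSVColumnsEvaluator": 1.0}}
]}
""")
```

A CI script could then fail the run with `assert not failing_scores(results)`, which surfaces the offending test and evaluator names in the assertion context.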

Core Fixes

🐛 Fix evaluator path resolution bug in _evaluator_factory.py
- Support co-located .py and .json files in evaluators directory
- Check direct path before falling back to /custom/ subdirectory
📦 Add pandas dev dependency for CSV processing
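
The lookup order described in the path resolution fix can be sketched like this. It is an illustration of the "direct path first, then the custom subdirectory" rule only; `resolve_evaluator_file` is a hypothetical name, and the real logic in _evaluator_factory.py may be structured differently.

```python
from pathlib import Path


def resolve_evaluator_file(evaluators_dir: Path, filename: str) -> Path:
    """Prefer a file co-located in evaluators_dir; fall back to custom/."""
    direct = evaluators_dir / filename
    if direct.is_file():
        return direct
    fallback = evaluators_dir / "custom" / filename
    if fallback.is_file():
        return fallback
    raise FileNotFoundError(
        f"{filename} not found in {evaluators_dir} or {evaluators_dir / 'custom'}"
    )
```

Checking the direct path first lets an evaluator's .py and .json files sit side by side, while layouts that keep code under the custom subdirectory continue to resolve.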

Evaluation Results

All 3 test cases pass with perfect scores (1.0/1.0):

✅ Test Sales Data CSV Processing
✅ Test Large Dataset CSV Processing
✅ Test Minimal CSV Processing

Test plan

- Run evaluation locally: `uv run uipath eval main evaluations/eval-sets/file-input-tests-local.json`
- Verify all evaluators score 1.0
- Test CI/CD integration script: `./testcases/csv-processor-evals/run.sh`
- Verify CI/CD pipeline runs successfully in GitHub Actions

🤖 Generated with Claude Code

…r sample

Add comprehensive evaluation framework for testing file input scenarios in the csv-processor sample.

## Changes

### Sample Updates (samples/csv-processor/)

- Add Output model to main.py for proper evaluation data access
- Create test data (sales_data.csv, large_dataset.csv, minimal.csv)
- Add custom evaluators:
  - CSVShapeEvaluator: validates CSV dimensions (rows × columns)
  - CSVColumnsEvaluator: verifies expected column names
  - AttachmentCreatedEvaluator: checks output attachment creation
- Add evaluation set (file-input-tests-local.json) with 3 test cases
- Update uipath.json with Output model schema

### CI/CD Integration (testcases/csv-processor-evals/)

- Add testcase directory structure following existing patterns
- Create run.sh script for automated evaluation runs
- Add assert.py for validation of evaluation results
- Configure pyproject.toml and uipath.json for testcase execution

### Core Fixes

- Fix evaluator path resolution bug in _evaluator_factory.py
  - Support co-located .py and .json files in evaluators directory
  - Check direct path before falling back to /custom/ subdirectory
- Add pandas dev dependency for CSV processing

## Evaluation Results

All 3 test cases pass with perfect scores (1.0/1.0):

- Test Sales Data CSV Processing
- Test Large Dataset CSV Processing
- Test Minimal CSV Processing

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

Fixed three critical issues causing evaluation tests to fail:

1. Added pandas dependency to testcases/csv-processor-evals/pyproject.toml
   - Agent requires pandas but it was missing from eval dependencies
2. Fixed file paths in file-input-tests-local.json
   - Updated paths to be relative to testcases directory
   - Changed from "test-data/..." to "../../samples/csv-processor/test-data/..."
3. Optimized UiPath initialization in main.py
   - Moved UiPath() initialization to platform mode only
   - Prevents authentication errors when testing with local files

All tests now pass with scores of 1.0 for both evaluators.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
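
The third fix, creating the client only when platform mode needs it, follows a common deferred-initialization pattern, sketched below. Nothing here uses the UiPath SDK: `LazyClient`, `read_input`, and the `platform_mode` flag are hypothetical stand-ins, and the actual change in main.py may look different.

```python
from typing import Callable, Optional


class LazyClient:
    """Create the underlying client only on first use."""

    def __init__(self, factory: Callable[[], object]) -> None:
        self._factory = factory
        self._client: Optional[object] = None

    def get(self) -> object:
        if self._client is None:
            self._client = self._factory()
        return self._client


def read_input(path: str, platform_mode: bool, client: LazyClient) -> str:
    """Touch the client only in platform mode; local files need no auth."""
    if platform_mode:
        # Hypothetical stand-in for fetching an attachment through an SDK client.
        return f"downloaded {path} with {client.get()!r}"
    with open(path, encoding="utf-8") as handle:
        return handle.read()
```

Because the factory is never called on the local-file path, a missing or invalid credential cannot raise during local evaluation runs.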

github-actions bot added labels on Jan 22, 2026:
- test:uipath-langchain: Triggers tests in the uipath-langchain-python repository
- test:uipath-llamaindex: Triggers tests in the uipath-llamaindex-python repository

Chibi Vikram and others added 2 commits January 23, 2026 09:22

Chibionos force-pushed the feat/csv-processor-evaluations branch from adf54bc to dc2658c Compare January 23, 2026 17:22
