Back to projectsintermediate
MLOps and machine learning
LLM evaluation harness
regression test prompts and models
Status
Track where this project stands in your portfolio.
Suggested stack
promptfoo or deepeval, a dataset, CI
Proves
evaluation, cost control
Milestones
- 01golden set
- 02metrics
- 03CI gate
- 04cost report