Tiny, rubric-keyword proxy for clinical safety/quality signals
This example provides a minimal, rubric-driven proxy inspired by HealthBench—for quick sanity checks in clinical-style prompts. It is not a comprehensive or official reimplementation.
This example is now implemented as a suite in eval_protocol/benchmarks/suites/healthbench.py and exported as healthbench.