doteval is the simplest way to create high-signal evals, align LLM judges, and define rewards for RL — all in a single workspace.
$ pip install [coming soon]