LLM Benchmark Agent Configuration
Mode-first configuration GUI for benchmark runs, resume, metrics-only recompute, and run-and-validate workflows.
Mode-first configuration GUI for benchmark runs, resume, metrics-only recompute, and run-and-validate workflows.
All CLI flags this GUI can currently emit. Flags not listed here are still available only through the terminal.