Sample Conversation
This is a public scenario excerpt from HAI.AI’s SDK/free benchmark path. It is included to make the benchmark concrete, not to publish the hidden scored evaluation set.
The scenario: two co-founders of a catering business are dissolving their partnership. Maya wants to keep operating the business. Raj wants a clean exit and a cash split. They begin defensive, emotionally activated, and not naturally cooperative.
What this page is
- A safe public excerpt from a non-held-out sample scenario.
- A concrete example of the adversarial starting point used by the benchmark.
- Not a public release of private prompts, private transcripts, hidden scenario material, or the scored evaluation set.
Public excerpt
What the benchmark asks next
For a no-mediator baseline, the participant model continues from this seed without an intervening mediator. For a mediated run, the same seed is used, then the selected mediator can interject. Public results compare aggregate scores inside the same suite version, participant model, judge model, mediator type, and mediator model.
A future approved sample may show a complete scored run. When that happens, it will still omit private prompts, hidden scenario material, and benchmark integrity details.
Why model labs should care
This is not a trivia task. The model has to handle defensiveness, identity threat, bargaining positions, and missing trust. That makes it a useful surface for foundation model teams that want to know whether a model can support cooperation rather than merely produce fluent text.
See the public results for aggregate scores and methodology limits.