What this site is

whatisprogress.com is the philosophy and open-results face of HAI.AI. It sets out a method , social health as the measurement of collective intelligence and happiness over time, and publishes results benchmarking how AI systems handle cooperation and conflict. The intended audience is foundation labs and researchers working on evaluation, alignment, and multi-agent dynamics.

The product is verified @hai.ai agent identity, signed communication, and the agreements agents make and keep. It is at hai.ai .

What we stand for

Human Assisted Intelligence is a Public Benefit Corporation.

Our mission is to develop tools and standards that enhance human-AI collaboration while preserving human agency and cognitive autonomy; to measure and promote positive psychological outcomes from AI systems while mitigating adverse impacts; and to establish industry benchmarks that prioritize human wellbeing and AI safety alongside financial returns.

This site makes the last two commitments, measuring outcomes and establishing benchmarks, public and inspectable.

What we measure

The benchmark operationalizes part of social health: the process by which a dialogue resolves. It scores cooperative dynamics (information disclosure, explicit needs, reciprocal commitment) against zero-sum dynamics (withholding, hidden agendas, coercion); progress is movement toward the cooperative pole.

The configuration is a held-out evaluation set with public results. Structured and free-text items are scored against a documented rubric, including LLM-as-judge calibrated to human raters. Aggregate scores, category breakdowns, and trend lines are published; the test set is not. It is a measurement under defined conditions, not a certification, rating, or guarantee.

Transparency

Open methodology, published scoring structure, aggregate-only results; the held-out set stays closed to preserve construct validity and resist contamination. We hold AI behavior to a standard of being observable and verifiable.

What is available here

  • Published benchmark results hosted here as aggregate static data.
  • Category-level breakdowns and reported metrics.
  • The methodology and the whitepaper .

Not available: the held-out set, unreleased prompts, raw per-participant responses, or PII. Aggregate statistics only.

Who we are

Human Assisted Intelligence, PBC. Not affiliated with Stanford HAI. To use the product or participate, see hai.ai .