DriftBench v1.1: an ambivalence metric + 20 cross-domain scenarios

Project summary

DriftBench is an open-source, fully deterministic benchmark (Apache 2.0) for how faithfully an AI tracks a user's changing beliefs across a long conversation — scored reproducibly, with no LLM judge. This grant ships v1.1: a new deterministic "ambivalence" metric (working prototype already in-repo) and a 3x larger, multi-domain scenario library.

What are this project's goals? How will you achieve them?

DriftBench measures — deterministically, with no LLM judge — whether an AI correctly tracks how a user's beliefs, conflicts, and identity change across a long conversation.

Why it matters: systems that track a person's beliefs are increasingly deployed in high-stakes settings — mental-health support, coaching, advice — where misreading how someone's state is changing can cause real harm. Deterministic, reproducible evaluation lets anyone check how well a system actually does this, without trusting a vendor's own claims.

This funding covers one concrete, shippable bundle:

- Promote the working "ambivalence" (AMB) metric from prototype to a calibrated, tested metric in the core. AMB scores when an AI recognizes a person holds two conflicting beliefs at once (e.g. loyalty to the past and openness to the new) rather than treating it as a simple switch — a distinction existing direction-only metrics miss. Prototype already runs at research/v1_1_draft/ambivalence.py.

- Expand the scenario library from 7 to 20+, across new domains: relationships, health, money & risk, grief & loss, addiction & recovery (today's are all career/identity).

- Ship it: tagged v1.1 release, updated specification, and a public leaderboard.

All deliverables are verifiable in the open repository as they land.

How will this funding be used?

Solo developer time over ~3 months to design domain ontologies, author and validate 13+ new scenarios, calibrate and test the AMB metric, and cut the v1.1 release, plus a small amount for LLM API costs. Based in Ukraine, where costs are well below US/EU levels, so the funding goes far. At the $5,000 minimum I'd ship the AMB metric plus ~5 new-domain scenarios; the full $15,000 funds the complete 20+ scenario library and release.

Who is on your team? What's your track record on similar projects?

I'm the sole creator of DriftBench. To be upfront: I'm not a professional programmer — I designed the system (ontology, metrics, validation rules, scenarios) and built it by directing AI coding tools (mainly Claude) as my implementer, while I act as architect and tester. The deterministic, fully test-covered design is what makes this work: correctness is verifiable against fixed expected outputs, not dependent on my hand-coding.

Already shipped, open-source (Apache 2.0): deterministic scoring engine, five metrics, frozen ontology, seven scenarios, zero-trust validation with cryptographic integrity checks, adapter architecture, CI. The AMB prototype and a full new-domain (grief) scenario already run in-repo. https://github.com/simon9679/driftbench

What are the most likely causes and outcomes if this project fails?

The main risk is calibration: ambivalence thresholds need tuning so AMB is neither trivially easy nor impossible across diverse scenarios. This is engineering, not open research — the metric is fully deterministic and already runs; the work is validating thresholds against a broader library. Scenario authoring is labor but low-risk.

How much money have you raised in the last 12 months, and from where?

None raised yet. Applications pending: Long-Term Future Fund ($20k, for a separate adversarial safety-evaluation track) and Anthropic Claude for OSS (non-cash).