Research

AI agents are beginning to spend money on behalf of humans and organizations. Before this becomes the default mode of commerce, someone has to answer a fundamental question: how do you know an agent is making good economic decisions? We're building the instruments to find out.

You can't trust what you can't verify. Today, there is no standardized way to evaluate whether an AI agent's economic reasoning is sound, aligned with its principal's intent, and operating within acceptable risk limits. We're changing that.

Why This Matters Now

The financial system was designed around a single assumption: that every transaction has a human principal making a conscious decision. AI agents break this assumption. When an agent autonomously negotiates a price, selects a vendor, or commits a budget, the existing trust infrastructure has no way to verify the quality of that decision in real time.

This isn't a theoretical concern. Agents with access to payment instruments are already operating in production environments. The gap between what they can do and what can be verified about their reasoning is widening every month. Our research addresses this gap directly — building the evaluation frameworks, trust protocols, and verification methods that make autonomous economic activity auditable, governable, and safe.

Research Focus Areas

Decision Quality Measurement

Developing process-based metrics that evaluate the quality of an agent's economic reasoning independent of outcome — because a good decision can have a bad result, and a bad decision can get lucky.

Trust & Authorization

Designing layered authorization frameworks that let card networks, issuers, and merchants verify agent identity and intent before authorizing transactions on existing payment rails.

Failure Mode Analysis

Mapping the ways AI economic reasoning can degrade — from Goodhart's Law gaming to adversarial manipulation — and building detection mechanisms that catch problems before they propagate.
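The distinction drawn under Decision Quality Measurement, that a good decision can have a bad result and a bad decision can get lucky, can be made concrete with a small sketch. It is illustrative only and is not the EDQS metric: the `Option` type, the vendor names, the payoffs, and the use of expected value as the process measure are all assumptions chosen for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Option:
    """A hypothetical purchase option with an uncertain payoff."""
    name: str
    # (probability, net value) pairs; probabilities are assumed to sum to 1.
    outcomes: tuple

    def expected_value(self) -> float:
        return sum(p * v for p, v in self.outcomes)


def process_score(chosen: Option, options: list) -> float:
    """Expected value given up relative to the best available option.

    0.0 means the agent picked the best option on the information it had;
    a negative score means it left expected value on the table.
    This depends only on what was knowable before the outcome was realized.
    """
    best = max(o.expected_value() for o in options)
    return chosen.expected_value() - best


# Hypothetical numbers, chosen so the arithmetic is exact.
vendor_a = Option("vendor_a", ((0.75, 100.0), (0.25, -100.0)))  # EV = 50
vendor_b = Option("vendor_b", ((0.25, 200.0), (0.75, -100.0)))  # EV = -25
options = [vendor_a, vendor_b]

# Agent 1 picks vendor_a and happens to realize the -100 outcome.
# Agent 2 picks vendor_b and happens to realize the +200 outcome.
score_agent_1 = process_score(vendor_a, options)
score_agent_2 = process_score(vendor_b, options)

print(score_agent_1)  # 0.0   (sound decision, unlucky result)
print(score_agent_2)  # -75.0 (poor decision, lucky result)
```

Judged by outcome alone, the second agent looks better (+200 against -100). Judged by process, the first agent made the sounder choice, which is the property a process-based metric is meant to capture.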

Featured Publications

Working Paper · June 2026

Agent Behavioral Telemetry: Behavioral Drift as a Leading Indicator of Agent Compromise

Introduces a seven-signal telemetry framework for continuous trust verification of AI agents in financial transactions. Extends behavioral biometrics research to autonomous agents, defining the Agent Behavioral Fingerprint and drift-detection architecture.

Williams, J.

Working Paper · June 2026

Economic Decision Quality Score (EDQS): A Framework for Evaluating and Improving AI Agent Economic Reasoning

Introduces a six-dimension composite metric for evaluating the quality of an AI agent's economic reasoning process, independent of outcome. Draws on process-based supervision, Constitutional AI, bounded rationality, and Goodhart's Law taxonomy.

Williams, J.

Working Paper · May 2026

The Decision Trust Protocol: A Layered Authorization Framework for Autonomous Agent Commerce

Proposes a four-layer authorization architecture for agent-to-agent and agent-to-merchant commerce on existing card network rails, introducing the Know Your Agent (KYA) standard for human-facing governance.

Williams, J.

All Publications

Jun 2026 · Behavioral Analysis · Agent Behavioral Telemetry: Behavioral Drift as a Leading Indicator of Agent Compromise

Jun 2026 · Agent Reasoning · Economic Decision Quality Score (EDQS): A Framework for Evaluating and Improving AI Agent Economic Reasoning

May 2026 · Trust Infrastructure · The Decision Trust Protocol: A Layered Authorization Framework for Autonomous Agent Commerce

EDQS Benchmark

We are benchmarking EDQS against transaction-fraud baselines on agent-initiated transactions. The results have not yet been published.