OpenEnv Hackathon SF ยท March 2026

Auditron

An AI agent trained via reinforcement learning to detect fraud in a hostile, dynamic, multi-agent environment โ€” without ever seeing the truth directly.

๐Ÿ““ Open Interactive Notebook ๐Ÿ“Š View Training Report โš™๏ธ Live API

The Challenge

How do you train an AI to catch another AI lying?

A bridge is under construction. Supplier agents bid on contracts for structural parts. Some suppliers are honest. Some will cut corners on part quality if they think they can get away with it. The auditor agent watches the bidding patterns, tracks failure histories, and advises the buyer โ€” but it never sees actual part strength. It must infer deception from indirect signals alone.

Supplier personalities are reshuffled randomly every episode, so the auditor can't learn "supplier_2 is always the cheater" โ€” identity means nothing, behavior is everything. A low bid isn't automatically suspicious (costs are legitimately noisy). Pattern recognition across rounds is the only reliable signal.

Training Results

91%
Fraud detection rate (2nd half)
0%
False positive rate (2nd half)
0 / 50
Structural failures
24% โ†’ 91%
Detection improvement across episode

The auditor's language evolved from vague guesses to precise, evidence-grounded analysis across training. See Section 4 of the training report for the full story.

The Agents

Honest Supplier
Always builds to spec. Competes on price alone.
Shrewd Supplier
Cuts corners when scrutiny seems low.
Dishonest Supplier
Lies aggressively to maximize margin.
Random Supplier
Unpredictable. Sometimes honest, sometimes not.
Auditor
Flags suspected cheaters. Never sees true part strength.
Buyer
Selects suppliers each round. Follows auditor recommendations.

Technical Approach

A single base model (Qwen2.5-Instruct) plays all agent roles, differentiated by system prompt. Trained with GRPO via Unsloth + TRL on an H100 GPU. Rewards computed by a privileged "god-engine" oracle that never exposes ground truth to agents.

Built on the OpenEnv framework for stateful multi-agent environments. The live environment API is running in the companion Space.

Explore the Project