An AI agent trained via reinforcement learning to detect fraud in a hostile, dynamic, multi-agent environment, without ever seeing the truth directly.
How do you train an AI to catch another AI lying?
A bridge is under construction. Supplier agents bid on contracts for structural parts. Some suppliers are honest. Some will cut corners on part quality if they think they can get away with it. The auditor agent watches the bidding patterns, tracks failure histories, and advises the buyer, but it never sees actual part strength. It must infer deception from indirect signals alone.
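To make "indirect signals only" concrete, here is a minimal sketch of what such an auditor observation might look like. The field names are illustrative assumptions, not the project's actual API; the key point is that actual part strength is simply not a field the auditor can see.

```python
from dataclasses import dataclass

# Hypothetical auditor observation for one bidding round.
# Field names are illustrative, not the project's real schema.
@dataclass
class AuditorObservation:
    round_id: int
    bids: list            # bid price per supplier this round
    past_failures: list   # observed part failures per supplier so far
    contracts_awarded: list  # contracts each supplier has won
    # Note: no field for true part strength -- ground truth stays hidden.

obs = AuditorObservation(
    round_id=3,
    bids=[92.5, 71.0, 88.0],
    past_failures=[0, 2, 1],
    contracts_awarded=[1, 4, 2],
)
```

A low bid plus a rising failure count is the kind of cross-round pattern the auditor has to learn to correlate, since no single field is conclusive on its own.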
Supplier personalities are reshuffled randomly every episode, so the auditor can't learn "supplier_2 is always the cheater": identity means nothing, behavior is everything. A low bid isn't automatically suspicious (costs are legitimately noisy). Pattern recognition across rounds is the only reliable signal.
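The reshuffling idea can be sketched in a few lines. This is an illustrative reset function under assumed names (the personality labels and reset API are not from the project's code); it shows why memorizing a supplier's identity is useless across episodes.

```python
import random

# Assumed personality labels for illustration.
PERSONALITIES = ["honest", "corner_cutter"]

def reset_episode(num_suppliers: int, rng: random.Random) -> dict:
    """Assign a fresh random personality to each supplier slot.

    Because this mapping changes every episode, a policy that memorizes
    "supplier_2 is always the cheater" fails; only behavioral patterns
    observed within the episode generalize.
    """
    return {f"supplier_{i}": rng.choice(PERSONALITIES)
            for i in range(num_suppliers)}

assignments = reset_episode(3, random.Random(0))
```

Each episode, the auditor faces the same supplier names with freshly rolled dispositions behind them.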
The auditor's language evolved from vague guesses to precise, evidence-grounded analysis across training. See Section 4 of the training report for the full story.
A single base model (Qwen2.5-Instruct) plays all agent roles, differentiated only by system prompt. It is trained with GRPO via Unsloth + TRL on an H100 GPU. Rewards are computed by a privileged "god-engine" oracle that sees ground truth but never exposes it to the agents.
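The asymmetry at the heart of the setup is that the oracle grades against hidden truth while the agent only ever receives a scalar. A minimal sketch, assuming a hypothetical reward scheme (the function name and the specific reward values are illustrative, not the project's actual reward shaping):

```python
# Hedged sketch of the "god-engine" idea: the oracle compares the auditor's
# call against true part quality, but only a scalar reward crosses the
# boundary back to the agent. Values below are assumed for illustration.

def oracle_reward(auditor_flagged: bool, true_part_is_weak: bool) -> float:
    """Privileged reward computed from hidden ground truth."""
    if auditor_flagged and true_part_is_weak:
        return 1.0    # correctly caught a corner-cutter
    if auditor_flagged and not true_part_is_weak:
        return -0.5   # falsely accused an honest supplier
    if not auditor_flagged and true_part_is_weak:
        return -1.0   # missed a defective part
    return 0.2        # correctly trusted an honest supplier
```

In a GRPO setup, a function like this would score each sampled completion in a group; the agent's policy only ever sees the resulting floats, never `true_part_is_weak` itself.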
Built on the OpenEnv framework for stateful multi-agent environments. The live environment API is running in the companion Space.