Skip to content

Probe Overview

GOVERN Probe — Runtime Inference Monitor

The GOVERN Probe is a lightweight Docker container that sits between your application and your AI model API. It scores every inference against governance policies in real time.

Architecture

Your App → Probe (:9090) → AI Model API (OpenAI, Anthropic, etc.)
Score input
Forward request
Score output
Buffer telemetry
Flush to GOVERN API every 60s

What It Scores (5 Dimensions)

DimensionWhat Is CheckedAlert Threshold
SecurityPII (SSN, email, CC, phone), prompt injection (5 patterns), credential leaks (API keys, tokens)Any detection
BiasGender stereotypes, racial bias, ageism patternsScore > 0.2
AccuracyHallucination signals, fabricated citations, temporal impossibilities, grounding indicatorsScore > 0.3
CostHourly token budget, USD spend rateExceeds configured budget
DriftEntropy-based behavioral deviation from baseline (exponential moving average)Drift > 0.3

Scoring Modes

ModeBehavior
logScore every request, never block. Telemetry flows silently.
flagScore and flag violations in response headers.
blockScore and block requests with critical violations (returns 403).

Health Endpoints (port 9091)

EndpointPurpose
/healthzLiveness — is the probe running?
/readyzReadiness — are policies loaded? Is buffer healthy?
/metricsPrometheus-compatible metrics (uptime, memory)