Skip to content

Sandbox Testing Overview

GOVERN Sandbox (Surface 6) provides isolated, ephemeral environments for testing AI governance policies, running red team exercises, and benchmarking governance effectiveness before production deployment.

What Is the Sandbox?

The sandbox is a fully isolated copy of the GOVERN platform that:

  • Shares no data with production (separate database, separate AI system connections)
  • Resets on demand — ephemeral environments can be wiped and recreated in minutes
  • Accepts adversarial inputs without risk to production systems
  • Records everything for post-exercise analysis

Use Cases

Use CaseWho Uses ItFrequency
Policy validationPolicy authorsBefore every policy change
Red team exercisesSecurity researchersQuarterly or on demand
Governance benchmarkingPlatform engineersBefore major releases
Pre-production testingDevOps, QAEvery release candidate
Training scenariosNew SOC analystsOnboarding + annual
Vendor evaluationProcurement teamsWhen evaluating new AI systems

Environment Types

Ephemeral Sandbox

A short-lived (default: 4 hours, max: 24 hours) isolated environment. Created in under 2 minutes. Automatically destroyed when the session expires.

Best for: One-off tests, quick policy validation, ad hoc exploration.

Persistent Sandbox

A long-lived environment (up to 90 days) that retains its state between sessions. Useful for extended red team exercises or benchmarking campaigns.

Best for: Multi-day exercises, longitudinal benchmarks, training curriculum.

Pre-loaded Scenarios

GOVERN Sandbox ships with pre-loaded test scenarios covering common governance challenges:

ScenarioTypeDescription
Basic PII leakagePolicy validationAI outputs SSN/email
Prompt injectionRed teamClassic injection attempts
Bias in hiring recommendationsBias testingProtected class disparate impact
Medical advice without disclaimerSafetyUnauthorized medical guidance
GDPR right-to-erasureComplianceData subject request handling
Model drift over timeDriftGradual score degradation

Sandbox vs. Production

FeatureSandboxProduction
Data isolationCompleteN/A
Adversarial testing allowedYesNo
SOC alerts generatedSandbox SOC onlyProduction SOC
Audit trailFull (for analysis)Compliance-grade
Auto-reset capabilityYesNo
GOVERN policiesTest policiesLive policies
AI system connectionsMock or isolated realLive real