01 / SEE
Adversarial testing
Continuous probing across 19 attack categories. OWASP LLM Top 10 + MITRE ATLAS, against your live agent stack.
QUESTIONNAIRE · “How do you test for prompt injection and tool misuse?”
Runtime controls and signed audit trails for the AI behind your enterprise contract. Installed inside your stack, in days — not quarters.
The Loop
The questions an enterprise reviewer asks — can it be tricked, what stops it, did the controls fire, what evidence can we show our auditor — map to four primitives that ship as one runtime.
01 / SEE
Continuous probing across 19 attack categories. OWASP LLM Top 10 + MITRE ATLAS, against your live agent stack.
QUESTIONNAIRE · “How do you test for prompt injection and tool misuse?”
02 / CONTROL
Tool allowlists, parameter validation, approval gates, redaction. Enforced at the action boundary, before the agent calls a tool.
QUESTIONNAIRE · “What controls run between the model and your systems?”
03 / PROVE
Every test, every blocked attempt, every controlled action signed on the OVERT 1.0 standard. Verifiable offline, without GLACIS in the loop.
QUESTIONNAIRE · “Can you prove the controls fired in production?”
04 / IMPROVE
Findings flow back into the next testing run. Every fix becomes a regression test. The threat model can’t decay between releases.
QUESTIONNAIRE · “How do you keep up with new attack classes?”
Install in your stack
The runtime drops in beside your agent. The output is a signed evidence packet your customer’s security team, your auditor, your regulator, or your underwriter can verify offline.
import { attest } from '@glacis/runtime'; const receipt = await attest({ workflow: 'refund.issue', policy: 'acme.support.refund.v3', decision: 'BLOCK', rules: ['amount.exceeds_threshold'], }); // → signed OVERT receipt, verifiable offline
Built for teams in review
The questions a security reviewer, an FDA reviewer, and a model-risk officer ask are different. The runtime evidence behind the answer is the same.
A 30-minute call. Walk one named workflow, see the runtime in your stack, and decide if the next 10 days save you a quarter.
Different use case, or just exploring? Get in touch.