Question 1

What types of AI agents can Humanbound test?

Accepted Answer

Any agent that exposes an API endpoint. You point Humanbound at the endpoint and it runs attacks against it as a black-box adversary. There is no SDK to install and no modification to the agent required. If a user can talk to your agent, Humanbound can test it.

Question 2

How is this different from a traditional penetration test?

Accepted Answer

A penetration test gives you a point-in-time report. Humanbound gives you a continuous posture score that updates as your agents, models, and configurations change. The testing engine adapts to your agent’s behavior, running multi-turn and agentic attack chains that evolve over time rather than replaying a fixed set of payloads.

Question 3

Is the open-source version limited compared to the platform?

Accepted Answer

No. The testing engine, SDK, and firewall are the same code that powers the platform. Nothing is held back or artificially gated. The platform adds continuous monitoring, finding lifecycle management, cross-session intelligence, and managed infrastructure for teams running security across a fleet of agents.

Question 4

What frameworks does Humanbound map findings to?

Accepted Answer

EU AI Act, NIST AI RMF, OWASP LLM Top 10, and OWASP Agentic AI Top 10. Every finding includes a framework mapping and severity rating. You can export compliance evidence packages in `html`, `pdf`, `json`, `sarif`, and `cef` formats.

Question 5

Can I run it fully air-gapped?

Accepted Answer

Yes. The local engine supports Ollama and other self-hosted models, so you can run a complete security test without any data leaving your environment. No Humanbound account is required for local use.

Question 6

How does the Humanbound Firewall work?

Accepted Answer

The firewall sits between users and your agent, evaluating every input before it reaches the model. Four tiers work together: input sanitisation, pre-trained attack detection, an agent-specific classifier trained on your own test data, and deep contextual analysis by an LLM judge. It ships as a Python package under Apache-2.0 and can be added to your agent in a few lines of code.

Question 7

What does “posture score” mean?

Accepted Answer

Every agent gets a score from 0 to 100 based on the findings from adversarial and behavioral testing. The score reflects the current security state of the agent, not a historical snapshot. When models update or configurations change, the score updates to reflect the new reality.

Question 8

How long does a first assessment take?

Accepted Answer

A baseline campaign against a single agent endpoint typically completes in hours, not weeks. For an enterprise-wide assessment covering multiple agents, plan for about two weeks from kickoff to delivered posture scores with evidence packs.

Deploy AI agents you can prove are secure.

Test before launch. Protect at runtime. Monitor forever.

Automated adversarial & behavioral testing.

The Humanbound Firewall.

Continuous assurance campaigns.

Run it locally. No login required.

production-support-bot

Every finding is mapped, scored, and exportable.

Questions, answered.

See your first posture score in two weeks.