io.tooloracle/agentguard icon

agentguard

by Tooloracle.io

io.tooloracle/agentguard

AgentGuard — 20-tool AI safety MCP: policy preflight, risk scoring, audit logging, rate limits.

agentguard · v1.0.0

by Tooloracle.io

33

🛡️ AgentGuard MCP

Security, Policy & Audit Layer for AI Agent Tool Execution — 20 tools | Part of FeedOracle & ToolOracle

AgentGuard is the governance and security layer for AI agent workflows. Before any tool executes, AgentGuard evaluates policies, scores risk, detects secrets and injection attempts, logs to a tamper-evident audit trail, and explains every decision. Built for regulated environments, autonomous payments, and enterprise AI agent deployments.

Quick Connect

# FeedOracle (compliance-focused)
npx -y mcp-remote https://feedoracle.io/guard/mcp/

# ToolOracle (agent-commerce focused)
npx -y mcp-remote https://tooloracle.io/guard/mcp/
{
  "mcpServers": {
    "agentguard": {
      "command": "npx",
      "args": ["-y", "mcp-remote", "https://feedoracle.io/guard/mcp/"]
    }
  }
}

How It Works — The Agent Security Loop

Agent wants to call: payment_execute({amount: 5000})
         │
         ▼
   policy_preflight()         ← Check before execution
         │
   risk_score: 95             ← Critical
   matched: pol-001, pol-002  ← Payment + High-risk policies
   decision: require_approval ← Human gate triggered
         │
   approval_required()        ← Register pending approval
         │
   [Human approves]
         │
   tool executes
         │
   audit_log_write()          ← Record outcome with signature
         │
   decision_explain()         ← Exportable compliance evidence

Tools — Welle 1: Core Security (7)

Tool Description
policy_preflight Pre-flight check before any tool call. Evaluates 7 policies, computes risk score, detects threats, auto-logs. Returns allowed/denied/require_approval/flagged.
tool_risk_score 0-100 risk score for tool + input. Detects secrets (API keys, passwords), prompt injection, high-value amounts. eth_gas→5, payment_execute→95+.
approval_required Check if tool needs human approval. Optionally registers a pending approval request with tracking URL.
audit_log_write Write tool execution to persistent, cryptographically-signed audit log (SQLite WAL). Call after execution to record outcome.
audit_log_query Query audit trail. Filter by agent, tool, decision, time range. Paginated. Returns signed entries for tamper verification.
decision_explain Human-readable explanation of any allow/deny decision. Pass request_id for stored entry or tool_name + tool_args for fresh analysis.
rate_limit_check Check agent rate limits: 200/min, 5000/hr, 50000/day. Returns per-window usage with percentage.

Tools — Welle 2: Payment Controls & Safety

Tool Description
payment_policy_check Validate payment against policy: amount limits (>100k warns, >1M blocks), recipient denylist, supported currencies/networks, AML thresholds (>10k fiat flagged), MiCA flags.
spend_limit_check Check per-call/hour/day spend limits by trust level. Default: 10k/call, 50k/hr, 200k/day. Trusted: 100k/call, 500k/hr, 2M/day.
secret_exposure_check Deep scan for 19 secret patterns: OpenAI/GitHub/AWS/Slack keys, Bearer/Basic auth, ETH private keys, Bitcoin WIF, credit cards, SSNs, emails. Returns severity + remediation.
payload_safety_check 18-pattern safety scan: prompt injection, jailbreak/DAN, role hijacking, SQL (UNION/DROP/OR 1=1), XSS, Python/JS/Shell injection, path traversal, null bytes, oversized payloads.
replay_guard_check Detect replay attacks via SHA256 fingerprint (agent+tool+args). Configurable window (default 5 min). Returns duplicate count + first/last seen.

Tools — Welle 3: Governance & Threat Intelligence (5)

Tool Description
cross_tool_anomaly_check Detect anomalous patterns: risky combos (wallet-recon→transfer, AML→payment), high frequency, repeated denials (policy probing), broad reconnaissance, elevated avg risk score.
scope_check Role-based scope control. Roles: admin, compliance_officer, trader, auditor, developer, readonly. Returns has_scope, missing scope, granting roles. Logs denials.
session_validate Full session lifecycle: create (TTL + call budget), validate (increment counter), invalidate, info. Sessions carry role, scopes, tenant, expiry.
tenant_policy_check Multi-tenant governance. Built-in tenants: default, fintech_eu (MiCA/DORA/AMLD6), defi_protocol, enterprise_read. Per-tenant blocklists, risk limits, spend caps.
threat_intel_check Entity threat intelligence. Auto-detects ETH addresses, IPs, domains. Checks sanctions (Tornado Cash, mixers), disposable services, behavioral analysis from audit log.

Built-in Tenants

Tenant Max Risk Spend/Day Frameworks
default 70 100,000
fintech_eu 60 500,000 MiCA, DORA, AMLD6
defi_protocol 80 10,000,000 MiCA
enterprise_read 30 0

Built-in Roles & Scopes

Role Scopes
admin All scopes
compliance_officer audit:read, compliance:read, blockchain:read, security:scan
trader blockchain:read, payment:check, payment:execute, audit:read
auditor audit:read, audit:write, compliance:read, monitor:read
developer blockchain:read, security:scan, audit:read, monitor:read
readonly blockchain:read, audit:read

Built-in Policies (7 Default)

Policy Condition Action
pol-001 Payment/transfer tools require_approval
pol-002 Risk score ≥ 80 require_approval
pol-003 Secret/key in payload deny
pol-004 Rate limit exceeded flag
pol-005 Risk score ≤ 20 allow freely
pol-006 Prompt injection detected deny
pol-007 Same tool > 50 calls/60s flag

Risk Score Guide

Score Level Action
0-14 Minimal Proceed freely
15-39 Low Proceed, log for audit
40-69 Medium Flag and proceed with caution
70-89 High Require human approval
90-100 Critical Block execution

Use Cases

  • Regulated AI Workflows: MiCA/DORA compliance requires audit trails — AgentGuard provides them automatically
  • Autonomous Payments: x402 agent payments run through approval_required gate before execution
  • Multi-tenant Platforms: Rate limiting and policy scoping per agent/session
  • Security Monitoring: Real-time detection of prompt injection and secret exposure in tool arguments
  • Compliance Reporting: Export audit log with cryptographic signatures for regulatory review

Backend: SQLite WAL-Mode

Persistent, stable, no daemon required. WAL-mode supports 1000+ writes/second. Shared between feedoracle.io and tooloracle.io — one source of truth.

Links

  • 🛡️ FeedOracle: https://feedoracle.io/guard/mcp/
  • 🔧 ToolOracle: https://tooloracle.io/guard/mcp/
  • 📚 Docs: feedoracle.io
  • 🏠 Platform: tooloracle.io

Built by FeedOracle — Evidence by Design