AgentGuard

Multi-agent AI observability for Splunk — instrument agents with a Python SDK, stream spans via HEC, investigate failures with SPL dashboards and an MCP server for Claude.

pip install agentguard

Architecture

flowchart LR
    subgraph Part A: Telemetry Ingestion Pipeline
        A[AgentGuard Python SDK<br/>demo/run_demo.py] -->|POST JSON Spans| B(Splunk HEC<br/>Port: 8088)
        B -->|Route to Index| C[(Splunk Enterprise<br/>Index: main<br/>Sourcetype: agentguard:trace)]
    end

    subgraph Part B: AI Diagnostic Workflow
        D[Claude Desktop App<br/>AI Agent] -->|npx mcp-remote via HTTP| E(Splunk MCP Server App<br/>Port: 8089)
        E -->|Execute SPL Query| C
        C -.->|Return JSON Results| E
        E -.->|Return Diagnostic Payload| D
    end

    %% Minimal Styling (Let GitHub handle the default theme)
    classDef database fill:#000000,stroke:#4CAF50,stroke-width:2px,color:#fff;
    class C database;

Pitch

AgentGuard instruments multi-agent AI systems with a lightweight Python SDK, streams execution traces into Splunk via HEC, surfaces failure patterns in real-time SPL dashboards, and exposes a Splunk MCP server so Claude can query telemetry to explain why agents failed.

Ports (important)

Service	URL	Purpose
Splunk Web	`http://localhost:8000`	Splunk UI, dashboards, HEC token setup
AgentGuard Django	`http://localhost:8001`	Span ingest API, alert webhooks
Splunk HEC	`https://localhost:8088`	SDK sends spans here
Splunk REST	`https://localhost:8089`	MCP / Claude queries

Do not run Django on port 8000: that port belongs to Splunk Web. Splunk URLs such as /en-GB/app/... return 404 if they are sent to Django.
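A quick way to confirm what is listening before starting anything is a small TCP probe. The sketch below is not part of the repo; `EXPECTED_PORTS` and `is_listening` are illustrative names, and the port numbers are copied from the table above. It only checks that something accepts connections, not which service it is.

```python
import socket

# Port assignments copied from the table above.
EXPECTED_PORTS = {
    "Splunk Web": 8000,
    "AgentGuard Django": 8001,
    "Splunk HEC": 8088,
    "Splunk REST": 8089,
}


def is_listening(host: str, port: int, timeout: float = 0.5) -> bool:
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Connection refused, timeout, or unreachable host.
        return False


def report(host: str = "localhost") -> None:
    """Print one line per expected service."""
    for name, port in EXPECTED_PORTS.items():
        state = "listening" if is_listening(host, port) else "not reachable"
        print(f"{name:<18} :{port}  {state}")


if __name__ == "__main__":
    report()
```

If port 8000 shows as listening before Splunk has been started, something else (possibly a stray Django dev server) has taken it.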

Quick start

1. SDK + demo agents

Requires Python 3.10+ (3.11 recommended).

python3.11 -m venv venv && source venv/bin/activate
pip install -r requirements.txt
pip install -e sdk/

cp .env.example .env   # set SPLUNK_HEC_URL + SPLUNK_HEC_TOKEN

python demo/run_demo.py --cycles 3
python demo/inject_failure.py

Backend-only (no Splunk):

AGENTGUARD_BACKEND_URL=http://localhost:8001 python demo/run_demo.py --backend-only

2. Django backend (optional mirror)

cd backend
python manage.py makemigrations api   # first time only
python manage.py migrate
python manage.py runserver 8001
# GET http://localhost:8001/api/v1/agents/

Optional ingest auth: set AGENTGUARD_API_KEY in .env (SDK sends Authorization: Api-Key ...).
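The same request can be made from Python. This is a minimal sketch using only the standard library; `build_agents_request` and `fetch_agents` are hypothetical helpers, not SDK functions. It assumes the key from AGENTGUARD_API_KEY goes into the `Authorization: Api-Key` header as described above. Whether the read endpoint checks the key, and the shape of the JSON it returns, are not stated in this README.

```python
import json
import urllib.request
from typing import Optional


def build_agents_request(base_url: str, api_key: Optional[str] = None) -> urllib.request.Request:
    """Build a GET request for /api/v1/agents/, adding auth only if a key is given."""
    url = base_url.rstrip("/") + "/api/v1/agents/"
    req = urllib.request.Request(url, method="GET")
    req.add_header("Accept", "application/json")
    if api_key:
        # Header scheme taken from the README: "Authorization: Api-Key <key>".
        req.add_header("Authorization", f"Api-Key {api_key}")
    return req


def fetch_agents(req: urllib.request.Request, timeout: float = 5.0):
    """Send the request and decode the JSON body (response shape is not assumed)."""
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return json.load(resp)
```

With the backend running on port 8001, `fetch_agents(build_agents_request("http://localhost:8001", os.environ.get("AGENTGUARD_API_KEY")))` would return whatever the endpoint serves.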

3. Splunk HEC

See splunk_app/README.md. To list failed spans, search:

index=main sourcetype=agentguard:trace status=FAILED
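For reference, this is roughly what a span looks like on the wire when sent to HEC. It is a sketch, not the SDK's exporter: `build_hec_request` is a hypothetical helper, and it assumes SPLUNK_HEC_URL is a base URL such as `https://localhost:8088` without a path. The `/services/collector/event` endpoint and the `Authorization: Splunk <token>` header are standard HEC; the index and sourcetype match the search above. Span fields other than `status` are placeholders.

```python
import json
import time
import urllib.request


def build_hec_request(hec_url: str, token: str, span: dict) -> urllib.request.Request:
    """Wrap a span dict in the HEC event envelope and build the POST request."""
    envelope = {
        "time": time.time(),                # event timestamp (epoch seconds)
        "index": "main",                    # index used in this README
        "sourcetype": "agentguard:trace",   # sourcetype used in this README
        "event": span,                      # the span itself
    }
    req = urllib.request.Request(
        hec_url.rstrip("/") + "/services/collector/event",
        data=json.dumps(envelope).encode("utf-8"),
        method="POST",
    )
    req.add_header("Authorization", f"Splunk {token}")
    req.add_header("Content-Type", "application/json")
    return req
```

The request is only built here, not sent. A local Splunk install typically serves HEC with a self-signed certificate, so sending it with `urllib.request.urlopen` would also need a suitable TLS context.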

4. MCP server

pip install mcp
SPLUNK_MOCK=1 python mcp_server/server.py

With Django fallback when Splunk is unavailable:

SPLUNK_MOCK=0 AGENTGUARD_BACKEND_URL=http://localhost:8001 python mcp_server/server.py

Tools: search_agent_traces, explain_agent_failure, agent_health_summary, nl_search, anomaly_detection, failure_rate_analysis, alert_summary, check_ai_features
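The two invocations above imply a three-way choice of data source. The sketch below shows one way that decision could be expressed; it is an illustration, not the logic in mcp_server/server.py. `choose_backend` is a hypothetical name, and treating an unset SPLUNK_MOCK as `0` is an assumption.

```python
def choose_backend(env: dict, splunk_reachable: bool) -> str:
    """Pick a data source: 'mock', 'splunk', or 'django' (fallback)."""
    # Assumption: SPLUNK_MOCK defaults to "0" when unset.
    if env.get("SPLUNK_MOCK", "0") == "1":
        return "mock"
    if splunk_reachable:
        return "splunk"
    # Splunk is down: fall back to the Django mirror if it is configured.
    if env.get("AGENTGUARD_BACKEND_URL"):
        return "django"
    raise RuntimeError("Splunk is unreachable and AGENTGUARD_BACKEND_URL is not set")
```

Passing `os.environ` as `env` plus the result of a reachability check gives the source a tool call would read from.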

5. Hackathon demo (24h sprint)

chmod +x scripts/setup.sh
./scripts/setup.sh

# Terminal 1
cd backend && python manage.py runserver 8001

# Terminal 2
python scripts/demo.py --traces 150 --backend-only

Layers added for demo day:

Layer	What	Files
Splunk AI Assistant	NL → SPL (rule-based fallback)	`mcp_server/ai_assistant.py`
MLTK / anomaly	`DensityFunction` or `anomalydetection`	`splunk_app/mltk_setup.py`, saved searches
Smart alerting	Splunk webhooks → Django	`splunk_app/default/alerts.conf`, `sdk/agentguard/alert_handler.py`

Live Splunk: set SPLUNK_MOCK=0, provide the SPLUNK_HEC_* variables and SPLUNK_REST_TOKEN, and optionally set SPLUNK_AI_ASSISTANT_ENABLED=1.
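To make the "rule-based fallback" in the first row concrete, here is a minimal sketch of keyword rules that turn a question into SPL. It is not the code in mcp_server/ai_assistant.py; `nl_to_spl` and its rules are hypothetical. It only uses the index, sourcetype, and `status=FAILED` filter shown earlier in this README, plus the standard `stats` and `head` commands.

```python
import re

# Base search taken from the Splunk HEC section of this README.
BASE = "index=main sourcetype=agentguard:trace"


def nl_to_spl(question: str) -> str:
    """Translate a plain-English question into SPL using simple keyword rules."""
    q = question.lower()
    spl = BASE
    # "failed", "failure", "failing", ... -> filter on failed spans.
    if re.search(r"\bfail", q):
        spl += " status=FAILED"
    # Counting questions aggregate; "latest/recent" questions truncate.
    if re.search(r"\b(how many|count)\b", q):
        spl += " | stats count"
    elif re.search(r"\b(latest|recent|last)\b", q):
        spl += " | head 10"
    return spl
```

For example, `nl_to_spl("How many failures?")` yields `index=main sourcetype=agentguard:trace status=FAILED | stats count`. Rules like these are brittle by design; they exist so the tool still answers when the AI Assistant is not enabled.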

Repo layout

Path	Purpose
`sdk/agentguard/`	SDK: `@trace_agent`, `@trace_tool`, HEC + backend exporters
`backend/`	Django + DRF span ingest
`demo/`	Demo agents (psutil infrastructure monitors): `run_demo.py`, `inject_failure.py`
`mcp_server/`	Splunk MCP tools + AI Assistant NL→SPL
`splunk_app/`	SPL saved searches, alerts, MLTK setup
`scripts/`	`setup.sh`, `demo.py` hackathon pipeline
`IMPLEMENTATION_PLAN.md`	Full build roadmap
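For readers new to decorator-based tracing, the sketch below shows the general idea behind something like `@trace_agent`. It does not import or reproduce the SDK: `trace_agent_sketch`, the `SPANS` list, and every span field except `status` (with `FAILED` as used in the search above) are assumptions made for illustration, including the `OK` value for success.

```python
import functools
import time
import uuid

SPANS: list = []  # hypothetical stand-in for an exporter queue


def trace_agent_sketch(name: str):
    """Decorator factory: record one span dict per call of the wrapped function."""
    def decorator(fn):
        @functools.wraps(fn)
        def wrapper(*args, **kwargs):
            span = {"span_id": uuid.uuid4().hex, "agent": name, "start": time.time()}
            try:
                result = fn(*args, **kwargs)
                span["status"] = "OK"  # assumed success value
                return result
            except Exception as exc:
                span["status"] = "FAILED"
                span["error"] = repr(exc)
                raise  # tracing must not swallow the agent's error
            finally:
                # Runs on both paths, so every call produces exactly one span.
                span["duration_ms"] = (time.time() - span["start"]) * 1000
                SPANS.append(span)
        return wrapper
    return decorator


@trace_agent_sketch("planner")
def plan(task: str) -> str:
    if not task:
        raise ValueError("empty task")
    return f"plan for {task}"
```

Calling `plan("deploy")` appends a span with `status` set to `OK`; calling `plan("")` raises `ValueError` and still appends a span, this time with `status` set to `FAILED`.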

Legacy PromptOps

Prompt/eval APIs under /api/v1/prompts/ remain for reference. New work uses AgentGuard spans only.

License

MIT
