Using the Aspire Dashboard

The .NET Aspire dashboard is your command center for developing and debugging CitadelMesh. This guide covers everything you need to know to maximize productivity.

Starting the Dashboard

Quick Start

cd /path/to/CitadelMesh/src/CitadelMesh.AppHost
dotnet run

Expected Output:

Building...
info: Aspire.Hosting.DistributedApplication[0]
      Aspire version: 8.0.0
info: Aspire.Hosting.DistributedApplication[0]
      Distributed application started. Press Ctrl+C to shut down.
info: Aspire.Hosting.DistributedApplication[0]
      Dashboard URL: https://localhost:5000

Navigate to: https://localhost:5000

CLI Route (recommended when AppHost can’t find the dashboard)

If you see errors like “CliPath/DashboardPath not found,” use the Aspire CLI. It brings its own dashboard runner and avoids path issues.

Ensure .NET 9 SDK is installed (9.0.300+)
Install the Aspire workload for that SDK
Verify the CLI: dotnet aspire --help
From src/CitadelMesh.AppHost, start via CLI

Troubleshooting tips:

If dotnet aspire isn’t found, install/repair the Aspire workload for the SDK version you’re using
Make sure dev certs are trusted if using HTTPS locally (dotnet dev-certs https --trust)
Prefer the CLI route on macOS when the AppHost path resolution fails

Advanced Startup Options

Custom Port

# Set custom port via environment variable
export ASPIRE_DASHBOARD_PORT=7000
dotnet run

Or update appsettings.Development.json:

{
  "Dashboard": {
    "Port": 7000
  }
}

Selective Service Startup

Edit src/CitadelMesh.AppHost/Program.cs to comment out services:

var builder = DistributedApplication.CreateBuilder(args);

// Core infrastructure (always needed)
var redis = builder.AddRedis("redis", port: 6379);
var postgres = builder.AddPostgres("postgres", port: 5432)
    .AddDatabase("citadel-db");

// Optional: Comment out if not needed
// var jaeger = builder.AddContainer("jaeger", "jaegertracing/all-in-one");
// var prometheus = builder.AddContainer("prometheus", "prom/prometheus");

var app = builder.Build();
app.Run();

Dashboard Features

1. Resources Tab

The Resources view shows all running services with real-time status.

Service Categories

Resource Type	Purpose	Default Port
redis	Caching & pub/sub	6379
postgres	Persistent storage	5432
nats	Event bus	4222
opa	Policy engine	8181
spire-server	Identity provider	8081
spire-agent	Workload attestation	-
jaeger	Distributed tracing	16686
agent-runtime	Python agent container	-

Resource Actions

Each resource has quick actions:

View Logs: Opens console output
View Details: Shows configuration
Restart: Graceful restart
Stop/Start: Manual control

Example: Restarting OPA

Find opa in Resources list
Click ⋮ menu
Select Restart
Monitor logs for successful reload

2. Console Logs Tab

Real-time log streaming from all services.

Filtering Logs

By Service:

Filter: service:opa
Shows only OPA logs

By Level:

Filter: level:error
Shows only errors across all services

By Message Pattern:

Filter: message:policy
Shows logs containing "policy"

Combined Filters:

Filter: service:agent-runtime level:error
Shows errors from agent runtime

Log Levels

TRACE - Verbose debug info
DEBUG - Development diagnostics
INFO - Normal operations
WARN - Potential issues
ERROR - Failures requiring attention
FATAL - Critical system errors

Example: Debugging Policy Denials

Filter: message:"Policy violation"

Result:
[12:34:56] ERROR [CitadelMesh.Safety] Policy violation: Door unlock denied
  Policy: citadel.security.door_unlock
  Input: {"action":"door_unlock","duration_seconds":600}
  Reason: Exceeded maximum duration (300s)

3. Structured Logs Tab

Advanced log analysis with filtering, grouping, and export.

Query Examples

Find all policy evaluations:

{
  "policy.result": {"$exists": true}
}

Find slow operations:

{
  "duration_ms": {"$gt": 1000}
}

Find agent errors:

{
  "service": "agent-runtime",
  "level": "error"
}

Grouping and Aggregation

Select Group By → service
Select Aggregate → count
Result: Log count per service

Export Logs

Apply filters
Click Export → JSON
Use for offline analysis or bug reports

4. Traces Tab

Distributed tracing with OpenTelemetry/Jaeger integration.

Trace View

Each trace shows:

Trace ID: Unique identifier
Duration: Total time
Spans: Individual operations
Status: Success/Error/Timeout

Example: Agent Execution Trace

Trace: agent.security.process_scenario
├─ span: monitor_events (12ms)
│  └─ span: camera_action.get_incidents (8ms)
├─ span: analyze_threat (5ms)
├─ span: coordinate_response (3ms)
├─ span: execute_door_control (45ms)
│  └─ span: mcp.schneider.lock_door (40ms)
└─ span: audit_log_response (2ms)

Total: 67ms

Finding Performance Bottlenecks

Click Traces tab
Sort by Duration (descending)
Click slow trace
Examine span waterfall
Identify bottleneck (longest span)

Trace Filtering

By Service:

service.name:agent.security

By Operation:

operation.name:execute_door_control

By Status:

status.code:ERROR

By Duration:

duration:&gt;1000ms

5. Metrics Tab

Real-time metrics and dashboards.

Available Metrics

Infrastructure:

Redis: Operations/sec, memory usage
PostgreSQL: Query count, connection pool
NATS: Messages/sec, queue depth

Application:

Agent executions
Policy evaluations
MCP tool invocations
Event processing latency

System:

CPU usage per container
Memory usage per container
Network I/O
Disk I/O

Creating Custom Dashboards

Click + New Dashboard
Add metrics:
- citadel_agent_executions_total
- citadel_policy_evaluations_total
- citadel_mcp_tool_duration_seconds
Choose visualization (line chart, bar chart)
Set refresh interval (5s, 30s, 1m)

Alerting

Set up alerts for critical metrics:

Select metric (e.g., error_rate)
Click Create Alert
Set threshold: > 5%
Configure notification (email, webhook)

6. Environment Variables Tab

View and edit environment variables for all services.

Viewing Variables

Click Environment tab
Select service (e.g., opa)

See all env vars:

OPA_LOG_LEVEL=debug
OTEL_EXPORTER_OTLP_ENDPOINT=http://jaeger:4317

Hot-Reloading Variables

Click Edit on variable
Change value (e.g., OPA_LOG_LEVEL=info)
Click Save
Service auto-restarts with new value

Note: Only supported for containerized services with restart policies.

Debugging Workflows

Debugging OPA Policy Denials

Scenario: Agent action is denied by policy

Steps:

Find the denial in logs:
- Console Logs → Filter: message:"Policy violation"
- Note the input and reason
Check policy evaluation:
- Structured Logs → Query: {"policy.result": false}
- View full policy input/output

Test policy locally:

# Create test input
cat > input.json <<EOF
{
  "action": "door_unlock",
  "duration_seconds": 600
}
EOF

# Evaluate policy
opa eval -i input.json -d policies/security.rego \
  'data.citadel.security.allow'

Fix policy or input:
- Edit policies/security.rego
- Policies auto-reload in OPA container
Verify in dashboard:
- Watch Console Logs for policy reload
- Re-run agent
- Check for successful evaluation

Debugging Agent State Machine

Scenario: Agent gets stuck in a state

Steps:

Find agent execution trace:
- Traces → Filter: service.name:agent.security
- Find stuck execution (long duration)
Examine span timeline:
- Click trace
- Look for incomplete spans or timeouts

Check agent logs:

Console Logs → Filter: service:agent-runtime level:debug

Look for state transitions:

DEBUG: State transition: monitor → analyze
DEBUG: State transition: analyze → coordinate_response
DEBUG: State transition: coordinate_response → STUCK

Inspect agent state:

Structured Logs → Query: {"agent.state": {"$exists": true}}

View state object:

{
  "status": "active",
  "current_state": "coordinate_response",
  "context": {...},
  "error": "Timeout waiting for MCP response"
}

Fix the issue:
- Add timeout handling in agent code
- Or fix MCP adapter issue

Debugging MCP Adapter Issues

Scenario: MCP tool invocation fails

Steps:

Check MCP server logs:

Resources → Find MCP container
View Logs

Look for errors:

ERROR: Failed to execute tool: set_temperature
Error: Connection refused to EcoStruxure API

Verify MCP server is running:

docker ps | grep mcp
# Should show running container

Test MCP server directly:

# List available tools
curl http://localhost:3001/tools

# Invoke tool
curl -X POST http://localhost:3001/tools/set_temperature \
  -H 'Content-Type: application/json' \
  -d '{"zone": "lobby", "temperature": 22.0}'

Check OPA policy for tool:
- MCP tools may be blocked by policy
- Console Logs → Filter: message:set_temperature
Enable mock mode for development:
- Environment → Select MCP server
- Edit ENABLE_MOCK_MODE=true
- Restart service

Hot Reload Workflows

.NET Microservices Hot Reload

Aspire supports .NET hot reload out of the box.

Steps:

Edit C# file (e.g., src/microservices/CitadelMesh.Safety/Program.cs)
Save file
Dashboard shows: 🔄 Reloading: safety-engine
Changes applied (no restart needed)

Limitations:

Method signature changes require restart
New dependencies require dotnet restore

OPA Policy Hot Reload

Policies auto-reload when files change.

Steps:

Edit policy: vim policies/security.rego
Save file

Dashboard Console Logs shows:

INFO: OPA bundle reloaded
INFO: Loaded policies: citadel.security

Test immediately (no restart)

Python Agent Development

Agents don't auto-reload, but you can use watchdog:

cd src/agents

# Install watchdog
pip install watchdog[watchmedo]

# Auto-restart on file change
watchmedo auto-restart \
  --pattern="*.py" \
  --recursive \
  -- python security/security_agent.py

Advanced Features

Custom Resource Health Checks

Add health check to custom service:

// In Program.cs
var myService = builder.AddContainer("my-service", "my-image")
    .WithHealthCheck("http://localhost:8080/health");

Dashboard will show health status with ✅/❌ indicator.

Resource Dependencies

Ensure services start in order:

var postgres = builder.AddPostgres("postgres");
var orchestrator = builder.AddProject<Projects.Orchestrator>("orchestrator")
    .WithReference(postgres)  // Waits for postgres
    .WaitFor(postgres);        // Explicit wait

External Services

Connect to external systems:

// External Redis
builder.AddConnectionString("external-redis", "redis://prod-redis:6379");

// Use in services
var myService = builder.AddProject<Projects.MyService>("my-service")
    .WithReference("external-redis");

Performance Tips

1. Reduce Log Volume

For better performance, reduce log verbosity in production-like testing:

{
  "Logging": {
    "LogLevel": {
      "Default": "Warning",
      "CitadelMesh": "Information"
    }
  }
}

2. Disable Unused Features

// Disable tracing if not needed
builder.Services.AddOpenTelemetry()
    .WithTracing(tracing => tracing.SetSampler(new AlwaysOffSampler()));

3. Use Persistent Volumes

Avoid recreating containers on each start:

var postgres = builder.AddPostgres("postgres")
    .WithDataVolume("citadel-postgres-data");  // Persists across restarts

Troubleshooting Dashboard Issues

Dashboard not accessible

Error: Cannot connect to https://localhost:5000

Solution:

# Check if AppHost is running
ps aux | grep CitadelMesh.AppHost

# Restart AppHost
cd src/CitadelMesh.AppHost
dotnet run

Services show as unhealthy

Error: Red ❌ indicators for all services

Solution:

# Check Docker
docker ps

# Restart Docker Desktop if needed
# Then restart Aspire
dotnet run

Logs not appearing

Issue: Console Logs tab is empty

Solution:

Check log level in appsettings.Development.json
Ensure "Aspire": "Debug" is set
Restart AppHost

Traces not showing

Issue: Traces tab shows "No traces"

Solution:

# Verify Jaeger is running
curl http://localhost:16686

# Check OTLP endpoint
curl http://localhost:4317

Next Steps

Docker Compose Alternative - Simpler deployment option
Writing OPA Policies - Policy development
Testing Agents - Agent debugging techniques
Production Deployment - Deploy to K3s

Dashboard Keyboard Shortcuts

Shortcut	Action
`Cmd/Ctrl + K`	Open command palette
`Cmd/Ctrl + F`	Focus search/filter
`Cmd/Ctrl + R`	Refresh current view
`Esc`	Clear filters
`?`	Show help overlay

Master the dashboard and you'll be 10x more productive! Continue to Docker Compose Setup.

Starting the Dashboard​

Quick Start​

CLI Route (recommended when AppHost can’t find the dashboard)​

Advanced Startup Options​

Custom Port​

Selective Service Startup​

Dashboard Features​

1. Resources Tab​

Service Categories​

Resource Actions​

2. Console Logs Tab​

Filtering Logs​

Log Levels​

Example: Debugging Policy Denials​

3. Structured Logs Tab​

Query Examples​

Grouping and Aggregation​

Export Logs​

4. Traces Tab​

Trace View​

Example: Agent Execution Trace​

Finding Performance Bottlenecks​

Trace Filtering​

5. Metrics Tab​

Available Metrics​

Creating Custom Dashboards​

Alerting​

6. Environment Variables Tab​

Viewing Variables​

Hot-Reloading Variables​

Debugging Workflows​

Debugging OPA Policy Denials​

Debugging Agent State Machine​

Debugging MCP Adapter Issues​

Hot Reload Workflows​

.NET Microservices Hot Reload​

OPA Policy Hot Reload​

Python Agent Development​

Advanced Features​

Custom Resource Health Checks​

Resource Dependencies​

External Services​

Performance Tips​

1. Reduce Log Volume​

2. Disable Unused Features​

3. Use Persistent Volumes​

Troubleshooting Dashboard Issues​

Dashboard not accessible​

Services show as unhealthy​

Logs not appearing​

Traces not showing​

Next Steps​

Dashboard Keyboard Shortcuts​

Starting the Dashboard

Quick Start

CLI Route (recommended when AppHost can’t find the dashboard)

Advanced Startup Options

Custom Port

Selective Service Startup

Dashboard Features

1. Resources Tab

Service Categories

Resource Actions

2. Console Logs Tab

Filtering Logs

Log Levels

Example: Debugging Policy Denials

3. Structured Logs Tab

Query Examples

Grouping and Aggregation

Export Logs

4. Traces Tab

Trace View

Example: Agent Execution Trace

Finding Performance Bottlenecks

Trace Filtering

5. Metrics Tab

Available Metrics

Creating Custom Dashboards

Alerting

6. Environment Variables Tab

Viewing Variables

Hot-Reloading Variables

Debugging Workflows

Debugging OPA Policy Denials

Debugging Agent State Machine

Debugging MCP Adapter Issues

Hot Reload Workflows

.NET Microservices Hot Reload

OPA Policy Hot Reload

Python Agent Development

Advanced Features

Custom Resource Health Checks

Resource Dependencies

External Services

Performance Tips

1. Reduce Log Volume

2. Disable Unused Features

3. Use Persistent Volumes

Troubleshooting Dashboard Issues

Dashboard not accessible

Services show as unhealthy

Logs not appearing

Traces not showing

Next Steps

Dashboard Keyboard Shortcuts