Skip to main content

Grafana Dashboard Catalog

Complete reference for all dashboards and data sources in Grafana.


Overview

Grafana provides observability dashboards for the RavenmaskOS platform. All dashboards are provisioned via configuration files and automatically updated.

URL: grafana.ravenhelm.dev


Dashboard Index

DashboardUIDTagsPurpose
System Overviewsystem-overviewsystem, containers, overviewPlatform health at a glance
Docker Logsdocker-logsdocker, logs, lokiContainer log search
Distributed Tracestracesobservability, tempo, tracesRequest tracing

Data Sources

SourceTypeURLPurpose
Prometheusprometheushttp://prometheus:9090Metrics
Lokilokihttp://loki:3100Logs
Tempotempohttp://tempo:3200Traces

Dashboards

System Overview

URL: grafana.ravenhelm.dev/d/system-overview UID: system-overview Tags: system, containers, overview

Main platform health dashboard showing resource utilization and container status.

┌─────────────────────────────────────────────────────────────────────────────┐
│ SYSTEM OVERVIEW │
├─────────────────┬─────────────────┬─────────────────┬───────────────────────┤
│ CPU Usage │ Memory Usage │ Disk Usage │ Running Containers │
│ 15% │ 65% │ 45% │ 47 │
├─────────────────┴─────────────────┴─────────────────┴───────────────────────┤
│ Container CPU Usage (timeseries) │
│ ████████████░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░ │
├──────────────────────────────────────────────────────────────────────────────┤
│ Container Memory Usage (timeseries) │
│ ██████████████████░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░ │
├──────────────────────────────────────────────────────────────────────────────┤
│ Container Network I/O (timeseries) │
│ ████░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░ │
├──────────────────────────────────────────────────────────────────────────────┤
│ Log Volume by Container (timeseries) │
│ ██████░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░ │
├──────────────────────────────────────────────────────────────────────────────┤
│ Recent Errors (All Containers) │
│ [2025-01-03 10:23] norns-agent: ConnectionError: Redis connection refused │
│ [2025-01-03 10:45] bifrost-api: TimeoutError: Tool execution timed out │
└──────────────────────────────────────────────────────────────────────────────┘

Panels:

PanelTypeData SourceDescription
CPU UsagestatPrometheusCurrent CPU utilization percentage
Memory UsagestatPrometheusCurrent memory utilization percentage
Disk UsagestatPrometheusCurrent disk utilization percentage
Running ContainersstatPrometheusCount of running Docker containers
Container CPU UsagetimeseriesPrometheusCPU usage by container over time
Container Memory UsagetimeseriesPrometheusMemory usage by container over time
Container Network I/OtimeseriesPrometheusNetwork traffic by container
Log Volume by ContainertimeseriesLokiLog line count by container
Recent Errors (All Containers)logsLokiLatest error-level log entries

Docker Logs

URL: grafana.ravenhelm.dev/d/docker-logs UID: docker-logs Tags: docker, logs, loki

Log exploration dashboard for searching and filtering container logs.

Panels:

PanelTypeData SourceDescription
Live LogslogsLokiReal-time log stream (auto-refresh)
Log Volume by ContainertimeseriesLokiLog volume breakdown by container
Log StreamlogsLokiFilterable log history

Variables:

  • container - Filter by container name
  • level - Filter by log level (error, warn, info, debug)

Example Queries:

# All errors from a specific container
{container="norns-agent"} |= "error"

# HTTP 500 errors from traefik
{container="traefik"} |= "500" |~ "error|Error"

# Slow database queries
{container="postgres"} |= "duration"

Distributed Traces

URL: grafana.ravenhelm.dev/d/traces UID: traces Tags: observability, tempo, traces

Distributed tracing dashboard for request flow analysis.

Panels:

PanelTypeData SourceDescription
Trace RatetimeseriesTempoTraces per second over time
P95 LatencytimeseriesTempo95th percentile request latency
Recent TracestracesTempoSearchable trace list with flamegraph

Use Cases:

  • Debug slow API requests
  • Trace request flow across services
  • Identify bottlenecks in multi-service calls

Quick Access URLs

DashboardDirect URL
System Overviewhttps://grafana.ravenhelm.dev/d/system-overview
Docker Logshttps://grafana.ravenhelm.dev/d/docker-logs
Traceshttps://grafana.ravenhelm.dev/d/traces
Explore (Loki)https://grafana.ravenhelm.dev/explore?orgId=1&left=["now-1h","now","Loki",{}]
Explore (Tempo)https://grafana.ravenhelm.dev/explore?orgId=1&left=["now-1h","now","Tempo",{}]

Provisioning

Dashboards are automatically provisioned from JSON files:

Location: /etc/grafana/provisioning/dashboards/ Update Interval: 30 seconds

FileDashboard
system-overview.jsonSystem Overview
docker-logs.jsonDocker Logs
traces.jsonDistributed Traces

To add a new dashboard:

  1. Create JSON file in /Users/ravenhelm/ravenhelm/services/grafana/provisioning/dashboards/
  2. Restart Grafana or wait for auto-sync

Alert Rules

Currently no alert rules are configured. Alerting is handled via:

  • n8n workflows (GitLab Pipeline → Alerting)
  • Uptime Kuma (service health checks)

See Also