Monitoring
Performance metrics, dashboards, and log aggregation
Avg Response: 23ms (p50)
P99 Latency: 142ms (across all services)
Error Rate: 0.08% (last 24h)
Uptime: 99.97% (30 day SLA)
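As a rough aid for reading the uptime figure, the sketch below converts an uptime percentage over a window into minutes of downtime. It assumes the 99.97% value is measured over the full 30-day window and that every minute counts equally; the function name `downtime_minutes` is hypothetical and not part of the dashboard.

```python
def downtime_minutes(uptime_percent: float, window_days: int) -> float:
    """Return the downtime, in minutes, implied by an uptime percentage."""
    total_minutes = window_days * 24 * 60  # 30 days -> 43,200 minutes
    return total_minutes * (100.0 - uptime_percent) / 100.0


if __name__ == "__main__":
    # 99.97% over 30 days leaves 0.03% of 43,200 minutes.
    print(f"{downtime_minutes(99.97, 30):.2f} minutes")
```

Under those assumptions, 99.97% over 30 days corresponds to roughly 13 minutes of downtime.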
Dashboards
Grafana-powered metrics visualization
Platform Overview
overview · 12 panels · Last viewed 2 hours ago
Service Health
service · 8 panels · Last viewed 35 min ago
Node Metrics
infrastructure · 6 panels · Last viewed 1 day ago
API Latency Analysis
custom · 4 panels · Last viewed 5 hours ago
Metrics
Prometheus targets
| Target | Endpoint | Status | Last Scrape | Scrape Duration | Samples |
|---|---|---|---|---|---|
| api-gateway | http://api-gateway:9090/metrics | up | 15s ago | 12ms | 1,247 |
| user-service | http://user-service:9090/metrics | up | 15s ago | 8ms | 892 |
| order-service | http://order-service:9090/metrics | up | 15s ago | 11ms | 743 |
| payment-service | http://payment-service:9090/metrics | up | 15s ago | 6ms | 421 |
| node-exporter | http://planet-01:9100/metrics | up | 30s ago | 23ms | 3,891 |
| postgres-exporter | http://users-db:9187/metrics | up | 30s ago | 45ms | 567 |
Logs
Loki log aggregation
14:23:01.432 INF [api-gateway] → POST /api/v1/orders 201 89ms consumer=web-app
14:23:01.510 INF [user-service] cache hit uid=8a3f2 latency=0.3ms
14:23:02.101 WRN [order-service] slow query: SELECT * FROM orders WHERE... (342ms)
14:23:02.450 DBG [payment-service] stripe webhook received event=payment_intent.succeeded
14:23:02.892 INF [frontend] SSR render /dashboard 45ms cache=HIT
14:23:03.220 ERR [payment-service] stripe webhook sig verification failed req=w9x2k
14:23:03.567 INF [api-gateway] → GET /api/v1/users/me 200 12ms consumer=mobile-app
14:23:03.891 INF [order-service] order ord_29f1k created total=€49.99
14:23:04.102 WRN [api-gateway] rate limit 85/100 rpm for consumer=partner-api
14:23:04.334 INF [user-service] jwt refreshed uid=3b7e1 exp=+1h
14:23:04.567 ERR [notification-svc] SMTP connection timeout after 30s host=smtp.eu.mailgun.org
14:23:05.001 INF [auth-service] token issued realm=production client=web-app sub=sarah@startup.com
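The log lines above appear to follow a `time LEVEL [service] message` layout. The following is a minimal sketch of parsing that layout and counting entries per level. It assumes the four level codes seen here (DBG, INF, WRN, ERR) are the only ones in use, and it tolerates missing spaces between the fields. The names `LogEntry`, `parse_line`, and `count_by_level` are hypothetical helpers, not part of Loki or of this platform.

```python
import re
from collections import Counter
from typing import Iterable, NamedTuple, Optional

# time, level, [service], message; whitespace between fields is optional.
LINE_RE = re.compile(
    r"^(?P<time>\d{2}:\d{2}:\d{2}\.\d{3})\s*"
    r"(?P<level>DBG|INF|WRN|ERR)\s*"
    r"\[(?P<service>[^\]]+)\]\s*"
    r"(?P<message>.*)$"
)


class LogEntry(NamedTuple):
    time: str
    level: str
    service: str
    message: str


def parse_line(line: str) -> Optional[LogEntry]:
    """Parse one log line; return None if it does not match the layout."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    return LogEntry(**match.groupdict())


def count_by_level(lines: Iterable[str]) -> Counter:
    """Count parsed entries per level, skipping lines that do not parse."""
    entries = (parse_line(line) for line in lines)
    return Counter(entry.level for entry in entries if entry is not None)


if __name__ == "__main__":
    sample = [
        "14:23:03.220 ERR [payment-service] stripe webhook sig verification failed req=w9x2k",
        "14:23:04.102 WRN [api-gateway] rate limit 85/100 rpm for consumer=partner-api",
        "14:23:04.334 INF [user-service] jwt refreshed uid=3b7e1 exp=+1h",
    ]
    for raw in sample:
        print(parse_line(raw))
    print(count_by_level(sample))
```

In a Loki deployment this kind of filtering would normally be done server-side in a query; a client-side parser like this is mainly useful for ad hoc analysis of exported lines.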