Alert Center
Centralized alert management and notification hub
Critical: 4
Warning: 8
Info: 12
Resolved (24h): 156
Alert Rules: 42
Active Alerts
All (24)
Critical (4)
Warning (8)
Info (12)
Redis Memory Critical - 95% Usage
REDIS-CACHE-01 memory usage has exceeded the 95% threshold. Immediate action is required to prevent an out-of-memory (OOM) condition.
Backup Failed - ORACLE-HR
Full backup job failed with error: ORA-19502 - write error on backup piece
High CPU Usage - PROD-SQL-03
CPU usage has been above 75% for the last 15 minutes. Current: 78%
Disk Space Low - PROD-SQL-01
Drive E: is at 85% capacity. Estimated 12 days until full at current growth rate.
Index Maintenance Recommended
5 indexes on PROD-SQL-01 have fragmentation above 30% and should be rebuilt.
Connection Pool Exhaustion - Resolved
Azure SQL connection pool returned to normal after auto-scaling triggered.
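
The disk-space alert above quotes an estimate of 12 days until full. A linear projection of that kind might be computed roughly as follows. This is only a sketch: the function name is hypothetical, and the growth rate of 1.25 percentage points per day is an assumption chosen to be consistent with the 85% and 12-day figures, not a value shown on the dashboard.

```python
def days_until_full(used_pct: float, growth_pct_per_day: float) -> float:
    """Linear estimate of the days left before a volume reaches 100% capacity.

    used_pct: current usage as a percentage of capacity (0-100).
    growth_pct_per_day: assumed constant growth, in percentage points per day.
    """
    if growth_pct_per_day <= 0:
        # No growth (or shrinking usage): the volume never fills under this model.
        return float("inf")
    return max(0.0, (100.0 - used_pct) / growth_pct_per_day)


# Drive E: at 85% with an assumed growth of 1.25 points per day.
print(days_until_full(85.0, 1.25))  # 12.0
```

A real forecast would likely fit the growth rate from recent usage samples rather than take it as a constant.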
Alert Rules
CPU > 75%
All servers • 5 min duration
Memory > 90%
All servers • Immediate
Disk > 85%
All servers • Immediate
Backup Failed
All databases • Immediate
Replication Lag > 5min
All replicas • 2 min duration
Blocking > 30 sec
Production only • Immediate
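
The rules above pair a threshold with either an "Immediate" trigger or a minimum duration. One way such a rule could be evaluated is sketched below. The `Rule` class, the `is_firing` function, and the sample format are all hypothetical; the dashboard does not show how its rules are actually implemented.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Rule:
    name: str
    threshold: float  # the metric must be strictly above this value
    duration_s: int   # 0 stands for "Immediate"


def is_firing(rule: Rule, samples: List[Tuple[float, float]], now: float) -> bool:
    """Return True if the rule should fire.

    samples: (timestamp_seconds, value) pairs, sorted oldest first.
    now: current time in the same units as the sample timestamps.
    """
    if not samples:
        return False
    latest_ts, latest_value = samples[-1]
    if latest_value <= rule.threshold:
        return False
    if rule.duration_s == 0:
        return True
    # Walk backwards to find where the current unbroken breach started.
    breach_start = latest_ts
    for ts, value in reversed(samples):
        if value <= rule.threshold:
            break
        breach_start = ts
    return now - breach_start >= rule.duration_s


cpu_rule = Rule("CPU > 75%", threshold=75.0, duration_s=5 * 60)
memory_rule = Rule("Memory > 90%", threshold=90.0, duration_s=0)

cpu_samples = [(0, 70.0), (60, 78.0), (120, 79.0), (360, 78.0)]
print(is_firing(cpu_rule, cpu_samples, now=360))       # True: above 75% since t=60
print(is_firing(memory_rule, [(360, 95.0)], now=360))  # True: immediate rule
```

Keeping the duration on the rule itself means "Immediate" and "5 min duration" rules share one code path.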
Notification Channels
Email - DBA Team
dba-team@company.com
Slack - #db-alerts
Critical & Warning only
PagerDuty
Critical only • 24/7
SMS - On-Call
Critical only • After hours
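
The channel settings above amount to a routing table keyed on severity and time of day. A minimal sketch of that routing follows. Several details are assumptions rather than anything the dashboard states: email is treated as receiving every severity (no filter is listed for it), "after hours" is taken to mean outside 09:00 to 17:00 local time, weekends are not handled, and the function names and channel labels are hypothetical.

```python
from datetime import datetime, time
from typing import List

# Assumed business hours; the dashboard does not define "after hours".
BUSINESS_START = time(9, 0)
BUSINESS_END = time(17, 0)


def is_after_hours(when: datetime) -> bool:
    """True when the timestamp falls outside the assumed business hours."""
    return not (BUSINESS_START <= when.time() < BUSINESS_END)


def channels_for(severity: str, when: datetime) -> List[str]:
    """Return the channels an alert of the given severity would be sent to."""
    # Assumption: the DBA team mailbox receives all severities.
    channels = ["email:dba-team@company.com"]
    if severity in ("critical", "warning"):
        channels.append("slack:#db-alerts")
    if severity == "critical":
        channels.append("pagerduty")  # critical only, 24/7
        if is_after_hours(when):
            channels.append("sms:on-call")  # critical only, after hours
    return channels


print(channels_for("critical", datetime(2024, 1, 1, 2, 0)))
print(channels_for("warning", datetime(2024, 1, 1, 10, 0)))
```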