Skip to content

Monitoring system and incident detection ​

Cloud-IAM employs a multi-layered monitoring and alerting framework to proactively detect incidents across all managed Keycloak deployments.
This approach ensures rapid detection, timely escalation, and swift remediation, minimizing downtime and safeguarding the high availability of customer environments.

How monitoring works ​

To provide robust and independent validation of service availability, Cloud-IAM uses an external monitoring solution that continuously performs health checks on all managed deployments. Monitoring is conducted from four geographically diverse regions to ensure resilience and detect regionalized issues:

  • Australia
  • North America
  • Europe
  • Asia

From each location, monitoring agents generate authentication tokens every 30 seconds.
These tests validate both deployment availability and response latency, enabling Cloud-IAM to detect anomalies such as service degradation, network instability, or authentication failures with high accuracy.

What Cloud-IAM Monitor ​

Cloud-IAM continuously monitors the health and availability of all Keycloak deployments through automated probes and performance checks. An incident is declared when:

  • Token generation - All monitoring probes fail to obtain a token for a duration of at least 2 consecutive minutes.
  • High response times β€” Response time consistently exceed 5 seconds.
  • Unexpected status codes β€” The system returns an error or response that deviates from the expected behavior.

Incident resolution process ​

  1. Once an incident is triggered, the Cloud-IAM on-call team (24/7) is immediately notified and begins investigation and remediation.
  2. The issue is mitigated or fully resolved by the on-call team.
  3. The incident remains active until all monitors confirm successful recovery for at least 5 consecutive minutes.
  4. At that point, the incident is automatically resolved from the status page and the deployment’s availability status is updated.

This monitoring and escalation process ensures that disruptions are detected rapidly and resolved with minimal impact on customer services.

Notifications and Status Tracking ​

Cloud-IAM provides multiple channels to help you track the status of your Keycloak deployments and stay informed during incidents:

Reporting an Incident ​

Customers can also initiate reports: