Observability
Astronete provides monitoring capabilities by exposing various performance and operational metrics. These metrics allow to gain insight into the system’s health, performance, and behavior, ensuring that you can take proactive measures to maintain system stability.
Metrics
The metrics are exposed in Prometheus format, which is a widely-adopted open-source standard for monitoring. This format enables seamless integration with Prometheus-based monitoring solutions.
Assets by status
The status of each asset managed by the operator: KubernetesClusters, Buckets and Databases.
Prometheus metric: astronetes_asset_status.
Status values: Ready, Progressing, Terminating, Unknown or Failed.
Synchronizations by status
The status of each synchronization object: Synchronization, SynchronizationPlan and LiveSynchronization.
Prometheus metric: astronetes_synchronization_status.
Status values: Ready, Progressing, Terminating, Unknown or Failed.
Total synchronized objects by status
The count of synchronized objects by status.
Prometheus metric: astronetes_total_synchronized_objects.
Status values: Sync, OutOfSync or Unknown.
Alerts
Based on the exposed metrics, alerting can be configured using the widely-adopted open-source standard PrometheusRules. This format enables seamless integration with Prometheus-based monitoring solutions.
Platform alerts
The following alerts reports a possible issue with the platform.
| Alert Name | Description | Severity | Duration |
|---|---|---|---|
| AssetFailure | At least one asset is failing | critical | 5 minutes |
| SynchronizationFailure | At least one synchronization is failing | critical | 5 minutes |
Applications alerts
The following alerts reports a possible issue with the objects configured to be synchronized. Those alerts are usually related to applications issues.
| Alert Name | Description | Severity | Duration |
|---|---|---|---|
| SynchronizationNotInSync | There are synchronizations items out of sync | warning | 1 hour |
| WriteOperationsFailed | One or more write operations failed | warning | 1 hour |