Monitoring Infrastructure
The Infrastructure Monitoring service uses API-based monitoring for cloud provider-level metrics and agent-based monitoring for a detailed view of operating system metrics, including databases and applications. Your cloud resources are automatically discovered and included in your service.
Deployment of Operating System agents is controlled by changing a SoftwareOne-specific agent deployment tag on the desired resource. We then use Automation to deploy the agents within 24 hours of detecting the tags on your resource.
For Azure infrastructure:
Items with defined thresholds are monitored using SoftwareOne’s Azure Monitoring Baseline.
SoftwareOne have defined Incident Responses for each monitored item.
Items without defined thresholds and/or not present in the Azure Monitoring Baseline are not monitored.
Monitor type | Metric | Trouble limit | Trouble poll count | Critical limit | Critical poll count |
---|---|---|---|---|---|
App Service Plans | CpuPercentage | 80 | 10 | 95 | 10 |
App Service Plans | MemoryPercentage | 80 | 10 | 90 | 5 |
App Services | Http4xx | 20 | 3 | 100 | 3 |
App Services | Http5xx | 10 | 3 | 50 | 3 |
Azure SQL Database | dtu_consumption_percent | 80 | 10 | 90 | 10 |
Azure SQL Database | storage_percent | 80 | 15 | 90 | 15 |
Azure SQL Database | Sessions_percent | 80 | 10 | 95 | 10 |
Container Instances | CpuUsage | 80 | 10 | 95 | 10 |
Container Instances | MemoryUsage | 80 | 10 | 90 | 5 |
Load Balancer | Health Probe Status | 98 | 3 | 95 | 3 |
Load Balancer | Data Path Availability | 98 | 3 | 95 | 3 |
Network Connection | Up/Down | N/A | N/A | N/A | N/A |
Network Interfaces | ResourceDeleted | N/A | N/A | N/A | N/A |
Storage Accounts | ResourceDeleted | N/A | N/A | N/A | N/A |
Storage Accounts | Availability | <100 | 10 | N/A | N/A |
Storage Accounts | SuccessE2ELatency | >200 | 10 | N/A | N/A |
Virtual Machines (API) | CPU Utilization threshold | 85 | 10 | 95 | 10 |
Virtual Machines (Agent) | Memory Utilization threshold | 90 | 15 | 95 | 15 |
Virtual Machines (Agent) | Disk Utilization threshold | 80 | 15 | 90 | 15 |
Virtual Machines (Agent) | Partition Disk Utilization threshold(%) | 90 | 5 | N/A | N/A |
Virtual Networks | ResourceDeleted | N/A | N/A | N/A | N/A |
VNet Gateway | ResourceDeleted | N/A | N/A | N/A | N/A |
VNet Gateway | P2SConnectionCount | >115 | 10 | N/A | N/A |
Thresholds for any other resource type that does not have a default threshold can be specified during onboarding in the Operations Definition, or by raising a Service Request in BAU.
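The following is a minimal sketch of how a limit and poll count pair from the table could combine into a status. It is an illustration only, not SoftwareOne's implementation. It assumes that a level is raised once the most recent consecutive polls, as many as the poll count, all breach the limit, and that a breach means strictly above the limit (or strictly below it for availability-style metrics such as the Load Balancer rows). The names `Level`, `breached`, `evaluate` and the `"OK"` status are hypothetical.

```python
import operator
from dataclasses import dataclass
from typing import Callable, Optional, Sequence


@dataclass(frozen=True)
class Level:
    """One threshold level from the table: a limit and its poll count."""
    limit: float
    poll_count: int


def breached(samples: Sequence[float], level: Level,
             compare: Callable[[float, float], bool] = operator.gt) -> bool:
    """Return True if the last `poll_count` samples all breach the limit.

    Assumption: polls must breach consecutively; the real service may use
    a different strategy (for example an average over the polls).
    """
    if len(samples) < level.poll_count:
        return False
    recent = samples[-level.poll_count:]
    return all(compare(value, level.limit) for value in recent)


def evaluate(samples: Sequence[float], trouble: Level,
             critical: Optional[Level] = None,
             compare: Callable[[float, float], bool] = operator.gt) -> str:
    """Map recent samples to a status; Critical takes precedence over Trouble."""
    if critical is not None and breached(samples, critical, compare):
        return "Critical"
    if breached(samples, trouble, compare):
        return "Trouble"
    return "OK"  # hypothetical name for the healthy state


# App Service Plans, MemoryPercentage: Trouble 80 over 10 polls, Critical 90 over 5 polls.
memory_trouble = Level(limit=80, poll_count=10)
memory_critical = Level(limit=90, poll_count=5)
print(evaluate([85] * 10, memory_trouble, memory_critical))

# Load Balancer, Health Probe Status: lower values are worse, so compare with "less than".
probe_trouble = Level(limit=98, poll_count=3)
probe_critical = Level(limit=95, poll_count=3)
print(evaluate([100, 97, 96, 97], probe_trouble, probe_critical, compare=operator.lt))
```

Under these assumptions, rows with N/A in the Critical columns would be evaluated with `critical=None`, so they can only ever reach the Trouble level.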
This section includes the topics: