Thresholds

Infraon allows users to define health indexes to monitor the network's performance. Thresholds play an essential role in tracking the fault and performance of devices and detecting faults and performance-based alerts. These faults and performance are indicated using severity levels as follows:

  • Critical: Indicated in Red

  • Major: Indicated in Orange

  • Minor: Indicated in Yellow

  • Informational: Indicated in Green

Furthermore, there are two state-based indicators which are:

  • Unknown/Dependent: Indicated in Blue – when the parent/primary device is down, the dependent/Child device’s status is unknown.

  • Under Maintenance: Indicated in Grey, this is when a device/node is placed under maintenance for a planned outage.

Thresholds can be defined for any monitoring KPI (Key Performance Indicators). Infraon is configured with built-in (global) thresholds for all the monitoring nodes and their components. Based on these thresholds, users can quickly spot resources that require attention even before thresholds are breached.

Default threshold configuration covers important indicators like:

  • CPU Utilization

  • Memory Utilization

  • Disk Utilization

  • Latency

  • Packet Loss

  • Network Bandwidth Utilization

  • BGP Status

  • Error Rate

  • Discard Rate

  • Process CPU Usage

  • Process Memory Usage

Infraon offers multiple threshold configurations where the administrator can configure threshold settings for a specific resource or as a group. Infraon allows the user to define the following:

  • Set Point

  • Reset Point

  • Set Message

  • Reset Message

  • Set Alarm Hold Time

  • Reset Alarm Hold Time

  • Severity

Alarm Condition Logic

For the positive polarity parameters:

If the Current Value is > Set Point, the node/device gets into an alarm state and continues to be in an alarm state until the value <= Reset Point.

For the negative polarity parameters:

If the Current Value is < Set Point, the node/device gets into an alarm state and continues to be in an alarm state until the value >= Reset Point.

In addition to the standard/global thresholds across the network, the Adaptive/Dynamic threshold or baseline helps determine the performance issue of every component within the network based on its performance benchmarks and expectations. It helps eliminate unwanted alerts, thus resulting in focused business operations. Infraon UNMS supports an ML-based algorithm for auto-baselining thresholds for various performance metrics.

Thresholds are mainly for Performance metrics such as Traffic Utilization, Latency, Jitter, CPU Utilization, Memory Utilization, Temperature, etc. Users can define multiple thresholds with different severity levels for a single monitoring parameter. For e.g.,

CPU Utilization Thresholds can be defined as follows:

If CPU Utilization > 50%, severity is ‘Minor’

If CPU Utilization > 75%, severity is ‘Major’

If CPU Utilization > 90%, severity is ‘Critical’

What you see on the screen

This page displays the list of predefined thresholds on Infraon. We recommend you not make any changes to the default values. However, these values can be edited or deleted using the action icon on each line item. Information display includes:

Label

Description/Example

Category

Denotes categories of the Threshold like Performance, Availability, etc.

Threshold Name

Name of the Threshold parameter like Access Point, Interface, etc.

Statistics

Denotes the statistic applicable for the Threshold, i.e., Device Availability, Overflow Rate, Packet Loss, etc.

Severity

Indicates severity in case of threshold breach (Critical, Major, Minor, Informational).

Alarm Message

Displays the message to be included in the alarm/notification raised.

Set Point

Denotes the configured Threshold point or Set Point.

Status

Displays the status of the selected Threshold. Thresholds can be enabled/disabled using the edit option.

Action

Displays action icons to edit or delete the selected Threshold.

Note: Disabling Thresholds results in non-monitoring of devices for the particular Thresholds. Alarms are not raised for the same.

The Device Credential View can be toggled between the card and list views using the respective icon.

Instructions to 'Add a Threshold'

  • Go to Infraon Configuration -> IT Operations-> Threshold

  • Click on 'Add' and select 'Protocol' as desired.

There are two tabs on the 'Add Threshold' page. Refer to the table for information.

Category | Applicable Devices

Few fields vary based on the protocol selected. Refer to the 'Description' column for details.

Label

Action

Description/Example

Threshold Name*

Add a name for the threshold.

Name is usually used as an identifier. E.g., Access Point Availability, CPU Utilization, etc.,

Status

Use the toggle to mark the threshold active/inactive.

Infraon monitors only active thresholds.

Category*

Select the threshold category using the dropdown menu.

Available options are performance, basic, major, and informational.

Statistic*

Select statistics using the dropdown menu.

Multiple options are available.

Severity*

Select severity using the dropdown.

Available options are critical, major, and minor.

Type

Select if the Threshold type is Rising or Falling.

SetPoint

A setpoint is a baseline for the threshold. Setpoint includes Threshold*, Hold Time, and Breach Count.

Threshold* - Add value for the threshold.

Hold Time (Optional) - Add hold time in seconds. An alarm or notification will be raised if and only the threshold value crosses the mentioned hold time.

Breach Count (Optional) - Add breach count for the threshold. An alarm or notification will be raised if and only the threshold value crosses the hold time, the mentioned number of times.

Alarm Message

Add a message for the threshold.

This message will be included in the threshold breach alarm/notification.

Reset point

Add a reset point for the threshold. Reset point includes Threshold* and Hold Time.

Threshold* - Add value for the reset point threshold.

Hold Time (Optional) -Add hold time in minutes. Denotes the no. of minutes the node has to be equal to or below the Reset point to be reported as within the Threshold Limit.

Clear Message

Add a message for the reset point.

This message will be included in the threshold reset alarm/notification.

Threshold configuration

Customize the asset-level details

Customize the asset-level details in the threshold configuration module and input details like asset ID, IP address, Hostname, and more. This permits configuring thresholds applicable for all selected devices and triggering an alert when a threshold breaches.

Once all the parameters are defined, click ‘Save’ to save the credentials or click 'Next' to apply thresholds to a specific device or a set of devices.

Category | Applicable Devices

Label

Action

Description/Example

Applicable For*

Select if the threshold must be applied for selected nodes or all devices.

Add filter conditions if the 'Selected Nodes' option is selected.

The filter includes a condition (informational, minor, major, critical) and a value.

Click 'Submit.' Saved thresholds can be edited or deleted using the respective icons.

Last updated