-
Notifications
You must be signed in to change notification settings - Fork 576
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
NMS-15584: Update to alarm docs (#6033)
* NMS-15584: Some improvements in the docs are required to answer the following questions * Apply suggestions from code review Co-authored-by: Bonrob2 <[email protected]> * NMS:15584: fix alarm into link * Apply suggestions from code review Co-authored-by: mmahacek <[email protected]> * Update docs/modules/operation/pages/deep-dive/alarms/alarm-handling.adoc Co-authored-by: mmahacek <[email protected]> * Added the missing screenshots * NMS-15584: Alarm doc updates * Apply suggestions from code review Co-authored-by: Bonrob2 <[email protected]> --------- Co-authored-by: Bonrob2 <[email protected]> Co-authored-by: Mark Mahacek <[email protected]>
- Loading branch information
1 parent
533fa9d
commit 640c388
Showing
17 changed files
with
170 additions
and
102 deletions.
There are no files selected for viewing
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
68 changes: 68 additions & 0 deletions
68
docs/modules/operation/pages/deep-dive/alarms/alarm-example.adoc
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,68 @@ | ||
|
||
= Alarm Lifecycle | ||
|
||
The following is an example of the alarm lifecycle based on a `nodeLostService` event. | ||
|
||
== Lifecycle example | ||
|
||
A new `nodeLostService` event is received and creates a new alarm. | ||
|
||
.New alarm visible in outstanding alarm list | ||
image::alarms/single_alarm_1.png["New alarm visible in outstanding alarm list] | ||
|
||
Clicking the number displayed in the *Count* column displays the corresponding events and their details. | ||
|
||
.Event list showing events related to the alarm | ||
image::alarms/single_alarm_2.png["Event list showing events related to the alarm] | ||
|
||
The alarm clears automatically when service is restored, based on a `nodeRegainedService` event. | ||
|
||
.Alarm cleared | ||
image::alarms/single_alarm_3.png["Alarm List displaying one cleared alarm and its log message"] | ||
|
||
.Service down and service restored events | ||
image::alarms/single_alarm_4.png["Event list page displaying one service down event and one service restored event"] | ||
|
||
If the problem occurs again, the events are reduced into the existing alarm. | ||
The alarm's count is updated to reflect the new activity. | ||
|
||
.Alarm reopened with an increase in the `count` value | ||
image::alarms/single_alarm_5.png["Alarm List displaying one alarm with a count of 2"] | ||
|
||
.Event list showing events related to the alarm | ||
image::alarms/single_alarm_6.png["Detailed event list page displaying two service down events and one service restored event, all of which are components of the same alarm"] | ||
|
||
The alarm once again clears immediately when service is restored. | ||
|
||
.Reduced alarm cleared | ||
image::alarms/single_alarm_7.png["Alarm List displaying one cleared alarm with a count of 2, and its log message"] | ||
|
||
Note that the alarm's count only increments on events with a severity of Warning or greater. | ||
|
||
.Service down and restored events | ||
image::alarms/single_alarm_8.png["Detailed event list page displaying two service down events and two service restored events, all of which are members of the same alarm"] | ||
|
||
== Alarm lifetime rules | ||
|
||
Alarms are deleted from the {page-component-title} database after a set amount of time. | ||
This lifetime can be configured via Drools rules in the `$\{OPENNMS_HOME}/etc/alarmd/drools-rules.d/alarmd.drl` file. | ||
The default alarm lifetimes: | ||
|
||
[options="autowidth"] | ||
|=== | ||
| Alarm State | Deletion Delay | ||
|
||
| Cleared and Unacknowledged | ||
| 5 minutes | ||
|
||
| Cleared and Acknowledged | ||
| 1 day | ||
|
||
| Active and Unacknowledged | ||
| 3 days | ||
|
||
| All other alarms | ||
| 8 days | ||
|=== | ||
|
||
These delays are based on the last event time, and will restart the counter if a new problem event is reduced into the same alarm. |
71 changes: 71 additions & 0 deletions
71
docs/modules/operation/pages/deep-dive/alarms/alarm-handling.adoc
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,71 @@ | ||
|
||
= Alarm Handling | ||
|
||
The following are ways you can interact with alarms. | ||
|
||
== Acknowledgment | ||
|
||
Users can acknowledge alarms to let other {page-component-title} users see that someone is aware of the alarm. | ||
The alarm will be moved out from `Alarm(s) outstanding` into the `Alarm(s) acknowledged` view. | ||
Acknowledged alarms will also be hidden from the "Nodes with Pending Problems" section of the home page. | ||
|
||
.Acknowledged alarm of an HTTP outage in the alarm overview | ||
image::alarms/acked_alarm_overview.png["Acknowledged alarm of an HTTP outage in the alarm overview"] | ||
|
||
.Acknowledged alarm of an HTTP outage in detail view | ||
image::alarms/acked_alarm_detail.png["Acknowledged alarm of an HTTP outage in detail view"] | ||
|
||
== Clearing | ||
|
||
Clearing an alarm means "mark it as resolved". | ||
|
||
.Cleared alarm of an HTTP outage in the alarm overview | ||
image::alarms/cleared_alarm_overview.png["Cleared alarm of an HTTP outage in the alarm overview"] | ||
|
||
.Cleared alarm of an HTTP outage in detail view | ||
image::alarms/cleared_alarm_detail.png["Cleared alarm of an HTTP outage in detail view"] | ||
|
||
== Escalation | ||
|
||
By default, an alarm has the same <<deep-dive/events/event-configuration.adoc#severities, severity>> as its most recent event. | ||
If an alarm gets escalated, the alarm's severity increases by one level. | ||
|
||
.Escalated alarm of an HTTP outage in the alarm overview | ||
image::alarms/escalated_alarm_overview.png["Escalated alarm of an HTTP outage in the alarm overview"] | ||
|
||
.Escalated alarm of an HTTP outage in detail view | ||
image::alarms/escalated_alarm_detail.png["Escalated alarm of an HTTP outage in detail view"] | ||
|
||
== Related events | ||
|
||
{page-component-title} correlates possible related events into alarms based on events with the same <<deep-dive/alarms/configuring-alarms#ga-reduction-key, reduction key>>. | ||
You can use the related events section of the alarm details view to see which events have been grouped into the alarm. | ||
|
||
|
||
.Alarm Related Events page | ||
image::alarms/alarm_related-events.png["Alarm Related Events page displaying related events.", 850] | ||
|
||
== Alarm notes | ||
|
||
Alarm notes let you assign comments to a specific alarm, or to a whole class of alarms, and share that information with other people on your team. | ||
|
||
.Alarm Details page with sample notes | ||
image::alarms/01_alarm-notes.png["Alarm Details page displaying sample notes in the Sticky Memo and Journal Memo boxes", 850] | ||
|
||
You can add two types of notes to existing alarms or alarm classes: | ||
|
||
Sticky Memo:: A user-defined note for a specific instance of an alarm. | ||
Deleting the alarm also deletes any associated sticky memos. | ||
Journal Memo:: A user-defined note for a class of alarms, based on the resolved reduction key. | ||
Journal memos are shown for all alarms that match a specific reduction key. | ||
Deleting an individual alarm does not remove the journal memo. | ||
You must click *Clear* on an alarm with an associated journal memo to remove the memo. | ||
|
||
The Alarm List Summary and Alarm List Detail pages will have a symbol to indicate if individual alarms have associated sticky or journal memos. | ||
|
||
[[ga-advanced-alarm-handling]] | ||
== Advanced alarm handling | ||
|
||
In addition to the manual actions described above, it is possible to automate alarm handling with the use of https://www.drools.org/[Drools] scripts. | ||
There is a default rule set for handling alarm cleanup in the `$\{OPENNMS_HOME}/etc/alarmd/drools-rules.d/` directory. | ||
You can find some additional examples in the `$\{OPENNMS_HOME}/etc/examples/alarmd/drools-rules.d/` directory. |
20 changes: 0 additions & 20 deletions
20
docs/modules/operation/pages/deep-dive/alarms/alarm-notes.adoc
This file was deleted.
Oops, something went wrong.
7 changes: 0 additions & 7 deletions
7
docs/modules/operation/pages/deep-dive/alarms/alarm-related-events.adoc
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters