Monitor alert not auto resolving
I have an issue where some of my monitor alerts are not auito resolving even once the issue, that caused the alert" is fixed. For example, I currently have a critical alert raised by the "SQL Server Integrations Services Windows Service" indicating
that the service has stopped. When I look at the server the service is running. In looking at health explorer I can see that the alert was raised 2 days ago, but 10 minutes after it was raised it went to a "non monitored status" and then 2 minutes
later went to a healthy state, but the alert never auto closed. I presume an agent issue occurred and then lost it's ability to detect the state which would allow it to auto resolve, but am not sure. I have seen this with other monitors as
well. Can someone explain this or know what the proper way to recover is? I have tried resetting the health state, recycling the service on the agent, but to know avail. The only way I know how to resolve would be to manually close the alert,
but I understand that this is not advisable since it is a monitor.
October 4th, 2010 8:00pm
Hi Keith
Given that the monitor itself is in a healthy state, there is no issue with manually closing the alert. The main issue is that (as you are aware), you shouldn't close an alert when then health is still unhealthy.
It is difficult to troubleshoot after the event (especially after a couple of days) but it is more likely to be either a maintenance mode issue (the not monitored status tends to suggest it went into maintenance mode) or that the root management server was
over stretched. It might be worth keeping an eye out for whether this is happening often.
Cheers
GrahamCheers Graham View OpsMgr tips and tricks at
http://systemcentersolutions.wordpress.com/
Free Windows Admin Tool Kit Click here and download it now
October 4th, 2010 8:15pm
Thanks Graham for your quick response and clarification.
October 4th, 2010 8:25pm