How to findout the Servers are in GRAY State and Create a report
Hi Team,
We have SCOM2007 SP1 and 1800+ Agents and 4 Management Servers & 2 Gateway Servers.
Our Porblem is daily few servers stops communicating and not getting health service failure alert too. So daily we need to check manually which server is in GRAY state and take action against that.
Especially those communicate thro GATEWAY Servers....
Can you please advise us how to findout the Servers are in GRAY State and Create a report?
November 26th, 2010 4:10am
Hi.
This query from Kevin Holmans collection:
http://blogs.technet.com/b/kevinholman/archive/2007/10/18/useful-operations-manager-2007-sql-queries.aspx might get you started.
SELECT bme.DisplayName,s.LastModified as LastModifiedUTC, dateadd(hh,-5,s.LastModified) as 'LastModifiedCST (GMT-5)'
FROM state AS s, BaseManagedEntity AS bme
WHERE s.basemanagedentityid = bme.basemanagedentityid
AND s.monitorid
IN (SELECT MonitorId FROM Monitor WHERE MonitorName = 'Microsoft.SystemCenter.HealthService.ComputerDown')
AND s.Healthstate = '3' AND bme.IsDeleted = '0'
ORDER BY s.Lastmodified DESC
This powershell script will also get you the agents in gray state
$WCC = get-monitoringclass -name "Microsoft.SystemCenter.Agent"
$MO = Get-MonitoringObject -monitoringclass:$WCC | where {$_.IsAvailable -eq $false}
$MO | select DisplayName
mats.w | www.opsmode.com
Free Windows Admin Tool Kit Click here and download it now
November 26th, 2010 4:34am
Hi.
This query from Kevin Holmans collection:
http://blogs.technet.com/b/kevinholman/archive/2007/10/18/useful-operations-manager-2007-sql-queries.aspx might get you started.
SELECT bme.DisplayName,s.LastModified as LastModifiedUTC, dateadd(hh,-5,s.LastModified) as 'LastModifiedCST (GMT-5)'
FROM state AS s, BaseManagedEntity AS bme
WHERE s.basemanagedentityid = bme.basemanagedentityid
AND s.monitorid
IN (SELECT MonitorId FROM Monitor WHERE MonitorName = 'Microsoft.SystemCenter.HealthService.ComputerDown')
AND s.Healthstate = '3' AND bme.IsDeleted = '0'
ORDER BY s.Lastmodified DESCmats.w | www.opsmode.com
November 26th, 2010 4:41am
Mats,
Its not giving output as excepted... Shows only few servers that they are in GRAY....there are many servers that is GRAY State...
Free Windows Admin Tool Kit Click here and download it now
November 26th, 2010 5:30am
install the latest core mp and it will have this out of the box :)
http://blogs.technet.com/b/kevinholman/archive/2010/11/24/opsmgr-2007-r2-core-mp-s-updated-6-1-7695-0.aspx
hmmm i just realised this is r2 and reading your post more carefully, you say agents stop communicating without a heartbeat failure alert. That in itself is strange enough and i would focus on resolving the issue of not generating an alert when the heartbeat
stops (i remember we had a support call on this, but it fixed itself when we upgraded to r2. maybe a known issue in sp1, i never got an answer on that).
Rob Korving
http://jama00.wordpress.com/
November 26th, 2010 5:31am
I think the SQL query only shows changes from the last 5 hours. Try changing the
(hh,-5,s.LastModified) as 'LastModifiedCST (GMT-5)'
Did you try the Powershell script?mats.w | www.opsmode.com
Free Windows Admin Tool Kit Click here and download it now
November 26th, 2010 5:49am
If you want to see all computers that are grey - this means all computers that have something seriously wrong with the health service watcher.
To identify those - simply use the console - and view a state view for the Health Service Watcher class.
In a report - you can do the same thing - just change the monitor in the query below to the root health service watcher class rollup monitor.Kevin Holman
November 26th, 2010 5:23pm