Recycle Health Service folder on SCOM Server
Hello,
I am new to SCOM, In our environment we have SCOM 2012 R2 Servers, we had some issue and would like to Recycle Health Service folder on one of the SCOM Server. (SCOM on Seperate server and SQL on Seperate server).
In general which account we need to use to clear the cache of the health service folder on SCOM Server?
action account? DWaccount?Data Reader or Data writer??
One more thing what is the impact if we Recycle the health service folder with normal accounts? other than the above accounts?
Could some one answer to my quires and help me out.
Regards,
Niranjana.
February 4th, 2015 2:45pm
Hi!
You can use any account with local admin permission.
There is no impact. Either you can delete the Health Service State folder or not. If done succesfully, the Health Service will load all the config data from the config service and so from the database.
Cheers,
P
February 4th, 2015 3:28pm
Any account can be used to recycle HS folder. There will not be any change in the impact when used any of the accounts.
Stop the SCOM health service, rename/delete 'health service state' folder, restart SCOM health service.
Agent will load all the recent configuration and will recreate all sub folder under HSS folder and pull in fresh data.
Just ensure that you get "OpsMgrConnector.Config" file in the Connector config cache subfolder anda your agent gets back to healthy state in scom console.
hope this helps.
February 4th, 2015 9:48pm
Local Admin right is enough
1) Stop SCOM Health service/Monitoring Agent service which depend on the version of SCOM agent
2) rename or delete folder "Health Service State" folder
3) Start SCOM Health service/Monitoring Agent service
4) SCOM agent will download updated information from MS into agent health service state folder
Roger
February 4th, 2015 9:58pm
Thanks Patrick / Agarwal,
Do we need to start all the scom services (SDK/Config/SystemCenterManagement)Services. Because recently i have recycled HS folder by restarted all the above 3 services with my credentials which has admin permissions. latter noticed few issues
1. In monitoring pane ==Operationsmanager==managementgrouphealth==Status of the Dataaccess service group turns in to red X.
2. SDK services started but still SCOM throws an error the service is not started /initiated.
3. Had problem with Reports (Alerts and Common Reports).
not sure how to fix, trying to figure out what exactly went wrong. Could you please share your experience on this issue.
Thanks in Advance!
Niranjana
February 4th, 2015 11:42pm
Niranjana,
It's enough to stop Health Service aka Microsoft Monitoring Agent.
1: clear, you stopped Data Access Service and it takes some time to be initialized again.
2: see 1
3: maybe either 1, but don't think. Problems went by?
You're welcome,
Patrick
February 5th, 2015 12:38am
Hello Patrick,
Thank you very much for giving detailed explanation.
I was thinking that i have done some thing wrong, but your reply boost me up.
Could you please provide work arround on how to fix 2 and 3 points.
I had only problem in extracting Alerts and MostCommon alerts report (Blank Data).
Thanks in Advance,
Niranjana
February 5th, 2015 2:33am
Hi!
Sure thing:
Ad 2: When you restart Data Access (SDK) it lasts for a couple of seconds (in some slow environments up to one minute) until the service is initialized. During that time you get the following error:

Ad 3: Well, Most Common Alerts is a good report since it cannot happen that you select wrong group/object. May I ask: did you change the From Date correctly? Is that still a problem? Did you restart SQL Reporting Service and/or DWH DB?
BR, P
February 5th, 2015 3:47am
Thanks a lot for assisting me! Now i am clear with second point.
Regarding point 3 : i have selected the dates correctly and selected the group as "all windows computer" but still getting a blank report. My BAD and strange situation .,The above issue was encountered after recycled health services
(all the 3 services).
Not Restarted SQL reporting services, because we are able to generate other reports with out issues,.
"Alerts" and "Most Common Alerts Report" had issue (Blank Reports) and why other reports are coming with out any issues.
Best Regards,
Niranjana
February 5th, 2015 12:29pm
Ad 3:
Weird... are the other (working) reports either targeting on group All Windows Computers?
And in all reports you're using it as group, not as object.
Can you verify that the group has been calculated correctly and check it under Authoring/Groups, right click on All Windows Computers and Show Members (or similar)?
/patrick
February 5th, 2015 3:21pm
Hello Patrick,
I am sorry, i was on vacation.. I have checked the above points we are able to see all the servers in the groups correctly. Is it because of DATA warehouse DB Corruption? Here in this issue i am still not understanding why only most common alerts and
alerts reports are coming blank with out any data, where in other reports has no issue.
I have gone through many blogs but no solution. Is there any possibility to pull "most common alerts " report from SSRS.. http://xyz/reportserver.. Please let me know if there is any work arround to fix the issue.
Regards,
Niranjana
February 17th, 2015 3:23pm
Hi!
At least the situation is not that bad so that you had to skip vacation ;-)
Could you please run the following queries against your DWH-DB and let me know if you see any output:
To get all raw alert data from the data warehouse to build reports from:
select * from Alert.vAlertResolutionState ars
inner join Alert.vAlertDetail adt on ars.alertguid = adt.alertguid
inner join Alert.vAlert alt on ars.alertguid = alt.alertguid
Raw data core query:
select top 10 *
from Perf.vPerfRaw pvpr
inner join vManagedEntity vme on pvpr.ManagedEntityRowId = vme.ManagedEntityRowId
inner join vPerformanceRuleInstance vpri on pvpr.PerformanceRuleInstanceRowId = vpri.PerformanceRuleInstanceRowId
inner join vPerformanceRule vpr on vpr.RuleRowId = vpri.RuleRowId
Cheers,
Patrick
February 17th, 2015 3:46pm
Please further run the following query and post the figures:
select count(*) from Alert.AlertStage
select count (*) from Event.eventstage
select count (*) from Perf.PerformanceStage
select count (*) from state.statestage
And please check if you see any 31552 events in the event log on your MS.
/patrick
February 19th, 2015 3:05am