07 September 2010

OMG !!! I can't monitor new servers in OpsMgr

Recently I bumped into this problem. Apparently every new servers (that is being installed with SCOM monitoring agent) cannot be monitored by SCOM. You can clearly see from the OpsMgr console that I have tons of servers with the status of Not Monitored.

Bad News + Tension = ~non stop sweating~

I checked all my agent communication ports and nothing is wrong. My Management Server event logs doesn't show any significant errors. The only thing I got was heaps of

Event 20070

The OpsMgr Connector connected to (RMS FQDN), but the connection was closed immediately after authentication occured. The most likely cause of this error is that the agent is not authorized to communicate with the server, or the server has not received configuration. Check the event log on the server for the presence of 20000 events, indicating that agents which are not approved are attempting to connect.

Event 21023
OpsMgr has no configuration for management group (managementgroupname) and is requesting new configuration from the Configuration Service.

Event 21402
Operations Manager has discarded 1 items in management group (managementgroupname) , which came from $$ROOT$$. These items have been discarded because no valid route exists at this time. This can happen when new devices are added to the topology but the complete topology has not been distributed yet. The discarded items will be regenerated.

Not long after that, my RMS starts to generate heaps of Warning in the Event Viewer. I get tons of
Event 29106 indicating problem writing data into the OpsMgr database. Somehow I got an impression that something is not right the Management Pack.

I made a query to the OpsMgr database to look for which Management Pack that was affected and apparently something is amiss with the Windows Server 2003 MP in my case.

I quickly look back at the Management Pack and the Windows Server 2003 MP is not updated whilst the Windows Server Operating System library has been updated. Maybe certain objects was removed from the data structure when the new Base OS library was updated but then the Windows Server 2003 MP was not updated to tally with the changes done.

I updated the Windows Server 2003 Management Pack and all is well. The new servers have successfully reported back to the system and problem solved !