ITM6 - CTX_RPCError - RPC Error to node ... TDW - WPA Error

iKnow-IT banner

We discovered the CTX_RPCError in our ITM6 environment. See picture below:

ITM6 - CTX_RPCError - RPC Error to node...

The reason for this Critical message was, in this case a reboot action of the client machine.

This client machine collect data and send this information every hour or every day (depending on setting) via the WPA (Warehouse Proxy) to the TDW (Tivoli Data Warehouse). During the sending action, the machine was rebooted and that caused this incident.

See AIX Error report:

IDENTIFIER TIMESTAMP T C RESOURCE_NAME DESCRIPTION
A6DF45AA 0806120115 I O RMCdaemon The daemon is started.
BC3BE5A3 0806120115 P S SRC SOFTWARE PROGRAM ERROR
2BFA76F6 0806120115 T S SYSPROC SYSTEM SHUTDOWN BY USER
9DBCFDEE 0806120115 T O errdemon ERROR LOGGING TURNED ON
192AC071 0806115915 T O errdemon ERROR LOGGING TURNED OFF
6D19271E 0806114215 I O topsvcs Topology Services daemon stopped
28854E81 0806114215 I O grpsvcs Group Services daemon stopped
99FA80C7 0806114215 U S haemd SOFTWARE
AA8AB241 0806114215 T O OPERATOR OPERATOR NOTIFICATION
BC3BE5A3 0806114215 P S SRC SOFTWARE PROGRAM ERROR
EAA3D429 0806114015 U S LVDD PHYSICAL PARTITION MARKED STALE
EAA3D429 0806114015 U S LVDD PHYSICAL PARTITION MARKED STALE
EAA3D429 0806114015 U S LVDD PHYSICAL PARTITION MARKED STALE
EAA3D429 0806114015 U S LVDD PHYSICAL PARTITION MARKED STALE
EAA3D429 0806114015 U S LVDD PHYSICAL PARTITION MARKED STALE
EAA3D429 0806114015 U S LVDD PHYSICAL PARTITION MARKED STALE
....

In the example above the machine was rebooted because there were Stale Partitions in the Volume Group that try to repair with the reboot. Also you can see that after the reboot the local historical data collection file (in this example Disk Performance data) become smaller.

the file

-rw-rw-r--    1 root     system      2213120 Aug 06 12:45 UNIXDPERF 

in the directory : /opt/IBM/ITM/aix526/ux/hist