Under Construction
Support for (Simplified) Remote Restart
In the case of problems with a managed system, or the failure of a managed system, the remote restart (RR) or simplified remote restart (SRR) feature makes it possible to restart one or more LPARs from such a system onto another managed system. An LPAR with the identical configuration is created on the target managed system and then started. If a managed system fails, this enables a quick restart of important LPARs onto another, working hardware.
View the remote restart capabilities of one or more HMCs with hmc lsrr:
$ hmc lsrr hmc01 hmc02
HMC AUTO_CLEANUP_ENABLED
hmc01 1
hmc02 1
$
Currently, auto_cleanup_enabled is the only capability supported by HMCs. After a remote restart operation, this automatically carries out a cleanup on the original managed system as soon as it is available again.
Whether a managed system supports remote restart (RR) or simplified remote restart (SRR), as well as how many such operations are currently being executed, can be displayed using the “ms lsrr” command:
$ ms lsrr ms11 ms13 ms15 ms17 ms18
NAME RR SRR MAX_NUM_RR RR_IN_PROGRESS
ms11 0 0 0 0
ms13 1 0 32 0
ms15 1 1 32 0
ms17 1 1 32 0
ms18 0 1 32 0
$
Note: New power systems usually only support simplified remote restart.
You can determine whether one (or more) LPAR(s) support remote restart using “lpar lsrr“:
$ lpar -m ms18 lsrr
LPAR RR SRR MS STATE
aixtsmp104 0 0 ms18 null
aixtsmp902 0 0 ms18 null
aixtsmp904 0 0 ms18 null
aixtsmp908 0 0 ms18 null
saptest1 0 1 ms18 Remote Restartable
ms18-vio1 0 0 ms18 null
ms18-vio2 0 0 ms18 null
$
A remote restart or simplified remote restart operation can be started manually using the “lpar restart” command. Normally this is used in the event of problems or failure of a managed system:
$ lpar restart saptest1 ms17
> saptest1
$
The LPAR is automatically restarted. As soon as the initial managed system is accessible again, the HMC will remove the source LPAR that is still there (if auto_cleanup_enabled is turned on).