Physical Infrastructure Maintenance

This chapter provides some methods for physical infrastructure maintenance. In a production environment, you need to perform maintenance operations by referring to the procedures described in this document. Otherwise, uncontrollable risks will be caused.


Power Maintenance in Equipment Room

Planned Shutdown

To perform planned shutdown in an equipment room, follow these steps:
  1. Stop all running businesses.
  2. Place all primary storages into maintenance mode. After all primary storages enter maintenance mode, make sure that all VM instances are shut down.
  3. Place all hosts into maintenance mode.
  4. Stop the management node (MN). The methods to stop the MN in different scenarios are as follows:
    • In a single MN scenario, run the zstack-ctl stop command to stop the MN.
    • In a multi-MN (host) HA scenario, run the zsha2 stop-node command in each MN to stop the MNs and shut down the zsha2 service.
  5. Verify that no I/O operation is performed in the primary storages of the Cloud, and run the poweroff command on each host to normally shut down all hosts.
  6. Disable the storages, such as the NFS, Ceph, SAN, and SMP storages.
  7. Disable the switches and other hardware facilities.

Planned Power-on

To perform planned power-on in an equipment room, follow these steps:
  1. Power on the switches.
  2. Power on the storages, such as the NFS, Ceph, SAN, and SMP storages.
  3. Power on the server.
  4. Check the service status of the MN and make sure that the MN service is started successfully. The methods to check the MN status in different scenarios are as follows:
    • In a single MN scenario, run the zstack-ctl status command to view the MN status.
    • In a multi-MN (host) HA scenario, run the zsha2 status command in one of the MNs to view the HA status and the MN status.
  5. Log in to the ZStack Cloud, enable all primary storages and hosts and make sure that all hosts and primary storages are in the connected state.
  6. Start the VM instances and resume the businesses.

Recovery from Unexpected Power Outage

The steps to recover from unexpected power outage in an equipment room are the same as those in planned power-on.
  1. Power on the switches.
  2. Power on the storages, such as the NFS, Ceph, SAN, and SMP storages.
  3. Power on the server.
  4. Check the service status of the MN and make sure that the MN service is started successfully. The methods to check the MN status in different scenarios are as follows:
    • In a single MN scenario, run the zstack-ctl status command to view the MN status.
    • In a multi-MN (host) HA scenario, run the zsha2 status command in one of the MNs to view the HA status and the MN status.
  5. Log in to the ZStack Cloud, enable all primary storages and hosts and make sure that all hosts and primary storages are in the connected state.
  6. Start the VM instances and resume the businesses.

Equipment Room Relocation

When you relocate an equipment room, perform maintenance by following these steps:
  1. Power off all power supplies by referring to Planned Shutdown.
  2. Mark the access ports of all switch wiring and hosts.
  3. Pack the server, move the server to the new equipment room, and connect the server.
  4. Power on all power supplies by referring to Planned Power-on.
  5. Check the recovery status of the network, Cloud, and VM instances.

Switch Maintenance

Management Network Switch Maintenance

The maintenance on management network switches might affect the proper running of the business. Please exercise caution. If this kind of maintenance is necessary, we recommend that you perform this operation during off-peak hours. The maintenance procedure is as follows:
  1. Turn off VM high availability in the global settings of the Cloud.
  2. Stop the management node (MN). The methods to stop the MN in different scenarios are as follows:
    • In a single MN scenario, run the zstack-ctl stop command to stop the MN.
    • In a multi-MN (host) HA scenario, run the zsha2 stop-node command in each MN to stop the MN and the zsha2 service.
  3. Adjust or restart related management switches.
  4. Check the IP connectivity of all hardware resources (such as the host, primary storage, and backup storage) at the access end of the switch to ensure that all the management networks in the access end can communicate with each other properly.
    Note: If a node fails to connect to the MN, troubleshot this issue before you proceed with subsequent operations.
  5. Start and check the MN service status to ensure that the MN service starts successfully. The methods to check the MN startup in different scenarios are as follows:
    • In a single MN scenario, run the zstack-ctl start command to start the MN service, and run the zstack-ctl status command to view the MN status.
    • In a multi-MN (host) HA scenario, run the zsha2 start-node command in each MN to start the MN service, and run the zsha2 status command to view the HA status and the MN status.
  6. Make sure that all hosts and primary storages are in the connected state, and VM high availability is enabled in the global settings of the Cloud.

Business Network Switch Maintenance

We recommend that you perform maintenance on business network switches during off-peak hours to avoid affecting important services. The maintenance procedure is as follows:
  1. Place the hosts accessed to the switches into maintenance mode.
  2. Adjust or restart related service switches.
  3. Start a host that is in maintenance mode and check the connectivity. For example, you can use this host to create a test VM instance and check the connectivity between this VM instance and other VM instances. If the connectivity succeeds, the switch interface configuration corresponding to this host is correct and available. If the network is unreachable, continue to check the connectivity.
  4. Repeat Step 3 to check the connectivity in turn to ensure that all business networks are available.

Storage Network Switch Maintenance

The maintenance on storage network switches directly affects all the business. You must stop all business before you perform the maintenance. The maintenance procedure is as follows:
  1. Place all primary storages associated with the storage switches on the Cloud into maintenance mode.
  2. Log in to the UI of Ceph Enterprise and choose Settings > Disable data recovery > Ban.
  3. Adjust or restart related storage switches.
  4. Make sure that the storage networks of all storage nodes are interconnected and reachable.
  5. Log in to the UI of Ceph Enterprise and choose Settings > Enable Data Recovery > Enable.
  6. Enable the primary storages that are in the maintenance mode and make sure that the primary storages are in the connected state.
  7. Start the stopped VM instances.

Back to Top

Download

Already filled the basic info?Click here.

Enter at least 2 characters.
Invalid mobile number.
Enter at least 4 characters.
Invalid email address.
Wrong code. Try again. Send Code Resend Code (60s)

An email with a verification code will be sent to you. Make sure the address you provided is valid and correct.

Download

Not filled the basic info yet? Click here.

Invalid email address or mobile number.

Email Us

contact@zstack.io
ZStack Training and Certification
Enter at least 2 characters.
Invalid mobile number.
Enter at least 4 characters.
Invalid email address.
Wrong code. Try again. Send Code Resend Code (60s)

Email Us

contact@zstack.io
Request Trial
Enter at least 2 characters.
Invalid mobile number.
Enter at least 4 characters.
Invalid email address.
Wrong code. Try again. Send Code Resend Code (60s)

Email Us

contact@zstack.io

The download link is sent to your email address.

If you don't see it, check your spam folder, subscription folder, or AD folder. After receiving the email, click the URL to download the documentation.

The download link is sent to your email address.

If you don't see it, check your spam folder, subscription folder, or AD folder.
Or click on the URL below. (For Internet Explorer, right-click the URL and save it.)

Thank you for using ZStack products and services.

Submit successfully.

We'll connect soon.

Thank you for using ZStack products and services.