Skip to content
This repository was archived by the owner on Jun 6, 2024. It is now read-only.
This repository was archived by the owner on Jun 6, 2024. It is now read-only.

Add manual for adding node to existing PAI cluster  #5267

@Binyang2014

Description

@Binyang2014

To add node to existing PAI cluster we need to do:

  1. run ansible-playbook -i ${HOME}/pai-deploy/cluster-cfg/hosts.yml docker-runtime-setup.yml --limit=nodelist to change docker daemon config
  2. run ansible-playbook -i inventory/pai/hosts.yml cluster.yml --become --become-user=root -e "@inventory/gcrv100/openpai.yml" --limit=node list to add node to k8s cluster.
  3. If you encounter cert related error. Please make sure all etcd master node has: /usr/local/bin/etcd-scripts and has /etc/ssl/etcd/openssl.conf. If not, copy it from other etcd masters
  4. After add node to k8s cluster, if you want to remove CNI, please follow doc to remove it
  5. Change PAI layout.yaml to add node to layout.yaml. Change services-configuration.yaml to update hived config
  6. Update pai config by ./paictl config push -p /udpated-config
  7. run ./paictl service start -n cluster-configuration to update config
  8. Restart rest-server, hived, job-exporter

Metadata

Metadata

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions