infrastructure/os/config
2020-10-28 17:07:55 +01:00
..
group_vars/all Refactor 2 2020-09-12 20:17:07 +02:00
roles Switch Matrix+Plume to IPv6, Add Trusted Net to ip6tables 2020-10-28 16:55:11 +01:00
cluster_nodes.yml Refactor 2 2020-09-12 20:17:07 +02:00
production.yml Reintroduce resolv.conf, it is needed + change DNS from FDN (broken) to Free 2020-10-22 20:22:57 +02:00
README.md Add some doc 2020-10-28 17:07:55 +01:00
README.more.md Refactor 2 2020-09-12 20:17:07 +02:00
site.yml Refactor 2 2020-09-12 20:17:07 +02:00

ANSIBLE

How to proceed

For each machine, one by one do:

  • Check that cluster is healthy
    • Check gluster
      • sudo gluster peer status
      • sudo gluster volume status all (check Online Col, only Y must appear)
    • Check that Nomad is healthy
      • nomad server members
      • nomad node status
    • Check that Consul is healthy
      • consul members
    • Check that Postgres is healthy
  • Run ansible-playbook -i production.yml --limit <machine> site.yml
  • Run nomad node drain -enable -force -self
  • Reboot
  • Run nomad node drain -self -disable
  • Check that cluster is healthy