Commit graph

776 commits

Author SHA1 Message Date
Baptiste Jonglez
fe68fdf54a plume: increase memory again 2025-03-26 20:21:57 +01:00
Baptiste Jonglez
187d36eb9b deploy_nixos: add help to apply changes without rebooting in production 2025-03-26 00:17:59 +01:00
Baptiste Jonglez
fd6275f5bc prod: Fix vim configuration syntax (different between staging and prod due to NixOS version difference) 2025-03-26 00:17:08 +01:00
Baptiste Jonglez
fc88a063b1 node_exporter: avoid using network mode host 2025-03-25 22:21:35 +01:00
Baptiste Jonglez
bb8c9db2ed telemetry: avoid network mode host, and poll less often 2025-03-25 22:12:42 +01:00
451068d716 Merge pull request 'prod: telemetry: Add smartctl_exporter based on staging work' (#53) from prod_smartctl_monitoring into main
Reviewed-on: #53
2025-03-25 21:09:08 +00:00
Baptiste Jonglez
797f946578 prod: telemetry: Add smartctl_exporter based on staging work 2025-03-24 17:53:17 +01:00
Baptiste Jonglez
596b7ab966 prod: telemetry: rename node-exporter job 2025-03-24 17:51:55 +01:00
Baptiste Jonglez
ec1fa3e540 staging: telemetry: Use a init task to create fake disk devices for smartctl_exporter 2025-03-24 17:47:05 +01:00
67230dd60c
guichet now advertise the correct dxfl login command 2025-03-24 16:48:18 +01:00
305c160899
guichet upgrade 2025-03-21 00:27:05 +01:00
Baptiste Jonglez
8d9aa00de5 staging: harden config of smartctl exporter
It currently requires all nodes to have /dev/sda (the device passthrough is hardcoded for now)
2025-03-19 23:46:55 +01:00
Baptiste Jonglez
5790453ff1 nix: Allow all capabilities in Nomad
This will be necessary for the smartctl exporter since it needs Linux
capabilities that are not allowed by default in Nomad.

We only have trusted Nomad jobs, and we already allow privileged
containers anyway, so there is no security impact.
2025-03-19 23:39:04 +01:00
Baptiste Jonglez
a2a470ac3d staging: promote piranha to Nomad server (caribou is dead) 2025-03-19 23:08:49 +01:00
Baptiste Jonglez
2009572fea prod: telemetry: move storage from bespin/scorpio to bespin/corrin 2025-03-12 21:22:56 +01:00
Baptiste Jonglez
8f0a45f03e staging: telemetry: add smartctl exporter 2025-03-12 21:06:56 +01:00
Baptiste Jonglez
b98e72af96 staging: telemetry: Fix metric collection due to faulty Consul connection 2025-03-12 20:51:49 +01:00
Baptiste Jonglez
e805cf5cf6 Augmentation stockage prometheus
La limite actuelle correspond à environ 2 mois d'historique prometheus,
c'est parfois trop peu pour pouvoir relever des tendances sur le long terme.
2025-03-11 23:10:07 +01:00
6b52ccd374 Merge pull request 'upgrade garage to v1.99.1' (#49) from garage-1.99 into main
Reviewed-on: #49
2025-03-09 09:48:50 +00:00
Armaël Guéneau
c5a0577cbf upgrade garage to v1.99.1 2025-03-09 10:44:12 +01:00
Armaël Guéneau
40da5ccca2 nixos config: tweak 2025-03-07 11:43:49 +01:00
Armaël Guéneau
0051891ff0 staging: upgrade garage to v1.99-internal (support for redirections) 2025-03-07 11:43:06 +01:00
Armaël Guéneau
41961df583 woodpecker: change site neptune->corrin 2025-03-01 22:34:55 +01:00
Armaël Guéneau
e61c7449c1 matrix: allow running on site 'corrin' and remove 'neptune' (not a prod site anymore) 2025-03-01 22:32:10 +01:00
8f42b40a57 ajout domaine à la signingtable DKIM 2025-02-26 09:46:59 +01:00
Baptiste Jonglez
93b39af879 Update doc 2025-02-26 00:33:21 +01:00
Baptiste Jonglez
1383c34c39 Increase Plume memory again 2025-02-22 12:55:22 +01:00
9846058387
some tweaks 2025-02-17 20:10:48 +01:00
d8131af3e2
Do not restart garage automatically if config is updated 2025-02-17 20:02:59 +01:00
Armaël Guéneau
b7ce8a2cda staging: upgrade to nixos 24.11 2025-02-15 17:33:49 +01:00
Armaël Guéneau
2f7b846fb0 staging: switch back diplonat & d53 to the docker runner 2025-02-15 17:33:25 +01:00
Armaël Guéneau
5ad325ebed adjust ssh_config 2025-02-11 16:31:17 +01:00
Armaël Guéneau
c33c32b8db upgrade_nixos: upgrade prod to 24.05 2025-02-11 16:19:03 +01:00
Baptiste Jonglez
4bf7f9a76b Update firewall config to avoid timeouts and avoid spamming logs 2025-02-10 22:37:55 +01:00
c4e5ddb24b
Add tip for system.stateVersion 2025-02-10 22:24:08 +01:00
9d4d8ef630
Add staging IPv4 endpoint to ssh config 2025-02-10 21:56:30 +01:00
38929646bf
Move ssh to port 110, update nomad to 1.7 2025-02-10 11:06:24 +01:00
43c46ef041
Update prod grafana to v11.4.1 2025-02-03 08:13:55 +01:00
bfd2ba6ef8
Update grafana to v11.4.1 in staging 2025-02-03 07:28:05 +01:00
f37623821d
add varnish cache between tricot and plume 2025-01-29 08:14:07 +01:00
f21ec907a6 woodpecker runner template: update to v3.0.1 2025-01-27 18:54:29 +01:00
6ef89630ac upgrade woodpecker to v3.0.1 2025-01-27 12:08:33 +01:00
Maxime Lalisse
f14a282a28 ssh keys (stitch) 2025-01-26 12:09:40 +01:00
6c6af54655
Switch back staging telemetry to docker and update stack 2025-01-25 18:50:55 +01:00
f304dec9ce
Update telemetry stack in production 2025-01-25 12:04:59 +01:00
a76df0015e
prod plume: up memory to 1GB 2025-01-19 22:55:27 +01:00
Armaël Guéneau
8e739e152f staging: upgrade to nixos 24.05 2025-01-18 17:01:59 +01:00
Armaël Guéneau
fa0c3203f1 consul config: update deprecated options 2025-01-18 17:01:10 +01:00
Armaël Guéneau
cd629a86e9 update caribou's IPv6 2025-01-18 17:00:43 +01:00
Armaël Guéneau
6416f8a350 update cryptpad to 2024.12.0 2025-01-18 15:08:27 +01:00