Baptiste Jonglez
187d36eb9b
deploy_nixos: add help to apply changes without rebooting in production
2025-03-26 00:17:59 +01:00
Baptiste Jonglez
fd6275f5bc
prod: Fix vim configuration syntax (different between staging and prod due to NixOS version difference)
2025-03-26 00:17:08 +01:00
Baptiste Jonglez
fc88a063b1
node_exporter: avoid using network mode host
2025-03-25 22:21:35 +01:00
Baptiste Jonglez
bb8c9db2ed
telemetry: avoid network mode host, and poll less often
2025-03-25 22:12:42 +01:00
451068d716
Merge pull request 'prod: telemetry: Add smartctl_exporter based on staging work' ( #53 ) from prod_smartctl_monitoring into main
Reviewed-on: #53
2025-03-25 21:09:08 +00:00
Baptiste Jonglez
797f946578
prod: telemetry: Add smartctl_exporter based on staging work
2025-03-24 17:53:17 +01:00
Baptiste Jonglez
596b7ab966
prod: telemetry: rename node-exporter job
2025-03-24 17:51:55 +01:00
Baptiste Jonglez
ec1fa3e540
staging: telemetry: Use an init task to create fake disk devices for smartctl_exporter
2025-03-24 17:47:05 +01:00
67230dd60c
guichet now advertises the correct dxfl login command
2025-03-24 16:48:18 +01:00
305c160899
guichet upgrade
2025-03-21 00:27:05 +01:00
Baptiste Jonglez
8d9aa00de5
staging: harden config of smartctl exporter
It currently requires all nodes to have /dev/sda (the device passthrough is hardcoded for now)
2025-03-19 23:46:55 +01:00
Baptiste Jonglez
5790453ff1
nix: Allow all capabilities in Nomad
This will be necessary for the smartctl exporter since it needs Linux
capabilities that are not allowed by default in Nomad.
We only have trusted Nomad jobs, and we already allow privileged
containers anyway, so there is no security impact.
2025-03-19 23:39:04 +01:00
Baptiste Jonglez
a2a470ac3d
staging: promote piranha to Nomad server (caribou is dead)
2025-03-19 23:08:49 +01:00
Baptiste Jonglez
2009572fea
prod: telemetry: move storage from bespin/scorpio to bespin/corrin
2025-03-12 21:22:56 +01:00
Baptiste Jonglez
8f0a45f03e
staging: telemetry: add smartctl exporter
2025-03-12 21:06:56 +01:00
Baptiste Jonglez
b98e72af96
staging: telemetry: Fix metric collection due to faulty Consul connection
2025-03-12 20:51:49 +01:00
Baptiste Jonglez
e805cf5cf6
Increase Prometheus storage
The current limit corresponds to roughly 2 months of Prometheus history,
which is sometimes too little to identify long-term trends.
2025-03-11 23:10:07 +01:00
6b52ccd374
Merge pull request 'upgrade garage to v1.99.1' ( #49 ) from garage-1.99 into main
Reviewed-on: #49
2025-03-09 09:48:50 +00:00
Armaël Guéneau
c5a0577cbf
upgrade garage to v1.99.1
2025-03-09 10:44:12 +01:00
Armaël Guéneau
40da5ccca2
nixos config: tweak
2025-03-07 11:43:49 +01:00
Armaël Guéneau
0051891ff0
staging: upgrade garage to v1.99-internal (support for redirections)
2025-03-07 11:43:06 +01:00
Armaël Guéneau
41961df583
woodpecker: change site neptune->corrin
2025-03-01 22:34:55 +01:00
Armaël Guéneau
e61c7449c1
matrix: allow running on site 'corrin' and remove 'neptune' (not a prod site anymore)
2025-03-01 22:32:10 +01:00
8f42b40a57
add domain to the DKIM signingtable
2025-02-26 09:46:59 +01:00
Baptiste Jonglez
93b39af879
Update doc
2025-02-26 00:33:21 +01:00
Baptiste Jonglez
1383c34c39
Increase Plume memory again
2025-02-22 12:55:22 +01:00
9846058387
some tweaks
2025-02-17 20:10:48 +01:00
d8131af3e2
Do not restart garage automatically if config is updated
2025-02-17 20:02:59 +01:00
Armaël Guéneau
b7ce8a2cda
staging: upgrade to nixos 24.11
2025-02-15 17:33:49 +01:00
Armaël Guéneau
2f7b846fb0
staging: switch back diplonat & d53 to the docker runner
2025-02-15 17:33:25 +01:00
Armaël Guéneau
5ad325ebed
adjust ssh_config
2025-02-11 16:31:17 +01:00
Armaël Guéneau
c33c32b8db
upgrade_nixos: upgrade prod to 24.05
2025-02-11 16:19:03 +01:00
Baptiste Jonglez
4bf7f9a76b
Update firewall config to avoid timeouts and avoid spamming logs
2025-02-10 22:37:55 +01:00
c4e5ddb24b
Add tip for system.stateVersion
2025-02-10 22:24:08 +01:00
9d4d8ef630
Add staging IPv4 endpoint to ssh config
2025-02-10 21:56:30 +01:00
38929646bf
Move ssh to port 110, update nomad to 1.7
2025-02-10 11:06:24 +01:00
43c46ef041
Update prod grafana to v11.4.1
2025-02-03 08:13:55 +01:00
bfd2ba6ef8
Update grafana to v11.4.1 in staging
2025-02-03 07:28:05 +01:00
f37623821d
add varnish cache between tricot and plume
2025-01-29 08:14:07 +01:00
f21ec907a6
woodpecker runner template: update to v3.0.1
2025-01-27 18:54:29 +01:00
6ef89630ac
upgrade woodpecker to v3.0.1
2025-01-27 12:08:33 +01:00
Maxime Lalisse
f14a282a28
ssh keys (stitch)
2025-01-26 12:09:40 +01:00
6c6af54655
Switch back staging telemetry to docker and update stack
2025-01-25 18:50:55 +01:00
f304dec9ce
Update telemetry stack in production
2025-01-25 12:04:59 +01:00
a76df0015e
prod plume: up memory to 1GB
2025-01-19 22:55:27 +01:00
Armaël Guéneau
8e739e152f
staging: upgrade to nixos 24.05
2025-01-18 17:01:59 +01:00
Armaël Guéneau
fa0c3203f1
consul config: update deprecated options
2025-01-18 17:01:10 +01:00
Armaël Guéneau
cd629a86e9
update caribou's IPv6
2025-01-18 17:00:43 +01:00
Armaël Guéneau
6416f8a350
update cryptpad to 2024.12.0
2025-01-18 15:08:27 +01:00
64abdd2210
disable auth media as not all clients are compatible yet
2025-01-17 13:41:55 +01:00