58 Commits

Author SHA1 Message Date
clerie 2775acdb48 hosts/monitoring-3: Migrate secrets to sops 2024-04-21 19:15:15 +02:00
clerie 84b67ee47d hosts/monitoring-3: Enable reloading of prometheus 2024-04-20 17:53:05 +02:00
clerie df6a540281 hosts/monitoring-3: Fix IPv6 listen addr for Grafana 2024-03-27 19:11:02 +01:00
clerie 4f96034838 hosts/monitoring-3: add prometheus job for high frequency pings 2024-03-24 13:03:23 +01:00
clerie 3c42d25ecd hosts/monitoring-3: migrate renamed options 2024-03-24 13:01:32 +01:00
clerie 62dd3b7471 hosts: remove deprecated grub version option 2024-03-19 19:37:43 +01:00
clerie bc35fbb0d6 host/monitoring: Use correct variable in alerting rule description 2024-01-15 18:08:23 +01:00
clerie cff95863fd hosts/monitoring-3: Add alert for Synapse 2023-12-01 17:50:54 +01:00
clerie 226e4198e0 hosts/monitoring-3: add synapse monitoring 2023-11-05 13:36:58 +01:00
clerie 4d4c5eed8c hosts/monitoring-3: add matrix server ping targets 2023-11-04 14:10:51 +01:00
clerie 1ff45a9068 hosts/monitoring-3: remove mail-1 from monitoring 2023-06-28 18:48:44 +02:00
clerie 5270f493b8 hosts/monitoring-3: Make alerting rules more relsilient against missing scrapes 2023-05-28 12:10:45 +02:00
clerie ad137204c3 hosts/monitoring-3: tune altering rules for backups to reduce false positives 2023-05-25 04:33:43 +02:00
clerie b77e9016d7 host/monitoring-3: add rule for backups that are behind 2023-05-24 08:41:35 +02:00
clerie 0393d26e71 flake.nix: update nixos-exporter and use provided modules 2023-05-09 11:56:53 +02:00
clerie 398067f533 hosts/monitoring-3: alert on averaged metrics 2023-05-04 14:43:14 +02:00
clerie e9de141316 hosts/monioring-3: add more ping targets 2023-05-03 16:20:02 +02:00
clerie b60824e796 hosts/monitoring-3: use xmpp password from secrets 2023-05-02 20:27:03 +02:00
clerie dcf8bc4035 modules/monitoring: migrate monitoring vpn secrets to age 2023-05-02 19:42:46 +02:00
clerie d068fea2ce Add ssh public host keys 2023-05-02 10:33:56 +02:00
clerie 882df0098f hosts/monitoring-3: alert for all storage drives when they are full 2023-04-22 18:30:51 +02:00
clerie de8a485779 hosts/monitoring: use correct instance for backup storage monitoring rule 2023-04-18 22:52:52 +02:00
clerie c68004f02e hosts/monitoring-3: add hydra monitoring 2023-04-16 16:01:45 +02:00
clerie 41cd4792a6 hosts/monitoring-3: Replace InstanceUp alert with KernelChanged 2023-03-25 20:42:17 +01:00
clerie 2fd7a4c5aa hosts/monitoring-3: add monitoring of mercury 2023-02-24 23:47:46 +01:00
clerie 9849e4868d hosts/monitoring-3: Use solid-xmpp-alarm 2023-02-06 13:38:16 +01:00
clerie 8d623692c7 hosts/mail-1: Move monitoring config for manually managed host to config directly 2023-02-06 12:51:20 +01:00
clerie 9ee8585716 Replace lib/hosts.nix with an injected special argument containing the nix flake 2023-02-06 12:20:59 +01:00
clerie 8748015acc hosts: remove explicit per host configuration/common import 2023-02-05 21:19:05 +01:00
clerie 38567829f1 hosts/monitoring-3: alert on out of sync host system 2023-02-04 01:15:07 +01:00
clerie 4fffc64c35 hosts/monitoring-3: validate nixos hash versions 2023-02-04 00:57:55 +01:00
clerie 6082fb0744 hosts/monitoring-3: split host config to multiple files 2023-02-03 22:28:50 +01:00
clerie 44148007fc hosts/monitoring-3: update changed option names 2023-02-03 21:23:26 +01:00
clerie 5a387c3c23 hosts/monitoring-3: update dashboard 2023-01-08 15:23:19 +01:00
clerie cfd746fddb Introduce service levels and change alert routing based on this 2023-01-05 23:16:50 +01:00
clerie 30e22dff8d hosts/monitoring-3: use primary fqdn for instance label in prometheus 2023-01-05 22:02:48 +01:00
clerie 1dfba9663a activate NixOS monitoring in prometheus 2023-01-02 21:43:43 +01:00
clerie be5b1c1baf hosts/monitoring-3: move to blackbox monitoring 2022-10-31 22:54:06 +01:00
clerie e9414209f5 hosts/monitoring-3: alert for hosts that just booted 2022-10-02 11:59:37 +02:00
clerie abd589aa73 Alert for full backup storage 2022-09-14 19:38:10 +02:00
clerie cdbe62e788 Alert for hosts that are up for too long 2022-09-11 17:01:24 +02:00
clerie 5ba4163f95 Adding matrix server to monitoring 2022-04-14 21:12:44 +02:00
clerie 588db80877 Add bird to monitoring 2022-03-22 12:16:28 +01:00
clerie 8708e02d35 Add more addresses to ping 2022-02-28 16:33:23 +01:00
clerie c42932db0e Trying out smokeping exporter 2021-12-20 17:49:06 +01:00
clerie 835c5e396e Monitor XMPP Notifications 2021-12-20 16:47:57 +01:00
clerie 4ea5a21103 Resolve monitoring-3 hostname to loopback 2021-12-20 16:37:31 +01:00
clerie 5b4d3bca76 Use correct python environment 2021-12-07 18:29:16 +01:00
clerie b0d64acb33 Increased monitoring rule wait for host down 2021-10-23 18:26:08 +02:00
clerie 3ea21db30b Improve monitoring rules 2021-10-23 18:14:51 +02:00