1
0
Commit Graph

52 Commits

Author SHA1 Message Date
bc35fbb0d6 host/monitoring: Use correct variable in alerting rule description 2024-01-15 18:08:23 +01:00
cff95863fd hosts/monitoring-3: Add alert for Synapse 2023-12-01 17:50:54 +01:00
226e4198e0 hosts/monitoring-3: add synapse monitoring 2023-11-05 13:36:58 +01:00
4d4c5eed8c hosts/monitoring-3: add matrix server ping targets 2023-11-04 14:10:51 +01:00
1ff45a9068 hosts/monitoring-3: remove mail-1 from monitoring 2023-06-28 18:48:44 +02:00
5270f493b8 hosts/monitoring-3: Make alerting rules more relsilient against missing scrapes 2023-05-28 12:10:45 +02:00
ad137204c3 hosts/monitoring-3: tune altering rules for backups to reduce false positives 2023-05-25 04:33:43 +02:00
b77e9016d7 host/monitoring-3: add rule for backups that are behind 2023-05-24 08:41:35 +02:00
0393d26e71 flake.nix: update nixos-exporter and use provided modules 2023-05-09 11:56:53 +02:00
398067f533 hosts/monitoring-3: alert on averaged metrics 2023-05-04 14:43:14 +02:00
e9de141316 hosts/monioring-3: add more ping targets 2023-05-03 16:20:02 +02:00
b60824e796 hosts/monitoring-3: use xmpp password from secrets 2023-05-02 20:27:03 +02:00
dcf8bc4035 modules/monitoring: migrate monitoring vpn secrets to age 2023-05-02 19:42:46 +02:00
d068fea2ce Add ssh public host keys 2023-05-02 10:33:56 +02:00
882df0098f hosts/monitoring-3: alert for all storage drives when they are full 2023-04-22 18:30:51 +02:00
de8a485779 hosts/monitoring: use correct instance for backup storage monitoring rule 2023-04-18 22:52:52 +02:00
c68004f02e hosts/monitoring-3: add hydra monitoring 2023-04-16 16:01:45 +02:00
41cd4792a6 hosts/monitoring-3: Replace InstanceUp alert with KernelChanged 2023-03-25 20:42:17 +01:00
2fd7a4c5aa hosts/monitoring-3: add monitoring of mercury 2023-02-24 23:47:46 +01:00
9849e4868d hosts/monitoring-3: Use solid-xmpp-alarm 2023-02-06 13:38:16 +01:00
8d623692c7 hosts/mail-1: Move monitoring config for manually managed host to config directly 2023-02-06 12:51:20 +01:00
9ee8585716 Replace lib/hosts.nix with an injected special argument containing the nix flake 2023-02-06 12:20:59 +01:00
8748015acc hosts: remove explicit per host configuration/common import 2023-02-05 21:19:05 +01:00
38567829f1 hosts/monitoring-3: alert on out of sync host system 2023-02-04 01:15:07 +01:00
4fffc64c35 hosts/monitoring-3: validate nixos hash versions 2023-02-04 00:57:55 +01:00
6082fb0744 hosts/monitoring-3: split host config to multiple files 2023-02-03 22:28:50 +01:00
44148007fc hosts/monitoring-3: update changed option names 2023-02-03 21:23:26 +01:00
5a387c3c23 hosts/monitoring-3: update dashboard 2023-01-08 15:23:19 +01:00
cfd746fddb Introduce service levels and change alert routing based on this 2023-01-05 23:16:50 +01:00
30e22dff8d hosts/monitoring-3: use primary fqdn for instance label in prometheus 2023-01-05 22:02:48 +01:00
1dfba9663a activate NixOS monitoring in prometheus 2023-01-02 21:43:43 +01:00
be5b1c1baf hosts/monitoring-3: move to blackbox monitoring 2022-10-31 22:54:06 +01:00
e9414209f5 hosts/monitoring-3: alert for hosts that just booted 2022-10-02 11:59:37 +02:00
abd589aa73 Alert for full backup storage 2022-09-14 19:38:10 +02:00
cdbe62e788 Alert for hosts that are up for too long 2022-09-11 17:01:24 +02:00
5ba4163f95 Adding matrix server to monitoring 2022-04-14 21:12:44 +02:00
588db80877 Add bird to monitoring 2022-03-22 12:16:28 +01:00
8708e02d35 Add more addresses to ping 2022-02-28 16:33:23 +01:00
c42932db0e Trying out smokeping exporter 2021-12-20 17:49:06 +01:00
835c5e396e Monitor XMPP Notifications 2021-12-20 16:47:57 +01:00
4ea5a21103 Resolve monitoring-3 hostname to loopback 2021-12-20 16:37:31 +01:00
5b4d3bca76 Use correct python environment 2021-12-07 18:29:16 +01:00
b0d64acb33 Increased monitoring rule wait for host down 2021-10-23 18:26:08 +02:00
3ea21db30b Improve monitoring rules 2021-10-23 18:14:51 +02:00
00caae0ed3 Move rules to dedicated file 2021-10-22 23:53:42 +02:00
4392302eb4 Add alerting to monitoring 2021-10-22 23:21:26 +02:00
1cb3143096 Let prometheus scrape temperature values from iot data 2021-06-20 16:05:37 +02:00
6ee3387680 Add status page to monitoring-3 2021-05-15 18:43:15 +02:00
d8547c2a98 Change monitoring scraping interval to 15s 2021-05-07 16:51:15 +02:00
79dc192662 Feed prometheus from hostconfigs 2021-02-24 00:16:30 +01:00