As far as I can tell from the rhythm of the logs, this has something to do with “traefik1” and “mail1.” Accordingly, my mail proxy can no longer deliver emails to Dovecot. My email program can no longer establish an SMTP connection to “mail1.”
Below are two significant, recurring log sections.
By the way, the IP address 192.168.118.70 is my Mac (with the web config in the browser). Testing other browsers is difficult because they always display the “Connection unstable” message.
traefik1
2025-03-27T01:09:01+01:00 [1:traefik1:agent@traefik1] task/module/traefik1/098b12c5-a7fc-44c9-b16c-b4e4e6b309a5: get-certificate/20readconfig is starting
2025-03-27T01:09:01+01:00 [1:traefik1:traefik] 192.168.118.170 - - [27/Mar/2025:00:09:01 +0000] “GET /cluster-admin/api/module/traefik1/task/098b12c5-a7fc-44c9-b16c-b4e4e6b309a5/context HTTP/2.0” 200 237 “-” “-” 61724 “cluster-admin-https@file” “http://127.0.0.1:9311” 20ms
2025-03-27T01:09:01+01:00 [1:traefik1:traefik] 192.168.118.170 - - [27/Mar/2025:00:09:01 +0000] “GET /cluster-admin/api/module/traefik1/task/098b12c5-a7fc-44c9-b16c-b4e4e6b309a5/context HTTP/2.0” 200 237 “-” “-” 61725 “cluster-admin-https@file” “http://127.0.0.1:9311” 30ms
2025-03-27T01:09:01+01:00 [1:traefik1:agent@traefik1] task/module/traefik1/098b12c5-a7fc-44c9-b16c-b4e4e6b309a5: action “get-certificate” status is “completed” (0) at step validate-output.json
2025-03-27T01:09:02+01:00 [1:traefik1:traefik] 192.168.118.170 - - [27/Mar/2025:00:09:02 +0000] “GET /cluster-admin/api/module/traefik1/task/098b12c5-a7fc-44c9-b16c-b4e4e6b309a5/context HTTP/2.0” 200 237 “-” “-” 61726 “cluster-admin-https@file” “http://127.0.0.1:9311” 16ms
2025-03-27T01:09:02+01:00 [1:traefik1:traefik] 192.168.118.170 - - [27/Mar/2025:00:09:02 +0000] “GET /cluster-admin/api/module/traefik1/task/098b12c5-a7fc-44c9-b16c-b4e4e6b309a5/context HTTP/2.0” 200 237 “-” “-” 61727 “cluster-admin-https@file” “http://127.0.0.1:9311” 38ms
2025-03-27T01:09:02+01:00 [1:traefik1:traefik] 192.168.118.170 - - [27/Mar/2025:00:09:02 +0000] “GET /cluster-admin/api/module/traefik1/task/098b12c5-a7fc-44c9-b16c-b4e4e6b309a5/status HTTP/2.0” 200 6933 “-” “-” 61728 “cluster-admin-https@file” “http://127.0.0.1:9311” 16ms
2025-03-27T01:09:09+01:00 [1:traefik1:agent@traefik1] task/module/traefik1/d790fa5b-600a-4a1a-b686-2a6f15288f12: get-certificate/20readconfig is starting
2025-03-27T01:09:09+01:00 [1:traefik1:agent@traefik1] task/module/traefik1/d790fa5b-600a-4a1a-b686-2a6f15288f12: action “get-certificate” status is “completed” (0) at step validate-output.json
2025-03-27T01:09:11+01:00 [1:traefik1:traefik] 192.168.118.170 - - [27/Mar/2025:00:09:11 +0000] “GET /cluster-admin/api/module/traefik1/task/d790fa5b-600a-4a1a-b686-2a6f15288f12/context HTTP/2.0” 200 237 “-” “-” 61729 “cluster-admin-https@file” “http://127.0.0.1:9311” 21ms
2025-03-27T01:09:11+01:00 [1:traefik1:traefik] 192.168.118.170 - - [27/Mar/2025:00:09:11 +0000] “GET /cluster-admin/api/module/traefik1/task/d790fa5b-600a-4a1a-b686-2a6f15288f12/context HTTP/2.0” 200 237 “-” “-” 61731 “cluster-admin-https@file” “http://127.0.0.1:9311” 27ms
2025-03-27T01:09:11+01:00 [1:traefik1:traefik] 192.168.118.170 - - [27/Mar/2025:00:09:11 +0000] “GET /cluster-admin/api/module/traefik1/task/d790fa5b-600a-4a1a-b686-2a6f15288f12/context HTTP/2.0” 200 237 “-” “-” 61732 “cluster-admin-https@file” “http://127.0.0.1:9311” 36ms
2025-03-27T01:09:11+01:00 [1:traefik1:traefik] 192.168.118.170 - - [27/Mar/2025:00:09:11 +0000] “GET /cluster-admin/api/module/traefik1/task/d790fa5b-600a-4a1a-b686-2a6f15288f12/context HTTP/2.0” 200 237 “-” “-” 61730 “cluster-admin-https@file” “http://127.0.0.1:9311” 47ms
2025-03-27T01:09:12+01:00 [1:traefik1:traefik] 192.168.118.170 - - [27/Mar/2025:00:09:12 +0000] “GET /cluster-admin/api/module/traefik1/task/d790fa5b-600a-4a1a-b686-2a6f15288f12/status HTTP/2.0” 200 6934 “-” “-” 61733 “cluster-admin-https@file” “http://127.0.0.1:9311” 21ms
2025-03-27T01:09:17+01:00 [1:traefik1:agent@traefik1] task/module/traefik1/a2b694e1-e07e-40c3-a78f-a08835bbd5d8: get-certificate/20readconfig is starting
mail1
2025-03-27T01:23:49+01:00 [1:mail1:postfix] deb833547fee9765d82dd712d5ec6a035483d023988623cddd6a03b132db2f1f
2025-03-27T01:23:49+01:00 [1:mail1:systemd] postfix.service: Consumed 2.240s CPU time.
2025-03-27T01:23:49+01:00 [1:mail1:systemd] postfix.service: Scheduled restart job, restart counter is at 13168.
2025-03-27T01:23:49+01:00 [1:mail1:systemd] Stopped postfix.service - Postfix MTA/MSA server.
2025-03-27T01:23:49+01:00 [1:mail1:systemd] postfix.service: Consumed 2.240s CPU time.
2025-03-27T01:23:49+01:00 [1:mail1:systemd] Starting get-certificate.service - Get TLS certificate from Traefik…
2025-03-27T01:23:51+01:00 [1:mail1:get-certificate] Certificate for mysrv.mydomain.tld is unchanged.
2025-03-27T01:23:51+01:00 [1:mail1:systemd] Finished get-certificate.service - Get TLS certificate from Traefik.
2025-03-27T01:23:51+01:00 [1:mail1:systemd] Starting postfix.service - Postfix MTA/MSA server…
2025-03-27T01:23:52+01:00 [1:mail1:postfix] systemctl --user --quiet is-enabled clamav.service
2025-03-27T01:23:53+01:00 [1:mail1:podman] 2025-03-27 01:23:53.015978317 +0100 CET m=+0.043328821 image pull Package mail-postfix · GitHub
2025-03-27T01:23:53+01:00 [1:mail1:podman]
2025-03-27T01:23:53+01:00 [1:mail1:podman] 2025-03-27 01:23:53.171569653 +0100 CET m=+0.198920194 container create ce32a036bdda900e5a2e3027228081033277db92cff2520bf045e60e869226fe (image=ghcr.io/nethserver/mail-postfix:1.6.0, name=postfix, io.buildah.version=1.33.7, PODMAN_SYSTEMD_UNIT=postfix.service)
2025-03-27T01:23:53+01:00 [1:mail1:podman] 2025-03-27 01:23:53.301344332 +0100 CET m=+0.328694868 container init ce32a036bdda900e5a2e3027228081033277db92cff2520bf045e60e869226fe (image=ghcr.io/nethserver/mail-postfix:1.6.0, name=postfix, io.buildah.version=1.33.7, PODMAN_SYSTEMD_UNIT=postfix.service)
2025-03-27T01:23:53+01:00 [1:mail1:podman] 2025-03-27 01:23:53.324028981 +0100 CET m=+0.351379519 container start ce32a036bdda900e5a2e3027228081033277db92cff2520bf045e60e869226fe (image=ghcr.io/nethserver/mail-postfix:1.6.0, name=postfix, io.buildah.version=1.33.7, PODMAN_SYSTEMD_UNIT=postfix.service)
2025-03-27T01:23:55+01:00 [1:mail1:postfix/postfix-script] the Postfix mail system is not running
2025-03-27T01:23:56+01:00 [1:mail1:postfix/postfix-script] starting the Postfix mail system
2025-03-27T01:23:56+01:00 [1:mail1:postfix] postfix/postlog: starting the Postfix mail system
2025-03-27T01:23:56+01:00 [1:mail1:postfix/master] fatal: bind 0.0.0.0 port 25: Address in use
2025-03-27T01:23:57+01:00 [1:mail1:podman] 2025-03-27 01:23:57.550371387 +0100 CET m=+0.152226873 container remove ce32a036bdda900e5a2e3027228081033277db92cff2520bf045e60e869226fe (image=ghcr.io/nethserver/mail-postfix:1.6.0, name=postfix, PODMAN_SYSTEMD_UNIT=postfix.service, io.buildah.version=1.33.7)
2025-03-27T01:23:57+01:00 [1:mail1:postfix] ce32a036bdda900e5a2e3027228081033277db92cff2520bf045e60e869226fe
2025-03-27T01:23:57+01:00 [1:mail1:systemd] postfix.service: Consumed 1.933s CPU time.
2025-03-27T01:23:57+01:00 [1:mail1:systemd] postfix.service: Scheduled restart job, restart counter is at 13169.
2025-03-27T01:23:57+01:00 [1:mail1:systemd] Stopped postfix.service - Postfix MTA/MSA server.
2025-03-27T01:23:57+01:00 [1:mail1:systemd] postfix.service: Consumed 1.933s CPU time.
2025-03-27T01:23:57+01:00 [1:mail1:systemd] Starting get-certificate.service - Get TLS certificate from Traefik…
2025-03-27T01:23:58+01:00 [1:mail1:get-certificate] Certificate for mysrv.mydomain.tld is unchanged.
2025-03-27T01:23:59+01:00 [1:mail1:systemd] Finished get-certificate.service - Get TLS certificate from Traefik.
2025-03-27T01:23:59+01:00 [1:mail1:systemd] Starting postfix.service - Postfix MTA/MSA server…
2025-03-27T01:24:00+01:00 [1:mail1:podman]
2025-03-27T01:24:01+01:00 [1:mail1:systemd] Started libpod-bc93d8401940578266334063a652e01116445d250d48223b10449b836272f5b3.scope - libcrun container.
2025-03-27T01:24:01+01:00 [1:mail1:podman] 2025-03-27 01:24:01.118615142 +0100 CET m=+0.345231916 container init bc93d8401940578266334063a652e01116445d250d48223b10449b836272f5b3 (image=ghcr.io/nethserver/mail-postfix:1.6.0, name=postfix, PODMAN_SYSTEMD_UNIT=postfix.service, io.buildah.version=1.33.7)
2025-03-27T01:24:01+01:00 [1:mail1:podman] 2025-03-27 01:24:01.143741757 +0100 CET m=+0.370358559 container start bc93d8401940578266334063a652e01116445d250d48223b10449b836272f5b3 (image=ghcr.io/nethserver/mail-postfix:1.6.0, name=postfix, io.buildah.version=1.33.7, PODMAN_SYSTEMD_UNIT=postfix.service)
“mysrv.mydomain.tld” is, of course, a placeholder for my externally accessible domain…
Addendum:
The mail1 container is version 1.6.0
Addendum 2:
BOTH problem servers are running the latest CORE versions of NETH8.
BOTH were restarted about 30 hours ago (for different reasons).
Unfortunately, I can’t say at this point whether the problem occurred with the core update or only after the system reboot (all Debian 12). I believe the core update was before that.
Addendum 3:
I’ve now tried stopping the containers (to restart them), but it doesn’t work for individual containers:
SERVICE=mail1
for userhome in /home/$SERVICE ; do moduleid=$(basename $userhome); echo ${moduleid}; echo systemctl stop user@$(id -u $moduleid); echo; done
And it doesn’t work for all containers either:
for userhome in /home/*[0-9]; do moduleid=$(basename $userhome); echo ${moduleid}; echo systemctl stop user@$(id -u $moduleid); echo; done
Nothing is stopped, nothing is restarted.
Do I need to use different commands in the meantime?
I’d like to keep the state for further analysis, but I also need to access emails again. So, unfortunately, I have to restart now.