Upgrade AlmaLinux [and RockyLinux] 9.7 to 9.8 didn't work

NethServer Version: 8
Module:

Sunday i tried to update my server running Nethserver on top of AlmaLinux. Unfortunately i have become a bit lazy with snapshots before updates because everything went very smoothly the last years. After rebooting the server nethserver didn’t start up, i even couldn’t access the cluster-admin page. The log files were flooded with messages that the containers couldn’t get started because of missing env files, but they were where they should be. I tried to figure out what happend, but i’m not experienced enough with podman containers so i decided to try to downgrade AlmaLinux to 9.7 to get the server running again (this time i snapshoted my system). Luckily that worked very easy and after the downgrade the system was running like before.

Think about that before upgrading your AlmaLinux and maybe this effects Rocky as well. Remember to snapshot your system before upgrades, it helps to revert changes, not all downgrades work so flawless.

Now i try to figure out why the upgrade didn’t work and learn about containers.

CoBlau

3 Likes

Hi Corinna,

welcome to NethServer Community!

Thanks for the warning to not upgrade RHEL distros to 9.8.
With Rocky Linux, special NS8 repositories are used to avoid breaking system upgrades.

Great, please share the errors from the log files.
As regards learning about containers, see Howto manage or customize NS8 podman containers

1 Like

Hi Corinna, welcome to our community and thank you for the report!

I’m experiencing a very similar issue on Rocky Linux 9.8. Rootless services such as Traefik fail to start after reboot, making the cluster inaccessible.

The problem is reproducible both after a 9.7 → 9.8 upgrade followed by a reboot and on a fresh Rocky Linux 9.8 installation. In the journal I found messages pointing to systemd failures when starting rootless services:

Jun 03 10:08:45 rocky-linux98 systemd[13089]: Starting Rootless module/traefik1 agent...
Jun 03 10:08:45 rocky-linux98 systemd[13108]: agent.service: Failed at step CHDIR spawning /usr/bin/chmod: No such file or directory
Jun 03 10:08:45 rocky-linux98 systemd[13109]: agent.service: Failed at step CHDIR spawning /usr/local/bin/agent: No such file or directory
Jun 03 10:08:45 rocky-linux98 systemd[13089]: agent.service: Main process exited, code=exited, status=200/CHDIR
Jun 03 10:08:45 rocky-linux98 systemd[13089]: agent.service: Failed with result 'exit-code'.

Although I have not verified this with a downgrade, your observation that the issue disappears on 9.7 points to the systemd update as a possible cause.

Relevant versions:

  • systemd-252-55.el9_7.9.rocky.0.1.x86_64
  • systemd-252-67.el9_8.2.rocky.0.1.x86_64

We’re currently investigating the issue to identify the exact cause. As a precaution, the NS8 Rocky Linux repositories continue to serve Rocky Linux 9.7 packages until this blocker issue is resolved.

As an AlmaLinux user, I recommend staying on 9.7 for the time being and postponing the upgrade to 9.8 until we have identified and fixed the issue.

Edit: bug reference Systemd %S changes in Rocky Linux 9.8 · Issue #8028 · NethServer/dev · GitHub

2 Likes

A downgrade of systemd+deps, followed by a reboot was enough to work around the Traefik service startup error:

[root@rocky-linux-s-2vcpu-2gb-fra1 ~]# dnf history info 12
Transaction ID : 12
Begin time     : Wed 03 Jun 2026 10:18:28 AM UTC
Begin rpmdb    : 3b98cfbe29d25ae1b563693489ff90467bb2f1895f41fb8a4365f582e1941d75
End time       : Wed 03 Jun 2026 10:18:36 AM UTC (8 seconds)
End rpmdb      : 189e549d8944c023ee80dccd8d919aa284f6612b40dc16a6610c7d3cb91a329d
User           : root <root>
Return-Code    : Success
Releasever     : 9
Command Line   : downgrade systemd
Persistence    : Persist
Comment        : 
Packages Altered:
    Downgrade  systemd-252-55.el9_7.9.rocky.0.1.x86_64            @ns-baseos
    Downgraded systemd-252-67.el9_8.2.rocky.0.1.x86_64            @@System
    Downgrade  systemd-libs-252-55.el9_7.9.rocky.0.1.x86_64       @ns-baseos
    Downgraded systemd-libs-252-67.el9_8.2.rocky.0.1.x86_64       @@System
    Downgrade  systemd-pam-252-55.el9_7.9.rocky.0.1.x86_64        @ns-baseos
    Downgraded systemd-pam-252-67.el9_8.2.rocky.0.1.x86_64        @@System
    Downgrade  systemd-resolved-252-55.el9_7.9.rocky.0.1.x86_64   @ns-baseos
    Downgraded systemd-resolved-252-67.el9_8.2.rocky.0.1.x86_64   @@System
    Downgrade  systemd-rpm-macros-252-55.el9_7.9.rocky.0.1.noarch @ns-baseos
    Downgraded systemd-rpm-macros-252-67.el9_8.2.rocky.0.1.noarch @@System
    Downgrade  systemd-udev-252-55.el9_7.9.rocky.0.1.x86_64       @ns-baseos
    Downgraded systemd-udev-252-67.el9_8.2.rocky.0.1.x86_64       @@System
1 Like

The fix passed QA verification on Rocky Linux. The testing commands 1) install a patched core [on node 1] package, 2) upgrade Rocky Linux to version 9.8, 3) reboot the node

api-cli run update-core --data '{"core_url":"ghcr.io/nethserver/core:3.19.1-dev.1","nodes":[1]}' && \
dnf --enablerepo=baseos,appstream --disablerepo=ns-baseos,ns-appstream update -y && \
reboot

Before the fixed version is released, Debian tests must ensure there are no regressions.


Edit 2026-06-05 15:50

Fix released as Core 3.19.1.

:warning: Before upgrading Rocky Linux or AlmaLinux to 9.8, make sure the cluster is updated to Core 3.19.1.

  • Open Cluster Admin → Software Center → Updates
  • Update the Core module to version 3.19.1

Once Core 3.19.1 is installed, it is safe to upgrade the base distribution to Rocky Linux 9.8 or AlmaLinux 9.8 and reboot the system.

See also this Announcements post: NS8 Core 3.19.1 released: required compatibility update for Rocky Linux and AlmaLinux 9.8