Project

General

Profile

Actions

Bug #25252

closed

rudder-cf-serverd was stopped on the server, and nothing restarted it

Added by Nicolas CHARLES 5 months ago. Updated 3 months ago.

Status:
Released
Priority:
N/A
Category:
Server components
Target version:
Severity:
UX impact:
User visibility:
Effort required:
Priority:
0
Name check:
To do
Fix check:
Checked
Regression:
No

Description

some nodes were in error; and process rudder-cf-serverd o nthe server was not running
last logs were

Jul 29 14:03:35 rudder cf-serverd[1677593]: rudder     info: Path does not exist, it's added as-is in access rules: /var/rudder/shared-files/root/
Jul 29 14:03:35 rudder cf-serverd[1677593]: rudder     info: WARNING: this means that (not) having a trailing slash defines if it's (not) a directory!
Jul 29 14:03:35 rudder cf-serverd[1677593]: rudder     info: Failed to canonicalise filename '/var/rudder/shared-files/root/' (realpath: No such file or directory)
Jul 29 14:03:35 rudder cf-serverd[1677593]: rudder     info: Path does not exist, it's added as-is in access rules: /var/rudder/shared-files/root/
Jul 29 14:03:35 rudder cf-serverd[1677593]: rudder     info: WARNING: this means that (not) having a trailing slash defines if it's (not) a directory!
Jul 29 14:03:35 rudder systemd[1]: rudder-cf-serverd.service: Deactivated successfully.
Jul 29 14:03:36 rudder systemd[1]: rudder-cf-serverd.service: Scheduled restart job, restart counter is at 7.
Jul 29 14:03:36 rudder systemd[1]: Stopped CFEngine file server.
Jul 29 14:03:36 rudder systemd[1]: rudder-cf-serverd.service: Start request repeated too quickly.
Jul 29 14:03:36 rudder systemd[1]: rudder-cf-serverd.service: Failed with result 'start-limit-hit'.
Jul 29 14:03:36 rudder systemd[1]: Failed to start CFEngine file server.

running agent didn't change anything
rudder agent check didn't change anything

This component is essential for rudder, it should be started on the rudder server

EDIT: we are not allowed to force a service up if an user chose to stop it. The only case that we need to manage correctly is "if the service crashed, then systemd manage correctly the restart of the service".

Actions #1

Updated by Alexis Mousset 5 months ago

  • Status changed from New to In progress
  • Assignee set to Alexis Mousset
Actions #2

Updated by Alexis Mousset 5 months ago

  • Status changed from In progress to Pending technical review
  • Assignee changed from Alexis Mousset to Félix DALLIDET
  • Pull Request set to https://github.com/Normation/rudder-packages/pull/2899
Actions #3

Updated by Alexis Mousset 5 months ago

  • Status changed from Pending technical review to Pending release
Actions #4

Updated by François ARMAND 3 months ago · Edited

  • Fix check changed from To do to Error - Blocking

EDIT: not correct tests

Actions #5

Updated by François ARMAND 3 months ago

  • Description updated (diff)
  • Fix check changed from Error - Blocking to Checked

I say "check" given the PR, but was not able to reproduce.

Actions #6

Updated by Vincent MEMBRÉ 3 months ago

  • Status changed from Pending release to Released

This bug has been fixed in Rudder 8.1.7 which was released today.

Actions

Also available in: Atom PDF