Bug #7189

closed

issues with process management on physical hosting LXC containers

Added by LibrA LinuX over 8 years ago. Updated over 8 years ago.

Status: Released
Priority: 2
Assignee: Matthieu CERDA
Category: Packaging

Description

I seem to have a process issue similar to Bug #4498. I have several physical servers, each with rudder-agent installed and each hosting several LXC containers that also have rudder-agent installed.
Every 5 minutes, I receive the stdout of the Rudder cron job:
WARNING: No disable file detected and no CFEngine process neither. Relaunching CFEngine processes... Done

I think the problem comes from the physical host, which sees all cf-execd and cf-agent processes (its own and those of the LXC containers).
When the cron job runs on the physical host, it finds too many CFEngine processes running, so it kills all of them before restarting only its local processes.
As a result, it also kills all the CFEngine processes of the LXC containers...

    if [ -e /opt/rudder/bin/check-rudder-agent ]; then /opt/rudder/bin/check-rudder-agent; else if [ ! -e /opt/rudder/etc/disable-agent -a `ps -efww | grep -E "(cf-execd|cf-agent)" | grep -E "/var/rudder/cfengine-community/bin/(cf-execd|cf-agent)" | grep -v grep | wc -l` -eq 0 ]; then /var/rudder/cfengine-community/bin/cf-agent -f failsafe.cf >/dev/null 2>\&1 \&\& /var/rudder/cfengine-community/bin/cf-agent >/dev/null 2>\&1; if [ $? != 0 ]; then if [ -f /opt/rudder/etc/rudder-restart-message.txt ]; then cat /opt/rudder/etc/rudder-restart-message.txt; else echo "Rudder agent was unable to restart on $(hostname)."; fi; fi; fi; fi
    WARNING: Too many instance of CFEngine cf-execd processes running. Killing them... Done
    WARNING: No disable file detected and no CFEngine process neither. Relaunching CFEngine processes...
    Done
    root@libra1 ~#
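
One way the host-side count could ignore container processes is to compare PID namespaces. The following is only a sketch of that idea, not the fix that was released for this bug. It assumes a Linux kernel that exposes /proc/<pid>/ns/pid (3.8 or later), that it runs as root (as the cron job does), and that the agent binaries are started with their full path as in the grep above. The names PROC_ROOT and count_local_cfengine are hypothetical.

```shell
#!/bin/sh
# Sketch only: count the CFEngine processes that live in the caller's own
# PID namespace, so that processes belonging to LXC containers are ignored.
# PROC_ROOT and count_local_cfengine are hypothetical names for illustration.

PROC_ROOT="${PROC_ROOT:-/proc}"
CFE_BIN="/var/rudder/cfengine-community/bin"

count_local_cfengine() {
    # Namespace identifier of the calling process, e.g. "pid:[4026531836]".
    own_ns=$(readlink "$PROC_ROOT/self/ns/pid") || return 1
    count=0
    for dir in "$PROC_ROOT"/[0-9]*; do
        [ -r "$dir/cmdline" ] || continue
        # Skip processes that run in another PID namespace (containers).
        [ "$(readlink "$dir/ns/pid" 2>/dev/null)" = "$own_ns" ] || continue
        # cmdline is NUL-separated; the first field is the executable path.
        exe=$(tr '\0' '\n' 2>/dev/null < "$dir/cmdline" | head -n 1)
        case "$exe" in
            "$CFE_BIN/cf-execd"|"$CFE_BIN/cf-agent")
                count=$((count + 1)) ;;
        esac
    done
    echo "$count"
}
```

Calling count_local_cfengine on the physical host would then report only the host's own cf-execd and cf-agent processes, which is the number the "too many instances" and "no CFEngine process" checks presumably intend to look at.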

Related issues: 4 (0 open, 4 closed)

Related to Rudder - Bug #4498: Several issues with process management on Proxmox host (and container) (Rejected)
Related to Rudder - Bug #7335: check-rudder-agent silently fails if namespaces are not supported (Released, Vincent MEMBRÉ, 2015-10-30)
Related to Rudder - Bug #7338: All reports are missing (totally orange) for a node due to multiple cf-execd processes (Released, Nicolas CHARLES, 2015-10-30)
Related to Rudder - Bug #7381: Process management issues on nodes hosting LXC containers (Released, Alexis Mousset)