-
Notifications
You must be signed in to change notification settings - Fork 2
Open
Description
Too many sequential calls to realm permit
can cause a race condition where realmd tries to restart sssd multiple times. Systemd will detect it as failure and can't start it up.
That means NO ONE can log into the machine, it is dead to ssh on the network.
We need some way to recover from this. Suggestions:
- Plant something in a systemd unit override to ensure that sssd is always running, via
Restart=always
or similar - Hosting support should probably implement this themselves
- A root cron that just looks for sssd running every minute and starts it if not
What we cannot do:
Restart sssd with ansible. You can't login to begin with.
Edit: see also (courtesy of colin) SSSD/sssd#6219
Metadata
Metadata
Assignees
Labels
No labels