Open Bug 1516575 Opened Last year Updated 15 days ago
Protect workers against the OOM killer
as has been seen with the landing of bug 1516374 yesterday, workers can end up killed under OOM conditions, making jobs fail as "claim-expired" without any logs. It would be better if the worker wasn't killed, giving it a chance to report an actual failure with OOM messages. It is possible to protect processes against the OOM killer with: echo -17 > /proc/pid/oom_adj. (per https://linux-mm.org/OOM_Killer) It seems we should protect the worker processes this way.
Component: General → Worker
QA Contact: pmoore
You need to log in before you can comment on or make changes to this bug.