Closed Bug 1073907 Opened 10 years ago Closed 10 years ago

High sys% cpu usage on node[1-30].peach.metrics.scl3.mozilla.com with auditd enabled

Categories

(Data & BI Services Team :: Data Warehouse, task)

Operational Work
x86_64
Linux
task
Not set
major

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: tmary, Assigned: tmary)

Details

Attachments

(3 files)

Please review attached graph which shows the effect of disabling auditing (specifically: 'auditctl -D') on a IO intensive job. 'auditctl -D' was issued at ~01:54 UTC and 'service auditd restart' was issued at ~02:05 UTC. - Running 'perf top' on one of these hosts seemed to indicate that 'get_task_cred' and 'audit_filter_rules' were the culprits. --
sys% increases as soon as the IO intensive job starts running at ~00:00 (UTC). The job finished at ~01:40 (UTC)
Re the outliers in Comment#1: node26 -> configured to not run jobs, auditing is enabled node25 -> configured to run jobs, auditing is disabled (auditctl -D) --
Attached file Job details
Jobs being referred to in Comment#0 and Comment#2 are same - it is a job which reads FHR data from HBase and writes to HDFS. Re #s, it reads (and decompresses) approximately 7TB of compressed data, writes approximately 21TB of compressed data --
Group: metrics-private
sys% down by half after pruning auditing rules via BUG 1074478 - TY kang@, digi@ --
Status: NEW → RESOLVED
Closed: 10 years ago
Resolution: --- → FIXED
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: