Closed Bug 1008147 (b-linux64-ix-0010) Opened 11 years ago Closed 11 years ago

b-linux64-ix-0010 problem tracking

Categories

(Infrastructure & Operations Graveyard :: CIDuty, task, P3)

x86_64
Linux

Tracking

(Not tracked)

RESOLVED WONTFIX

People

(Reporter: cbook, Unassigned)

Details

(Whiteboard: [buildduty][buildslaves][capacity])

It seems this slave is not happy -> https://tbpl.mozilla.org/php/getParsedLog.php?id=39349537&tree=Mozilla-Inbound
The log shows: abort: No space left on device
I have disabled this slave in slavealloc.
Taking a look...
Whoops. Carelessly, based on https://bugzilla.mozilla.org/show_bug.cgi?id=1001518#c10, I ran:

[cltbld@bld-linux64-ix-036.build.scl1.mozilla.com ~]$ python tools/buildfarm/maintenance/purge_builds.py -s 20
Deleting ./.repoconfig
Deleting ./.pki
Deleting ./tmp
Deleting ./nthomas
Deleting ./.pip
Deleting ./.mozilla
Deleting ./.ssh
Error: unable to free 20.00 GB of space. Free space only 0.00 GB
[cltbld@bld-linux64-ix-036.build.scl1.mozilla.com ~]$

I think I should have cd'd into /builds/slaves first. Whoops. I should have checked before running. I'll get this slave reimaged, as I've now destroyed it! :(
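The output above suggests the purge ran against the current directory (the home directory) because no target path was given. One way to guard against that kind of slip is a small path check before anything is deleted. The following is only a sketch: `check_purge_target` and the `/builds` root are hypothetical names chosen for illustration and are not part of purge_builds.py.

```python
import os


def check_purge_target(target, allowed_root="/builds"):
    """Return the absolute target path, or raise ValueError if it is outside allowed_root.

    Hypothetical helper for illustration; purge_builds.py is not known to do this.
    """
    target_abs = os.path.abspath(target)
    root_abs = os.path.abspath(allowed_root)
    # commonpath compares whole path components, so "/builds2" does not match "/builds".
    if os.path.commonpath([target_abs, root_abs]) != root_abs:
        raise ValueError(
            "refusing to purge %s: not under %s" % (target_abs, root_abs)
        )
    return target_abs
```

With a check like this, a wrapper could call `check_purge_target(os.getcwd())` before invoking the purge and stop early when started from a home directory.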
So ......... I've recovered these directories:
* /home/cltbld/.repoconfig
* /home/cltbld/.pki
* /home/cltbld/.pip
* /home/cltbld/.ssh
from another slave (bld-linux64-ix-034), so I think a reimage is not necessary. From inspecting the files in these directories, they look generic, so I believe they should be the same on bld-linux64-ix-034 and bld-linux64-ix-036, which is why I opted for a straight copy (via scp).

The /home/cltbld/.mozilla directory was a crash reports directory, so it did not need to be copied. The /home/cltbld/nthomas directory is lost, sorry Nick. Also, /home/cltbld/tmp was presumably created by a release engineer, and has been lost too.

I tried to free up 20GB of space by running:
> python tools/buildfarm/maintenance/purge_builds.py -s 20 /builds/slave

It reported:
> "57.27 GB of space available".

Since this was not quite what I requested (20GB vs. 57GB), I checked to see if this was across multiple mounts:
> [cltbld@bld-linux64-ix-036.build.scl1.mozilla.com ~]$ df -h
> Filesystem  Size  Used  Avail  Use%  Mounted on
> /dev/sda3   227G  158G  58G    74%   /
> tmpfs       3.9G  0     3.9G   0%    /dev/shm
> /dev/sda1   97M   52M   41M    56%   /boot

It might just be that the last directory to be freed was around 40GB(?), and so when it was deleted, the free space suddenly shot up?! The last-to-be-deleted directory was:
> /builds/slave/rel-m-rel-lx_uv_4-000000000000

In any case, the machine now has available disk space again. I will check buildbot is running ok, and reenable in slavealloc...

Pete
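The overshoot guessed at above (20GB requested, about 57GB free afterwards) is what one would expect if whole directories are removed one at a time until the threshold is crossed. The following is a minimal simulation of that hypothesis, not the actual purge_builds.py logic; `purge_until`, the directory names, and the sizes are all made up for illustration.

```python
def purge_until(free_gb, target_gb, dir_sizes_gb):
    """Simulate deleting directories in order until free space reaches target_gb.

    dir_sizes_gb is a list of (name, size_in_gb) pairs; nothing is deleted on disk.
    Returns the resulting free space and the names that would have been removed.
    """
    deleted = []
    for name, size in dir_sizes_gb:
        if free_gb >= target_gb:
            break  # threshold already met, stop deleting
        free_gb += size
        deleted.append(name)
    return free_gb, deleted


# Illustrative numbers only: a 40GB directory is deleted while still below 20GB free.
free, removed = purge_until(0, 20, [("dir-a", 17), ("dir-b", 40), ("dir-c", 5)])
print(free, removed)
```

Because the check happens before each deletion, the final free space can exceed the request by nearly the size of the last directory removed.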
Status: NEW → RESOLVED
Closed: 11 years ago
Resolution: --- → FIXED
Alias: bld-linux64-ix-036 → b-linux64-ix-0010
Summary: bld-linux64-ix-036 problem tracking → b-linux64-ix-0010 problem tracking
Please do not re-enable this slave. We are retiring linux hardware build slaves in bug 1106922.
Blocks: 1106922
Resolution: FIXED → WONTFIX
Product: Release Engineering → Infrastructure & Operations
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard