New round of disconnects

RESOLVED WORKSFORME

Status

Infrastructure & Operations
NetOps
--
minor
RESOLVED WORKSFORME
8 years ago
5 years ago

People

(Reporter: catlee, Assigned: dmoore)

Tracking

Details

(Whiteboard: [tracking bug])

(Reporter)

Description

8 years ago
Recently we've been having disconnects of slaves, and problems slaves uploading to stage.

This bug will track these new occurrences.

mv-moz2-linux-ix-slave12.build.mozilla.org to stage.mozilla.org:22 at 10:55:36
eg http://tinderbox.mozilla.org/showlog.cgi?log=Firefox/1277266826.1277269662.22081.gz

scp codesize-auto.log stage.mozilla.org:/home/ftp/pub/firefox/tinderbox-builds/mozilla-central-linux/
codesize-auto.log                               0%    0     0.0KB/s   --:-- ETA
codesize-auto.log                              46% 8208KB   8.0MB/s   00:01 ETA
codesize-auto.log                              73%   13MB   7.7MB/s   00:00 ETA
codesize-auto.log                              73%   13MB   6.9MB/s   00:00 ETA
codesize-auto.log                              73%   13MB   6.2MB/s   00:00 ETA
codesize-auto.log                              73%   13MB   5.6MB/s   00:00 ETA
codesize-auto.log                              73%   13MB   5.0MB/s   00:00 ETA
codesize-auto.log                              73%   13MB   4.5MB/s - stalled -
codesize-auto.log                              73%   13MB   4.1MB/s - stalled -
...
codesize-auto.log                              73%   13MB   0.0KB/s - stalled -
codesize-auto.log                              73%   13MB   0.0KB/s - stalled -
Read from remote host stage.mozilla.org: Connection timed out
lost connection
program finished with exit code 1
elapsedTime=936.794230

mv-moz2-linux-ix-slave12 at Tue Jun 22 21:49:39 2010, so Castro -> MPT.
(Assignee)

Updated

8 years ago
Assignee: server-ops → dmoore
(Reporter)

Comment 2

8 years ago
12:42:01 mw32-ix-slave04 disconnected from production-master01
12:40:54 mw32-ix-slave18 disconnected from production-master01
(Reporter)

Comment 3

8 years ago
June 25th, 05:14:40 mw32-ix-slave19:
Read from remote host stage.mozilla.org: Connection reset by peer
lost connection

Updated

8 years ago
Whiteboard: [tracking bug]

Updated

8 years ago
Component: Server Operations → Server Operations: Projects

Updated

8 years ago
Component: Server Operations: Projects → Server Operations: Netops

Comment 4

8 years ago
Nothing since June, inclined to close (and these hosts are moving out of Castro anways).
Status: NEW → RESOLVED
Last Resolved: 8 years ago
Resolution: --- → WORKSFORME
(Reporter)

Comment 5

8 years ago
Just had one today:

Tue Sep 28 15:11:48 2010 w32-ix-slave34 disconnected from buildbot-master1.build.mozilla.org

Want to re-open or file a new bug?
(Reporter)

Comment 10

8 years ago
We suspect most (all?) of the above due to physical bumping of the machines / power / network cables while others are getting moved to Santa Clara.
Product: mozilla.org → Infrastructure & Operations
You need to log in before you can comment on or make changes to this bug.