557440 - Color depth on -ix- slaves is sometimes too low to run reftests, resulting in failing modules/libpr0n/test/reftest/colordepth.html (and 135 others)

Reporter

Description

•

14 years ago

The first failure in http://tinderbox.mozilla.org/showlog.cgi?log=Firefox/1270530540.1270530972.9262.gz is the one that was added to ensure that the tinderbox trying to run reftests has at least 24 bit color, because the others will fail if it doesn't. So apparently at least mw32-ix-slave14 does not - I didn't look back to see whether other -ix- slaves have successfully run reftests.

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Comment 1

•

14 years ago

http://tinderbox.mozilla.org/showlog.cgi?log=Firefox/1270590569.1270592375.23429.gz
s: mw32-ix-slave02

Timothy Nikkel (:tnikkel)

Comment 2

•

14 years ago

http://tinderbox.mozilla.org/showlog.cgi?log=Firefox3.6/1270594426.1270595184.643.gz
s: mw32-ix-slave15

Lukas Blakk [:lsblakk] use ?needinfo

Comment 3

•

14 years ago

The colour depth is set to 24bits on these slaves, Ben do you have ideas on where else we should be looking into this?

Lukas Blakk [:lsblakk] use ?needinfo

Comment 4

•

14 years ago

The display size is set to 1024 X 640 though, instead of 1280 X 1024 like the win32 VMs.

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Comment 5

•

14 years ago

http://tinderbox.mozilla.org/showlog.cgi?log=Firefox/1270599449.1270599934.14507.gz
WINNT 5.2 mozilla-central opt test reftest on 2010/04/06 17:17:29

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Comment 6

•

14 years ago

(continued from previous comment... oops)
s: mw32-ix-slave15

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Comment 7

•

14 years ago

(In reply to comment #0)
> http://tinderbox.mozilla.org/showlog.cgi?log=Firefox/1270530540.1270530972.9262.gz

(In reply to comment #1)
> http://tinderbox.mozilla.org/showlog.cgi?log=Firefox/1270590569.1270592375.23429.gz
> s: mw32-ix-slave02

(In reply to comment #2)
> http://tinderbox.mozilla.org/showlog.cgi?log=Firefox3.6/1270594426.1270595184.643.gz
> s: mw32-ix-slave15

(In reply to comment #5)
> http://tinderbox.mozilla.org/showlog.cgi?log=Firefox/1270599449.1270599934.14507.gz


I checked these in reftest-analyzer.  They all say:

ERROR: color depth is only 16.

(no longer active)

Comment 8

•

14 years ago

http://tinderbox.mozilla.org/showlog.cgi?log=Firefox/1270599449.1270599934.14507.gz
http://tinderbox.mozilla.org/showlog.cgi?log=Firefox/1270600233.1270600686.16477.gz

Phil Ringnalda (:philor)

Reporter

Comment 9

•

14 years ago

http://tinderbox.mozilla.org/showlog.cgi?log=Firefox/1270599905.1270601418.18629.gz
WINNT 5.2 mozilla-central debug test reftest on 2010/04/06 17:25:05
s: mw32-ix-slave04

Phil Ringnalda (:philor)

Reporter

Updated

•

14 years ago

Summary: Color depth on mw32-ix-slave14 (and other -ix- slaves?) too low to run reftests → Color depth on -ix- slaves is sometimes too low to run reftests, resulting in failing modules/libpr0n/test/reftest/colordepth.html (and 135 others)

bhearsum@mozilla.com (:bhearsum)

Comment 10

•

14 years ago

(In reply to comment #3)
> The colour depth is set to 24bits on these slaves, Ben do you have ideas on
> where else we should be looking into this?

Based on the comments, we should repro in staging, and then try changing the color depth to see if that fixes it.

bhearsum@mozilla.com (:bhearsum)

Comment 11

•

14 years ago

Oh, I'm also wondering why this only came up now. There's been no configuration changes to the machines, and they've been running in production for close to a month. Did the test change in some way?

Phil Ringnalda (:philor)

Reporter

Comment 12

•

14 years ago

colordepth.html hasn't changed since December 2008. What has changed is that fairly recently the machines went from being prioritized for builds to not being allowed to do builds since they were saturating the mpt-castro connection (and if they previously did reftests, and failed once a week in the middle of the night, we might well have just ignored it), and judging by the nagios-spam, a whole lot of restarting of them. I remember back when IT did restarting of tinderboxes, there were explicit instructions about how you could and couldn't connect to them and what to do while restarting, to avoid this exact problem. Is someone maybe, or maybe just sometimes, not doing the current equivalent of those instructions?

bhearsum@mozilla.com (:bhearsum)

Comment 13

•

14 years ago

These machines reboot after every job. They come back up automatically, no intervention involved. I'm not saying it's not possible that the screen depth is an issue, I'm simply wondering why it didn't come up, or wasn't noticed until a month later. If the answer to that is just that it was missed, that's fine, but if the problem truly didn't occur until a few days ago then either the tests have changed, or the machine configuration has changed, or we have the strangest orange we've seen yet.

John O'Duinn [:joduinn] (please use "needinfo?" flag)

Comment 14

•

14 years ago

(In reply to comment #13)
> ... I'm simply wondering why it didn't come up, or wasn't noticed
> until a month later. If the answer to that is just that it was missed, that's
> fine, but if the problem truly didn't occur until a few days ago then either
> the tests have changed, or the machine configuration has changed, or we have
> the strangest orange we've seen yet.

1) Good question, bhearsum. These machines have been in production since early March, if bug#545136 is to be believed. bhearsum/philor: Do we have any examples of this orangeness before a few days ago? That would help us figure out whats going on here.

2) Until this is resolved, are all the win32 ix machines removed from production pool to avoid orange, or are some machines still working correctly in production?

Chris Cooper [:coop] (he/him)

Comment 15

•

14 years ago

Is this happening? Any recent examples that I can jump in and debug?

Phil Ringnalda (:philor)

Reporter

Comment 16

•

14 years ago

No. It happened on Monday and Tuesday of that week, then stopped. It wasn't happening and being ignored before that, and it isn't happening and being ignored after that.

Chris Cooper [:coop] (he/him)

Comment 17

•

14 years ago

(In reply to comment #13)
> the tests have changed, or the machine configuration has changed

Given that neither of these changed and the situation hasn't recurred in 2 weeks, I'm marking this with "badslave?" for tracking and moving on.

Status: NEW → RESOLVED

Closed: 14 years ago

Resolution: --- → WORKSFORME

Whiteboard: [orange] → [orange][badslave?]

Nobody; OK to take it and work on it

Assignee

Updated

•

12 years ago

Keywords: intermittent-failure

Nobody; OK to take it and work on it

Assignee

Updated

•

12 years ago

Whiteboard: [orange][badslave?] → [badslave?]

Nobody; OK to take it and work on it

Assignee

Updated

•

11 years ago

Product: mozilla.org → Release Engineering

Bugzilla

Quick Search

Color depth on -ix- slaves is sometimes too low to run reftests, resulting in failing modules/libpr0n/test/reftest/colordepth.html (and 135 others)

Categories

(Release Engineering :: General, defect)

Tracking

(Not tracked)

People

(Reporter: philor, Unassigned)

References

Details

(Keywords: intermittent-failure, Whiteboard: [badslave?])

Crash Data

Security

(public)

User Story

Description

Comment 1

Comment 2

Comment 3

Comment 4

Comment 5

Comment 6

Comment 7

Comment 8

Comment 9

Updated

Comment 10

Comment 11

Comment 12

Comment 13

Comment 14

Comment 15

Comment 16

Comment 17

Updated

Updated

Updated