818968 - B2G emulator reftests should be runnable on AWS

Ed Morley [:emorley]

Reporter

Description

•

12 years ago

...to reduce the impact on linux32 wait times.

John O'Duinn [:joduinn] (please use "needinfo?" flag)

Comment 1

•

12 years ago

The original idea of moving to linux64 was to help offload our physical linux32 slaves. However now that we have some linux32 tests running on AWS in production, we'd prefer to run B2G tests on AWS also. From irc, seems this has been discussed already, and some work already in progress, so I've modified summary to match reality.

ahal, per ctalbert, I think this should be assigned to you, as you are working on this? If not, please kick it back to me!

Assignee: nobody → ahalberstadt

Summary: B2G emulator tests should also run on Linux64 slaves, not just Linux32 → B2G emulator tests should be runnable on AWS

Andrew Halberstadt [:ahal]

Comment 2

•

12 years ago

Yep, the B2G tests are running on the Ubuntu 32 pool on Cedar already. There's quite a few regular failures/crashes though. I have a loaner VM currently and am trying to figure out why. I suspect we'll either have to disable the crashing tests, or try running these on the 64 bit pool with ia32-libs installed (as it seemed to work fine there when I tested it).

Status: NEW → ASSIGNED

Andrew Halberstadt [:ahal]

Comment 3

•

12 years ago

The good news is I can reproduce the results we see on cedar. The bad news is I see:
A) Intermittent socket connection failures
B) Intermittent b2g process crashes (with varying logcat output accompanying them)
C) Reliable emulator crashes on certain tests

Rail, how much work would it be to re-image the Ubuntu 64 pool with a few additional libraries? I'd hate to make you do a bunch of extra work only to run into the same problems though.

Flags: needinfo?(rail)

Rail Aliiev [:rail]

Comment 4

•

12 years ago

We don't need to reimage the whole farm, puppet will install needed packages after a reboot.

Flags: needinfo?(rail)

Rail Aliiev [:rail]

Updated

•

12 years ago

Blocks: 837268

John O'Duinn [:joduinn] (please use "needinfo?" flag)

Comment 5

•

12 years ago

(In reply to Andrew Halberstadt [:ahal] from comment #2)
> Yep, the B2G tests are running on the Ubuntu 32 pool on Cedar already.
> There's quite a few regular failures/crashes though. I have a loaner VM
> currently and am trying to figure out why.
>
> I suspect we'll either have to disable the crashing tests, or try running
> these on the 64 bit pool with ia32-libs installed (as it seemed to work fine
> there when I tested it).

Not a requirement, but my preference would be just run the tests on linux32, instead of running on linux64-with-32bit-libraries installed. Somehow it sounds more stable! :-)

(In reply to Andrew Halberstadt [:ahal] from comment #3)
> The good news is I can reproduce the results we see on cedar.

cool!

> The bad news is I see:
> A) Intermittent socket connection failures
> B) Intermittent b2g process crashes (with varying logcat output accompanying
> them)
> C) Reliable emulator crashes on certain tests

Are these intermittent problems different to any intermittent problems we already see on physical linux32 test machines? Where can we see details of these 3 crashes/failures?

Andrew Halberstadt [:ahal]

Comment 6

•

12 years ago

(In reply to John O'Duinn [:joduinn] from comment #5)
> Not a requirement, but my preference would be just run the tests on linux32,
> instead of running on linux64-with-32bit-libraries installed. Somehow it
> sounds more stable! :-)

Well the odd thing is that I originally tested on the 64 bit VM and they all worked perfectly fine!

> Are these intermittent problems different to any intermittent problems we
> already see on physical linux32 test machines?

Nope, these problems do not exist on the fedora-32 pool

> Where can we see details of these 3 crashes/failures?

Sorry, should have pasted some logs (there is a logcat dump after the failure):
https://tbpl.mozilla.org/php/getParsedLog.php?id=19668211&tree=Cedar&full=1
https://tbpl.mozilla.org/php/getParsedLog.php?id=19668227&tree=Cedar&full=1
https://tbpl.mozilla.org/php/getParsedLog.php?id=19668419&tree=Cedar&full=1
https://tbpl.mozilla.org/php/getParsedLog.php?id=19668897&tree=Cedar&full=1
https://tbpl.mozilla.org/php/getParsedLog.php?id=19641485&tree=Cedar&full=1

Sadly each of those logs is a slightly different failure (though I imagine at least some of them share the same root cause).

Andrew Halberstadt [:ahal]

Comment 7

•

12 years ago

(In reply to Rail Aliiev [:rail] from comment #4)
> We don't need to reimage the whole farm, puppet will install needed packages
> after a reboot.

So do you think we could try this? Or should we really push to get them working on 32 bit? I'm not really sure how to solve the problems in those logs. It isn't a missing package/dependency problem as the tests run and sometimes even pass.

I can try disabling tests but due to the intermittent nature of the failures I think we might just hit them on new tests once I do. Also disabling tests is as much of a band-aid solution as switching to 64 bit machines.

Rail Aliiev [:rail]

Comment 8

•

12 years ago

Before we try to run those on a 64-bit platform, I'm going to add some 32-bit test VMs with a different kernel. Maybe it helps...

Andrew Halberstadt [:ahal]

Comment 9

•

12 years ago

The generic kernel didn't seem to do anything. Can we try installing ia32-libs and running on the 64 bit pool?

Rail Aliiev [:rail]

Comment 10

•

12 years ago

I'll test how the 64-bit slaves behave with ia32-libs installed first, then we can try this option.

Rail Aliiev [:rail]

Updated

•

12 years ago

Depends on: 843179

John O'Duinn [:joduinn] (please use "needinfo?" flag)

Updated

•

12 years ago

Depends on: 843100

Andrew Halberstadt [:ahal]

Updated

•

12 years ago

Depends on: 843201

Andrew Halberstadt [:ahal]

Comment 11

•

12 years ago

These are running on the 64 bit slaves now on cedar and already look much better. There was 1 failure in mochitest-1, but the other 8 chunks and the marionette tests were green. I re-triggered them a few times to see if they're stable or not.

John O'Duinn [:joduinn] (please use "needinfo?" flag)

Comment 12

•

12 years ago

any news?

Rail Aliiev [:rail]

Comment 13

•

12 years ago

From what I see on cedar (https://tbpl.mozilla.org/?tree=Cedar&rev=81d021bb66df) we have all green opt and all orange (except X) for debug.

Rail Aliiev [:rail]

Updated

•

12 years ago

Blocks: 844989

Andrew Halberstadt [:ahal]

Comment 14

•

12 years ago

I'm no longer working on this now that mochitests, marionette and xpcshell tests are running on the ubuntu 64 vm's. However there's still:

1. The failures on the 32 bit pool
2. Debug failures on both
3. Reftests

So we can leave the bug open for those.

Assignee: ahalberstadt → nobody

Status: ASSIGNED → NEW

Rail Aliiev [:rail]

Updated

•

12 years ago

Blocks: 850105

cmtalbert

Comment 15

•

12 years ago

(In reply to Andrew Halberstadt [:ahal] from comment #14)
> I'm no longer working on this now that mochitests, marionette and xpcshell
> tests are running on the ubuntu 64 vm's. However there's still:
>
> 1. The failures on the 32 bit pool

Totally not certain why we need to run emulators on both 64bit OS's and 32bit OS's. Running them *only* on 64bit OS's should be plenty. If they run there and are stable, then let's save ourselves the headache. Can anyone tell me why we should run them on 32bit OS's?

> 2. Debug failures on both

Yes, we need to fix these, I don't see bugs filed for this. Do they exist?

> 3. Reftests

Likewise, I don't see bugs here. Reftests actually use physical hardware for some tests. How many can we actually put in AWS? For things we can't get in AWS can we run them on Ubuntu ix machines?

>
> So we can leave the bug open for those.

Ed Morley [:emorley]

Reporter

Comment 16

•

12 years ago

(In reply to Clint Talbert ( :ctalbert ) from comment #15)
> Totally not certain why we need to run emulators on both 64bit OS's and
> 32bit OS's. Running them *only* on 64bit OS's should be plenty. If they run
> there and are stable, then let's save ourselves the headache. Can anyone
> tell me why we should run them on 32bit OS's?

Added capacity of being able to hand out B2G emulator tasks to either 32bit or 64bit machines, iiuc.

Chris AtLee [:catlee]

Comment 17

•

12 years ago

(In reply to Ed Morley [:edmorley UTC+1] from comment #16)
> (In reply to Clint Talbert ( :ctalbert ) from comment #15)
> > Totally not certain why we need to run emulators on both 64bit OS's and
> > 32bit OS's. Running them *only* on 64bit OS's should be plenty. If they run
> > there and are stable, then let's save ourselves the headache. Can anyone
> > tell me why we should run them on 32bit OS's?
>
> Added capacity of being able to hand out B2G emulator tasks to either 32bit
> or 64bit machines, iiuc.

I don't know that we should be trying to do that. It introduces another source of platform differences and possible test failures. We also can add more EC2 machines as we need to, so it's not like we're constrained by 64-bit capacity.

Ed Morley [:emorley]

Reporter

Comment 18

•

12 years ago

Oh ha yeah true. I'm still in the 'fedora physical machine and limited in number' mentality :-)

[checked-in] configure Elm to run b2g reftests on Fedora and Ubuntu (Armen [:armenzg], 11 years ago; 3.93 KB, patch; rail: review+)
show builders differences after attachment 8374088 (Armen [:armenzg], 11 years ago; 17.38 KB, patch; rail: feedback+)
[checked-in] enable b2g reftests on EC2 for m-i (Armen [:armenzg], 11 years ago; 1.34 KB, patch; bhearsum: review+)
enable b2g reftests on EC2 across the board (Armen [:armenzg], 11 years ago; 6.83 KB, patch; rail: review+)
change of list of builders from attachment 8393664 (Armen [:armenzg], 11 years ago; 24.35 KB, text/plain; rail: feedback+)
disable failures on b2g28, pass 1 (Andrew Halberstadt [:ahal], 11 years ago; 29.50 KB, patch; jgriffin: review+)
disable failures on b2g26, pass 1 (Andrew Halberstadt [:ahal], 11 years ago; 11.80 KB, patch; jgriffin: review+)
[checked-in] disable some chunks for b2g26 and some for b2g28t (Armen [:armenzg], 11 years ago; 1.77 KB, patch; rail: review+)