Closed Bug 1683392 Opened 5 years ago Closed 5 years ago

High CPU load for "mach puppeteer-test" since using "npm ci"

Tracking

(firefox87 fixed)

Status:

RESOLVED FIXED

Milestone:

87 Branch

Tracking Flags:

Tracking

Status

firefox87

---

fixed

People

(Reporter: whimboo, Assigned: whimboo)

References

(Regression)

Details

(Keywords: regression, Whiteboard: [puppeteer-beta2-mvp])

Attachments

(1 file)

Bug 1683392 - [puppeteer] Use "npm ci" only when tests are run on TaskCluster. 5 years ago Henrik Skupin [:whimboo][⌚️UTC+2] (away 07/21 - 08/08) 48 bytes, text/x-phabricator-request		Details \| Review

Henrik Skupin [:whimboo][⌚️UTC+2] (away 07/21 - 08/08)

Assignee

Description

•

5 years ago

Right now each call to mach puppeteer-test will install puppeteer. This causes lots of CPU usage for both nodejs and especially watchman. The latter runs even a minute after the mach command finished with more than 100% cpu load on my machine.

So we should only install puppeteer when it hasn't been done before, or if files for Puppeteer have been modified since the last time it got installed.

Henrik Skupin [:whimboo][⌚️UTC+2] (away 07/21 - 08/08)

Assignee

Comment 1

•

5 years ago

The problem here has actually been caused by the changes on bug 1670286, which changed the npm install step to npm ci.

Here some timings that do not include the watchman load afterward:
npm ci: 66.26s user 16.74s system 153% cpu 53.982 tota
npm install: 46.21s user 4.49s system 146% cpu 34.507 total

By using npm ci all the installed node modules are getting removed and re-installed. That seems to be unnecessary for doing it for every mach puppteer-test call. Maybe we could do:

If node modules haven't been installed yet, run npm ci.
If pinned modules have been changed, run npm ci.
If options above don't apply run npm install.

Or would install not take care of the pinned versions? Maja, maybe you can still help us here?

Flags: needinfo?(mjzffr)

Keywords: regression

Regressed by: 1670286

Summary: High CPU load for "mach puppeteer-test" because it always installs puppeteer → High CPU load for "mach puppeteer-test" since using "npm ci"

Maja Frydrychowicz :impossibus (was :maja_zf) (needinfo me)

Comment 2

•

5 years ago

I see a spike from watchman with npm ci, too, likely due to the clobber of node_modules, although the spike isn't as noticeable on my machine.

Without watchman, npm ci should actually be faster than npm install because it skips some steps. See https://docs.npmjs.com/cli/v6/commands/npm-ci#description

I think you should probably configure your watchman to ignore changes in node_modules before changing anything in the mach command.

Or you could consider using npm ci in our CI only. I think there's a TASKCLUSTER environment variable that you could check to determine whether the command is running in automation?

(In reply to Henrik Skupin (:whimboo) [⌚️UTC+2] from comment #1)

If node modules haven't been installed yet, run npm ci.

If pinned modules have been changed, run npm ci.

npm ci exits with an error if the dependencies list in the package-lock.file don't match what gets generated from package.json. When package.json changes, you need to run npm install (and this is done automatically when running the vendoring mach command)

Or would install not take care of the pinned versions? Maja, maybe you can still help us here?

npm install writes a new package-lock.json file, which we need to do only when we sync with upstream.

Flags: needinfo?(mjzffr)

Henrik Skupin [:whimboo][⌚️UTC+2] (away 07/21 - 08/08)

Assignee

Comment 3

•

5 years ago

Mike, do we have any rules in where required npm packages need to be installed when running tests via mach? Right now all the packages for the Puppeteer jobs end-up in the source tree, and removing and reinstalling all of these dependencies cause a high CPU load for watchman for me. So I wonder if those packages should better be installed in the obj dir. If that's the preferred location we might also have to instruct the TypeScript compilation step to output the content there.

Flags: needinfo?(mh+mozilla)

Mike Hommey [:glandium]

Comment 4

•

5 years ago

302 dmose

Flags: needinfo?(mh+mozilla) → needinfo?(dmose)

Dan Mosedale (:dmosedale, :dmose)

Comment 5

•

5 years ago

@Henrik: there are no specific rules about where node_modules get installed at this point. In some ideal world, they'd get installed in the objdir, but I'm not aware of any code that currently works that way. I suspect supporting that would be a bunch of work, but I don't know for sure.

Comment 2 proposes:

Or you could consider using npm ci in our CI only. I think there's a TASKCLUSTER environment variable that you could check to determine whether the command is running in automation?

This seems like The Right Thing, and it also sounds like it should be easy to implement. Why not just do this?

Flags: needinfo?(dmose) → needinfo?(standard8)

Mark Banner (:standard8)

Comment 6

•

5 years ago

I guess I'm needinfo'ed for ideas. On the ESLint front, we package node_modules into a zip file and upload that to the build infrastructure. It then gets included in the docker images, and used as necessary. This also means the builders have a defined version available and is reproducible, it also means that we're not dependent on the nodejs infrastructure (though it doesn't normally cause a problem).

We are looking to move the top level node_modules into the source tree, however that vendoring work is still in progress as there's various policies to be defined around it.

Flags: needinfo?(standard8)

Henrik Skupin [:whimboo][⌚️UTC+2] (away 07/21 - 08/08)

Assignee

Comment 7

•

5 years ago

Thanks for the feedback Mark! So we actually want a quick fix here, and using the build infra for that seems to be a bit of work involved. As such I would stick with keeping the node modules where they are right now and use an environment variable.

I discussed that with Ben Hearsum on Matrix, and as result we should better go with our own environment variable set in the Taskcluster configuration: https://searchfox.org/mozilla-central/source/taskcluster/ci/source-test/remote.yml

Reason is that we do not want to depend on a variable that might change at any time, and would cause side-effects.

Assignee: nobody → hskupin

Status: NEW → ASSIGNED

Henrik Skupin [:whimboo][⌚️UTC+2] (away 07/21 - 08/08)

Assignee

Comment 8

•

5 years ago

Actually I will make use of a --ci command line argument, which will be better observable when running mach puppeteer-test --help.

Henrik Skupin [:whimboo][⌚️UTC+2] (away 07/21 - 08/08)

Assignee

Comment 9

•

5 years ago

Attached file Bug 1683392 - [puppeteer] Use "npm ci" only when tests are run on TaskCluster. — Details

This patch adds a new "--ci" argument for the
"mach puppeteer-test" command. As such it can
also be used locally to simulate a test job in CI.

Pulsebot

Comment 10

•

5 years ago

Pushed by hskupin@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/d2f8cb04ce26 [puppeteer] Use "npm ci" only when tests are run on TaskCluster. r=remote-protocol-reviewers,jdescottes

Andreea Pavel [:apavel]

Comment 11

•

5 years ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/d2f8cb04ce26

Status: ASSIGNED → RESOLVED

Closed: 5 years ago

status-firefox87: --- → fixed

Resolution: --- → FIXED

Target Milestone: --- → 87 Branch

You need to log in before you can comment on or make changes to this bug.

Bugzilla

High CPU load for "mach puppeteer-test" since using "npm ci"

Categories

(Remote Protocol :: Agent, defect, P2)

Tracking

(firefox87 fixed)

People

(Reporter: whimboo, Assigned: whimboo)

References

(Regression)

Details

(Keywords: regression, Whiteboard: [puppeteer-beta2-mvp])

Crash Data

Security

(public)

User Story

Attachments

(1 file)

Description

Comment 1

Comment 2

Comment 3

Comment 4

Comment 5

Comment 6

Comment 7

Comment 8

Comment 9

Comment 10

Comment 11

Attachment

General

Description

File Name

Content Type