Bug 606897 (Open): Opened 13 years ago, updated 6 months ago

Profiling makes us much slower on the Celtic Kane Conway benchmark


(Core :: JavaScript Engine, defect)




Tracking Status
blocking2.0 --- -


(Reporter: bzbarsky, Unassigned)



(Keywords: perf)


(1 file)

Attached file Testcase
The attached shell testcase is more or less a copy of the Conway benchmark at <>.  The number it prints is the score; higher is better.
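For readers unfamiliar with the benchmark: it is a Conway's Game of Life simulation. A minimal sketch of one simulation step is below; the names and structure here are illustrative only and differ from the attached testcase in detail.

```javascript
// Sketch of one Conway Game of Life step over a dense 2-D array of 0/1.
// Hypothetical names; the real benchmark code is in the attachment.
function step(grid) {
  var h = grid.length, w = grid[0].length;
  var next = [];
  for (var y = 0; y < h; y++) {
    next.push(grid[y].slice()); // copy the row
    for (var x = 0; x < w; x++) {
      // Count the live neighbors of cell (y, x).
      var n = 0;
      for (var dy = -1; dy <= 1; dy++) {
        for (var dx = -1; dx <= 1; dx++) {
          if (dy === 0 && dx === 0) continue;
          var yy = y + dy, xx = x + dx;
          if (yy >= 0 && yy < h && xx >= 0 && xx < w && grid[yy][xx]) n++;
        }
      }
      // Standard Life rules: a live cell survives with 2 or 3 neighbors;
      // a dead cell becomes live with exactly 3.
      next[y][x] = grid[y][x] ? (n === 2 || n === 3 ? 1 : 0)
                              : (n === 3 ? 1 : 0);
    }
  }
  return next;
}
```

Note the deep loop nesting and branch-heavy inner body; that shape is what the later comments about tracing and profiling are reacting to.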

I see these numbers over here:

  -m: 26.6
  -j: 49.98
  -m -j: 53.19
  -m -j -p: 26.14

For reference, v8 and jsc both score about 45 on this testcase.  So we may be able to get there with JM only...

I think the loops on lines 55 and 56 (well, and 49 and 60) are the core of the benchmark; if I make sure we trace those I see scores around 44.  The loop on 56 gets blacklisted both because maybeShortLoop is true for it and because selfOpsMult is 16100 (presumably due to those error-checking if statements; in this case, unlike the cases with unreached error-check bodies, I think we do hit all 16 possible branches... but that's ok!).
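The blacklisting decision described above can be sketched roughly as follows. This is a simplification with a hypothetical threshold; the actual TraceMonkey oracle logic is in the C++ tracer, and maybeShortLoop / selfOpsMult are values it computes during profiling.

```javascript
// Rough sketch of the loop-blacklisting decision discussed above.
// SELF_OPS_MULT_LIMIT is a made-up cutoff chosen only to illustrate
// that a selfOpsMult of 16100 would trip it.
function shouldBlacklist(maybeShortLoop, selfOpsMult) {
  var SELF_OPS_MULT_LIMIT = 8000; // hypothetical threshold
  return maybeShortLoop || selfOpsMult > SELF_OPS_MULT_LIMIT;
}
```

Either condition alone is enough to keep the tracer away from the loop, which is why the loop on line 56 is rejected twice over.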

Also, the array copy loops (talk about slow ways to copy arrays!) don't get traced because the loop bodies are short; I assume JM optimizes dense arrays pretty well, though.  If I take out the loop on line 49 and everything inside it, JM ends up scoring 153 while TM scores 176... So the array copies are faster in TM, but not hugely.
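For contrast, here is the slow element-by-element copy pattern next to a more idiomatic dense-array copy. Both functions are illustrative sketches, not the benchmark's actual code.

```javascript
// Element-by-element nested copy, in the spirit of the benchmark's
// copy loops ("talk about slow ways to copy arrays!").
function copyGridSlow(src) {
  var dst = [];
  for (var y = 0; y < src.length; y++) {
    dst[y] = [];
    for (var x = 0; x < src[y].length; x++) {
      dst[y][x] = src[y][x];
    }
  }
  return dst;
}

// Idiomatic copy of a dense 2-D array: slice each row in one call.
function copyGridFast(src) {
  var dst = [];
  for (var y = 0; y < src.length; y++) dst[y] = src[y].slice();
  return dst;
}
```

The short inner loop body in the first version is exactly the shape that the tracer declines to trace, so its performance falls to whichever compiler handles dense-array element access best.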
Blocks: 580468
blocking2.0: --- → ?
Keywords: regression
I'm not having a lot of luck getting the profiler to trace this one. There are multiple issues. All the loops execute for only a few iterations. There's lots of loop nesting. And the instruction mix doesn't have a lot of math in it; it's mostly control-flow stuff. I tried adding array access and comparisons to the goodOps calculation, but even that wasn't enough (unless I used really big multipliers).

Getting this to trace without regressing other stuff seems hard.
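The goodOps tuning mentioned above amounts to a weighted score over the profiled instruction mix. The sketch below is hypothetical (names and weights are invented, not the profiler's real ones); it only illustrates why a control-flow-heavy loop scores poorly unless the array-access and comparison multipliers are made very large.

```javascript
// Hypothetical goodOps-style score: weight each profiled op category
// so that math-heavy loops look attractive to the tracer. Weights and
// category names here are illustrative, not SpiderMonkey's.
function goodOpsScore(counts, weights) {
  var score = 0;
  for (var op in counts) {
    score += counts[op] * (weights[op] || 0); // unweighted ops count 0
  }
  return score;
}
```

With a mix dominated by branches (which carry no weight), even crediting array accesses and comparisons leaves the score low relative to the branch count, matching the observation that only "really big multipliers" changed the decision.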
Hmm.  So I guess one question is why _is_ this faster with TM than with JM (or with other methodjits, though the difference there is 15%, not 2x)?  Can we address this by just fixing something in JM?
blocking2.0: ? → -
Interp: 1.89
TM: 1.95
JM: 25.19
JM+TI: 38.67
d8: 52.53

Looks like JM+TI got back some of the performance in the attached testcase, but v8 is about 1.4x faster. Obviously the profiling part of this bug is no longer relevant, but it appears that there are still improvements to be made.
js: 135-140
d8: 160-170

We still have some room for improvement.
NVM, this still didn't have --enable-threadsafe; going to remeasure.
Keywords: regression → perf
Assignee: general → nobody
Severity: normal → S3