Closed Bug 765930 Opened 13 years ago Closed 13 years ago

Reader Mode: Optimize readability check

Tracking

(firefox16+ fixed)

Status:

RESOLVED FIXED

Milestone:

Firefox 16

Tracking Flags:

Tracking

Status

firefox16

fixed

People

(Reporter: lucasr, Assigned: mfinkle)

References

Details

(Keywords: perf, Whiteboard: [sps])

Attachments

(4 files)

Log while readability heuristics are running 13 years ago Kartikaya Gupta (email:kats@mozilla.staktrace.com) 85.26 KB, text/plain		Details
part 1: skip DOM manipulations on check 13 years ago Mark Finkle (:mfinkle) (use needinfo?) 2.21 KB, patch	bnicholson : review+	Details \| Diff \| Splinter Review
part 2: wait until after page load to perform check 13 years ago Mark Finkle (:mfinkle) (use needinfo?) 2.29 KB, patch	bnicholson : review+	Details \| Diff \| Splinter Review
part 3: make browsertoolbar smarter about page actions 13 years ago Mark Finkle (:mfinkle) (use needinfo?) 5.72 KB, patch	sriram : review+	Details \| Diff \| Splinter Review

Lucas Rocha (:lucasr)

Reporter

Description

•

13 years ago

The readability check introduced in bug 750683 involves cloning the DOM document to find the top candidate node with the article content inside the page. We need to optimize this readability check as much as possible and avoid the document node cloning altogether. Ideas: - Avoid removing any node from document to process the page - Look for more steps in the parsing process that are unnecessary for the readability check (we already skip prepDocument and removeScripts, there might be more)

Peter Retzer (:pretzer)

Comment 1

•

13 years ago

Does this include optimizing current false positive cases? For example I'm seeing the reader icon on Bugzilla bug lists, which doesn't really make sense. In-scope or new bug?

Lucas Rocha (:lucasr)

Reporter

Comment 2

•

13 years ago

(In reply to pretzer from comment #1) > Does this include optimizing current false positive cases? For example I'm > seeing the reader icon on Bugzilla bug lists, which doesn't really make > sense. In-scope or new bug? Please file a separate bug for false positives. The readability detection code definitely has to be improved. This bug is about the efficiency of the readability checking code.

Peter Retzer (:pretzer)

Comment 3

•

13 years ago

Filed bug 767956

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Comment 4

•

13 years ago

Attached file Log while readability heuristics are running — Details

+1 on optimizing the readability check. If you load a page like https://crash-stats.mozilla.com/report/index/bp-2b7c0808-37df-40f0-a4d2-a3ed42120703 and then immediately zoom in, the page remains blurry for ~30 seconds before it repaints (on a Galaxy Tab 10.1). Logcat (attached) indicates that it is running the readability detection heuristics during this time. In the log I did the pinch at around 11:47:16 and the page didn't render sharply until around 11:47:48 which is a ridiculously long time.

Benoit Girard (:BenWa)

Comment 5

•

13 years ago

This script is showing up as a major source of main thread contention on mobile. Readability.js blocks for 1.7 seconds on cnn.com page load. Profile: http://people.mozilla.com/~bgirard/cleopatra/?report=AMIfv94gvUJfBU9O1bx1V8cewSf-Qz01_PLNizR0f4SKW39Cp0AXMKsIMNB3fCW4zwYS7ZnGoxLPBBfyfYWi3AuZaua2WXG5bVBTB6_ZM5GP4ZFEv5BxH04d7oHGjKDgtceS1GPFS3x0KJfq4I6xxjL0afF0O9SkOQ I think we need to escalate this.

Blocks: check2

tracking-firefox16: --- → ?

Aaron Train [:aaronmt]

Updated

•

13 years ago

status-firefox16: --- → affected

Keywords: perf

Benoit Girard (:BenWa)

Updated

•

13 years ago

Whiteboard: [sps]

Mark Finkle (:mfinkle) (use needinfo?)

Assignee

Comment 6

•

13 years ago

Attached patch part 1: skip DOM manipulations on check — Details — Splinter Review

This patch tries to skip any DOM manipulations when performing a check. It also fixes a typo "doc" -> "node". This part seems to reduce the time to check on nytimes by half

Assignee: nobody → mark.finkle

Attachment #641447 - Flags: review?(bnicholson)

Mark Finkle (:mfinkle) (use needinfo?)

Assignee

Comment 7

•

13 years ago

Attached patch part 2: wait until after page load to perform check — Details — Splinter Review

We are currently doing the reader check on "DOMContentLoaded" which is too early. The page scripts have not run and the network isn't even really finished yet. This makes it seem to take longer for the page to load. This patch moves to "pageshow", which is also called when going back/forward in session history too.

Attachment #641448 - Flags: review?(bnicholson)

Mark Finkle (:mfinkle) (use needinfo?)

Assignee

Comment 8

•

13 years ago

Part 2 introduces a bug. It seems that the Java BrowserToolbar code is designed in a way that assumes the reader check is done _before_ the network progress is complete. Therefore, the reader icon is not displayed unless you change the orientation and the toolbar is refreshed. Patch coming up...

Mark Finkle (:mfinkle) (use needinfo?)

Assignee

Comment 9

•

13 years ago

Attached patch part 3: make browsertoolbar smarter about page actions — Details — Splinter Review

BrowserToolbar assumes that the security mode and reader mode will be set beofre the network progress finishes. That is not a good idea. This patch: * Renames setReaderVisiblilty to setReaderMode, since it does not actually force the icon to be visible. * Renames setStopVisibility to setPageActionVisibility since it's about more than "stop" now. * setReaderMode and setSecurityMode both now call setPageActionVisibility in case the icon needs to be shown now. The only real hack I had was using mStop.getVisibility() as a param to setPageActionVisibility, but it works well enough for now.

Attachment #641620 - Flags: review?(sriram)

Sriram Ramasubramanian [:sriram]

Comment 10

•

13 years ago

Comment on attachment 641620 [details] [diff] [review] part 3: make browsertoolbar smarter about page actions Review of attachment 641620 [details] [diff] [review]: ----------------------------------------------------------------- This looks good to me.

Attachment #641620 - Flags: review?(sriram) → review+

Brian Nicholson (:bnicholson)

Comment 11

•

13 years ago

Comment on attachment 641447 [details] [diff] [review] part 1: skip DOM manipulations on check Review of attachment 641447 [details] [diff] [review]: ----------------------------------------------------------------- These changes look pretty harmless, and they didn't affect my readability test cases. ::: mobile/android/chrome/content/Readability.js @@ -1300,5 @@ > > check: function() { > - // Set proper flags for parsing document in readability check mode > - this._flags = this.FLAG_READABILITY_CHECK | > - this.FLAG_STRIP_UNLIKELYS; I wonder if this change might actually hurt performance. It avoids removing them, which should help, but that means we leave more nodes to be processed when parsing - especially if these nodes have lots of children.

Attachment #641447 - Flags: review?(bnicholson) → review+

Brian Nicholson (:bnicholson)

Comment 12

•

13 years ago

Comment on attachment 641448 [details] [diff] [review] part 2: wait until after page load to perform check Review of attachment 641448 [details] [diff] [review]: ----------------------------------------------------------------- Hopefully this will fix bug 767959.

Attachment #641448 - Flags: review?(bnicholson) → review+

Mark Finkle (:mfinkle) (use needinfo?)

Assignee

Comment 13

•

13 years ago

https://hg.mozilla.org/integration/mozilla-inbound/rev/f1c05fde89c7 https://hg.mozilla.org/integration/mozilla-inbound/rev/d0bd171a6f4e https://hg.mozilla.org/integration/mozilla-inbound/rev/7601c7ace552

Alex Keybl [:akeybl]

Updated

•

13 years ago

tracking-firefox16: ? → +

Ryan VanderMeulen [:RyanVM]

Comment 14

•

13 years ago

https://hg.mozilla.org/mozilla-central/rev/f1c05fde89c7 https://hg.mozilla.org/mozilla-central/rev/d0bd171a6f4e https://hg.mozilla.org/mozilla-central/rev/7601c7ace552

Status: NEW → RESOLVED

Closed: 13 years ago

Resolution: --- → FIXED

Target Milestone: --- → Firefox 16

Scoobidiver (away)

Updated

•

13 years ago

status-firefox16: affected → fixed

Nobody; OK to take it and work on it

Updated

•

5 years ago

Product: Firefox for Android → Firefox for Android Graveyard

You need to log in before you can comment on or make changes to this bug.