Closed Bug 1141476 Opened 9 years ago Closed 1 month ago

Fetch & parse page metadata in the background for items added via sync and missing metadata

Categories

(Firefox Graveyard :: Reading List, defect, P1)

defect

Tracking

(Not tracked)

RESOLVED INCOMPLETE

People

(Reporter: Unfocused, Unassigned)

References

(Blocks 1 open bug)

Details

(Whiteboard: [reader-ui])

We can't assume all items were added to the server had complete metadata. For those items, we'll want to fetch the page in the background and extract the relevant metadata - then store and sync to the server.

Most notable case of this happening is when a URL is shared to Firefox on Android.
Priority: -- → P1
Flags: qe-verify+
Flags: firefox-backlog+
Need bug 1149336 to be able to deal with syncing the updated item.
Depends on: 1149336
Blocks: 1132074
Whiteboard: [reader-ui]
Assignee: nobody → mhammond
Status: NEW → ASSIGNED
Iteration: --- → 40.2 - 27 Apr
I guess there are 2 ways we could approach this:
1) Load the entire page in the background (just like we do for thumbnails) and fetch the data from that.
2) Fetch just the top-level page, parse that using JSDOMParser and extract it from that.

(1) should always be correct but slow. (2) should be faster but will not pick up things like JS mutating the DOM to setup the metadata we look for.  (2) *might* be fast enough to help with bug 1155014 (ie, PageMetaData.jsm could fallback to (2) if it finds the current DOM doesn't reflect reality)

Any thoughts on this?
Points: --- → 8
Assignee: mhammond → nobody
Status: ASSIGNED → NEW
Iteration: 40.2 - 27 Apr → ---
Product: Firefox → Firefox Graveyard
Status: NEW → RESOLVED
Closed: 1 month ago
Resolution: --- → INCOMPLETE
You need to log in before you can comment on or make changes to this bug.