Open Bug 1854812 Opened 9 months ago Updated 9 months ago

isProbablyReaderable should detect non-JSON-LD valid schema RDFa lite structured data

Categories

(Toolkit :: Reader Mode, enhancement)

Tracking

()

UNCONFIRMED

People

(Reporter: pledgling, Unassigned)

References

(Blocks 3 open bugs)

Details

Steps to reproduce:

I was running the OpenLink Structured Data Sniffer on various pages for which the Reader Mode icon does not appear, and I was surprised to see valid structured data on them, since the Readability GitHub README says "when extracting page metadata, Readability gives precedence to Schema.org fields specified in the JSON-LD format". I then checked the page source and saw that the data is expressed as RDFa attributes directly in the HTML rather than as JSON-LD.

Prefixing the URL with about:reader?url= worked fine for all of them. The pages do not contain a single P element for the algorithm to score, so my suggestion is to show the reader icon when a page contains any valid structured data, not just JSON-LD, especially when property="text" and property="image" attributes exist.
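To make the suggestion concrete, here is a minimal sketch of the kind of check I have in mind. It is written in Python for illustration only and is not how Readability or isProbablyReaderable works today. The names RdfaLiteSniffer and looks_readerable_from_rdfa are hypothetical, and requiring a Schema.org vocab, a typeof and a property="text" attribute is my own assumption about what a reasonable threshold might be.

```python
from html.parser import HTMLParser

# Assumption: only these spellings of the Schema.org vocabulary URL are accepted.
SCHEMA_ORG_VOCABS = {
    "http://schema.org",
    "http://schema.org/",
    "https://schema.org",
    "https://schema.org/",
}


class RdfaLiteSniffer(HTMLParser):
    """Collects RDFa Lite attributes (vocab, typeof, property) from markup."""

    def __init__(self):
        super().__init__()
        self.has_schema_vocab = False
        self.types = set()
        self.properties = set()

    def handle_starttag(self, tag, attrs):
        # Attribute values can be None for bare attributes, so normalise to "".
        attr_map = {name: (value or "") for name, value in attrs}
        if attr_map.get("vocab", "").strip() in SCHEMA_ORG_VOCABS:
            self.has_schema_vocab = True
        # typeof and property may each hold several space-separated terms.
        self.types.update(attr_map.get("typeof", "").split())
        self.properties.update(attr_map.get("property", "").split())


def looks_readerable_from_rdfa(html, required=frozenset({"text"})):
    """Hypothetical heuristic: True if the page declares a Schema.org vocab,
    at least one typeof, and every property named in `required`."""
    sniffer = RdfaLiteSniffer()
    sniffer.feed(html)
    sniffer.close()
    if not (sniffer.has_schema_vocab and sniffer.types):
        return False
    return required <= sniffer.properties


sample = (
    '<div vocab="https://schema.org/" typeof="Product">'
    '<span property="name">Example server</span>'
    '<div property="text">Long description without any P element.</div>'
    '<img property="image" src="photo.png">'
    "</div>"
)
print(looks_readerable_from_rdfa(sample))
```

Passing required={"text", "image"} would tighten the check to pages that mark up both attributes. A real implementation would presumably do the equivalent with DOM queries inside isProbablyReaderable.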

Actual results:

The Reader Mode icon did not appear. Example pages with valid RDFa schema attributes:
https://deconfiguration.glitch.me/DellPowerEdgeR730.html
https://deconfiguration.glitch.me/SunM4000.html
https://deconfiguration.glitch.me/HPProliantDL380P.html
https://deconfiguration.glitch.me/CiscoAIR-CAP3702I.html

Expected results:

The Reader Mode icon should appear.
