Closed Bug 861157 (tagged-pdf) Opened 11 years ago Closed 11 months ago

Support tagged PDFs in pdf.js

Categories

(Firefox :: PDF Viewer, enhancement, P3)

enhancement

Tracking

()

RESOLVED FIXED

People

(Reporter: MarcoZ, Unassigned)

References

(Depends on 3 open bugs, )

Details

(Keywords: access, Whiteboard: [pdfjs-ux])

Tagged PDFs are accessible PDFs with semantic info to make screen readers for the visually impaired read PDFs better. Headings, form fields, tables, and the general flow of the text are determined by a very specific tag hierarchy, much similar to HTML.

Currently, pdf.js does not support tagged PDFs, it ignores tags and only deduces the text from the general text info.

All information about PDF accessibility can be found from this entry site:
http://www.adobe.com/accessibility/

There is also PDF/UA, an initiative to help user agents such as pdf.js to make the most out of accessible PDFs.
Priority: -- → P3
Whiteboard: [pdfjs-c-ux]
Let's continue info and discussion here. Jamie wrote in bug 727819 comment #14:

> Two things worth noting:
> * Support for tagged PDF (and guessing where there aren't tags) will very much change the structure of the HTML representation of the content. Aside from headings, tables, etc., text should also flow better. That is, a single block of content (e.g. a paragraph) should appear in a single block element instead of multiple block elements. Right now, text breaks in awkward places.
> * Tagged PDF can specify the reading order of the content. In extreme cases, the reading order can actually mix content from different pages. There are valid use cases for this; e.g. a 2-page brochure where you are meant to read some parts across both pages instead of reading all of one page and then the other.

Related issue in PDF.js repo: https://github.com/mozilla/pdf.js/issues/6269

If/when Firefox generates tagged PDFs (bug 1657973), it would be nice to support tagged PDFs in PDF.js too.

Depends on: 1705139
Depends on: 1706814
Type: defect → enhancement
Depends on: 1704661
Depends on: 1708035
Depends on: 1708038
Depends on: 1708040
Alias: tagged-pdf
Severity: normal → --
Whiteboard: [pdfjs-c-ux] → [pdfjs-ux]

We support tagged PDFs now, specific problems with the implementation are tracked in individual bugs.

Status: NEW → RESOLVED
Closed: 11 months ago
Resolution: --- → FIXED
You need to log in before you can comment on or make changes to this bug.