Find will not find text if entered with diacritics ("nikud") in Hebrew for PDF viewer
Categories: Firefox :: PDF Viewer, defect, P1
People: Reporter: eyalgruss, Assigned: calixte
Details: Keywords: intl, Whiteboard: [pdfjs-ux][pdfjs-text-search]
Attachments: 1 file
Searching for הספר with "Match Diacritics" turned off does not find הַסֵּפֶר in the following PDF:
https://www.machonso.org/uploads/images/%D7%95%D7%95%D7%9C%D7%A3.pdf
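To make the expected behaviour concrete, here is a minimal sketch of diacritic-insensitive matching. It is an illustration under stated assumptions, not the PDF viewer's actual code: the function names `strip_diacritics` and `contains_ignoring_diacritics` are hypothetical, and the approach (Unicode NFD decomposition followed by dropping nonspacing marks, category `Mn`) is just one common way to ignore nikud.

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    """Return text with nonspacing combining marks removed.

    Hypothetical helper: decomposes to NFD, then drops every code point
    whose Unicode general category is Mn (this includes Hebrew nikud).
    """
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")


def contains_ignoring_diacritics(haystack: str, needle: str) -> bool:
    """True if needle occurs in haystack once both are stripped of marks."""
    return strip_diacritics(needle) in strip_diacritics(haystack)


# The two words from the report, written with escapes to avoid ambiguity:
pointed = "\u05d4\u05b7\u05e1\u05b5\u05bc\u05e4\u05b6\u05e8"  # with nikud
plain = "\u05d4\u05e1\u05e4\u05e8"                            # without nikud

print(contains_ignoring_diacritics(pointed, plain))
```

With "Match Diacritics" off, a search for the unpointed word would be expected to behave like `contains_ignoring_diacritics` here; with the option on, a plain substring comparison would apply instead.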
Comment 1•4 years ago
The severity field is not set for this bug.
:bdahl, could you have a look please?
For more information, please visit auto_nag documentation.
Any update on this? I have the same issue with Greek text. I now have to use Chrome just for this... :(
Chiming in with my experience with PDF search and diacritics, which isn't exactly what OP describes but is surely related. In my case the language is Spanish, so the affected characters are á, é, í, ó and ú. Search is always diacritic-sensitive, regardless of whether "Match Diacritics" is on or off.
Example PDF: https://es.wikipedia.org/api/rest_v1/page/pdf/Ejemplo
Searching for "retórica" does find matches (contrary to OP's Hebrew experience, if I understand correctly), but searching for "retorica" never matches, even though it should with "Match Diacritics" off.
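A viewer also has to highlight the hit in the original text, so stripping accents is not enough on its own: match positions in the folded text need to map back to the unmodified string. The sketch below shows one way to do that, under stated assumptions. The names `fold_with_offsets` and `find_ignoring_diacritics` are hypothetical, and this is not taken from pdf.js.

```python
import unicodedata


def fold_with_offsets(text: str):
    """Strip combining marks and record, for each kept character,
    the index in the original text it came from (hypothetical helper)."""
    folded = []
    offsets = []
    for index, ch in enumerate(text):
        for part in unicodedata.normalize("NFD", ch):
            if unicodedata.category(part) != "Mn":
                folded.append(part)
                offsets.append(index)
    return "".join(folded), offsets


def find_ignoring_diacritics(haystack: str, needle: str):
    """Return (start, end) slices of haystack that match needle
    when combining marks are ignored on both sides."""
    folded, offsets = fold_with_offsets(haystack)
    folded_needle, _ = fold_with_offsets(needle)
    if not folded_needle:
        return []
    matches = []
    start = folded.find(folded_needle)
    while start != -1:
        last = start + len(folded_needle) - 1
        end = offsets[last] + 1
        # Include marks that trail the last matched base character
        # when the original text is stored in decomposed form.
        while end < len(haystack) and unicodedata.category(haystack[end]) == "Mn":
            end += 1
        matches.append((offsets[start], end))
        start = folded.find(folded_needle, start + 1)
    return matches


text = "La ret\u00f3rica cl\u00e1sica"
for begin, end in find_ignoring_diacritics(text, "retorica"):
    print(begin, end, text[begin:end])
```

Folding one character at a time keeps the offset table simple and gives the same result whether the PDF text stores "ó" precomposed or as "o" plus a combining acute accent.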
Assignee
Comment 4•3 years ago
I proposed a patch upstream:
https://github.com/mozilla/pdf.js/pull/13261
:eyaler, I tested with your PDF and it seems ("seems" because I don't read Hebrew) to work fine.
eyal gruss, can you verify that the bug is solved in a current Firefox release?
Assignee
Comment 6•2 years ago
The patch I worked on is still a WIP and so has never landed.
Consequently, there is no chance this bug is fixed.