Add hyphenation patterns for Indian languages
Categories
(Core :: Internationalization, enhancement)
Tracking
Version | Tracking | Status |
---|---|---|
firefox97 | --- | fixed |
People
(Reporter: santhosh.thottingal, Assigned: jfkthame)
References
(Blocks 1 open bug)
Details
(Keywords: dev-doc-needed)
Attachments
(1 file)
Reporter | ||
Updated•9 years ago
|
Comment 1•3 years ago
|
||
Is it possible to move this forward? It seems that Santhosh has done much of the work.
See also https://w3c.github.io/iip/gap-analysis/taml-gap#issue79_hyphenation
Tamil really needs hyphenation support because it has long words, and the same is true of similar languages such as Malayalam (see https://r12a.github.io/scripts/malayalam/#linebreak).
Assignee | ||
Comment 3•3 years ago
|
||
Yes, I think we could try adding these patterns. I'll put up a patch.
It's unclear to me how much review and testing these have actually had among the relevant communities. They're extremely simple rule-based patterns, apparently derived (without attribution) from a simple proof-of-concept that I originally posted to the XeTeX mailing list back in 2004. They may not fully handle cases that go beyond the simple "orthographic cluster" structure of these scripts, but they should be sufficient as a starting point.
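To illustrate what "break only between orthographic clusters" means, here is a minimal Python sketch. It is not the code or the patterns from the patch: the names `clusters` and `soft_hyphenate` are made up for illustration, and the rule it applies (attach every combining mark to the preceding base character, and keep a character that directly follows a virama in the same cluster) is an assumption that roughly approximates conjunct-forming scripts. The actual patterns may differ, for example in how they treat the Tamil pulli.

```python
import unicodedata

# Canonical combining class 9 is what Unicode assigns to virama signs
# (for example U+094D in Devanagari and U+0BCD in Tamil).
VIRAMA_CCC = 9


def clusters(word: str) -> list[str]:
    """Split a word into approximate orthographic clusters.

    Assumption for this sketch: a cluster is a base character plus any
    following combining marks, and a character that directly follows a
    virama stays in the same cluster (approximating a conjunct).
    """
    result: list[str] = []
    for ch in word:
        is_mark = unicodedata.category(ch).startswith("M")
        after_virama = (
            bool(result)
            and unicodedata.combining(result[-1][-1]) == VIRAMA_CCC
        )
        if result and (is_mark or after_virama):
            result[-1] += ch  # extend the current cluster
        else:
            result.append(ch)  # start a new cluster
    return result


def soft_hyphenate(word: str) -> str:
    """Insert U+00AD (soft hyphen) at every cluster boundary."""
    return "\u00ad".join(clusters(word))


if __name__ == "__main__":
    word = "\u0ba4\u0bae\u0bbf\u0bb4\u0bcd"  # "Tamil" written in Tamil script
    print(clusters(word))
```

A real hyphenator would also enforce minimum prefix and suffix lengths and would not rely on this cluster rule alone; the sketch leaves that out to keep the idea visible.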
Assignee | ||
Comment 4•3 years ago
|
||
Using hyphenation patterns from https://github.com/santhoshtr/hyphenation.
The tests here are implemented as Mozilla reftests rather than added to WPT because I don't think
we can reasonably have such tests in WPT. The specific set of languages for which the UA supports
auto-hyphenation is not a normative requirement, nor is the particular dictionary or algorithm
that will be used for any specific language. As such, the exact results are not defined by the
spec. (They may also change over time, if the hyphenation rules we use are updated, in which case
the tests will have to change accordingly.)
Updated•3 years ago
|
Comment 6•3 years ago
|
||
bugherder |
Comment 7•3 years ago
|
||
bugherder |