289938 - Should use real astral chars (not PUA) for math chars outside the Basic Multilingual Plane

Henri Sivonen (:hsivonen) (temporarily away from Bugzilla)

Reporter

Description

•

20 years ago

In order to keep internal strings as UCS2, Mozilla fakes astral math entities by mapping them to PUA characters. Since strings are now UTF-16, I think this hack should be removed and the real characters should be allowed in the DOM and passed to the gfx with ATSUI/Pango/Uniscribe rendering the correct glyphs provided a properly mapped font is installed. Pure UTF-8 already goes the real astral route, so I think it would make sense to fix the gfx implementations to make sure the pure UTF-8 route is the first-class route. Actual results: Astral entities map to PUA chars which are special cased in Win32 and X11 gfx impls but (it seems) not on Mac. Expected results: Astral entities map to the right astral chars and gfx impls on all platforms deal with those chars appropriately.

rbs

Comment 1

•

17 years ago

We use fictional Unicode points (from the so-called Private Use Area - PUA) to reference the special math glyphs that do not have official Unicode assignments. This is in principle what the PUA is meant for. The special math glyphs are needed especially in the stretching process because the process involves "half pieces" that will never get separate individual Unicode points. However, with Cairo, we can modernize the remapping and avoid the detour via the PUA. Doing this will involve pushing the glyph lookup process down to thebes (a lot of the code in nsMathMLChar.cpp). We needed nsMathMLChar.cpp to factor the common functionality away from the disparate GFX platforms. Now with Cairo, we have a common gateway to these platforms, and could push the lookup functionality there, and in the process eliminate our internal assignments to the PUA. This is a major work, but would provide a more elegant approach. Without a unifying Cairo, it would be a nightmare to have multiple GFX implementations of this process.

script used to update our entity list 17 years ago Karl Tomlinson (:karlt) 6.22 KB, text/plain		Details
mathml.dtd patch [checked-in] 17 years ago Karl Tomlinson (:karlt) 125.73 KB, patch	pavlov : review+	Details \| Diff \| Splinter Review
entity changes in sorted pseudo-unified-diff format 17 years ago Karl Tomlinson (:karlt) 23.96 KB, text/plain		Details
include in short arrow entities 17 years ago Karl Tomlinson (:karlt) 1.84 KB, patch		Details \| Diff \| Splinter Review
include ZWSP in short arrow entities (including slarr and srarr) [checked in] 17 years ago Karl Tomlinson (:karlt) 1.85 KB, patch	pavlov : review+	Details \| Diff \| Splinter Review
operator dictionary changes consistent with entity changes [checked-in] 17 years ago Karl Tomlinson (:karlt) 36.58 KB, patch	pavlov : review+	Details \| Diff \| Splinter Review
corresponding nsIEntityConverter table changes [checked-in] 17 years ago Karl Tomlinson (:karlt) 20.64 KB, patch	pavlov : review+	Details \| Diff \| Splinter Review