Closed Bug 712876 Opened 13 years ago Closed 13 years ago

Replace ISO-8859-9 (latin5, etc.) decoder with windows-1254 decoder per HTML5/Encoding spec

Status:

RESOLVED FIXED

Milestone:

mozilla12

People

(Reporter: GPHemsley, Assigned: emk)

Attachments

(3 files, 2 obsolete files)

Encoding label selection test, 13 years ago, Masatoshi Kimura [:emk], 1.73 KB, text/html
Compare iso-8859-9 encoder vs. windows-1254 encoder, 13 years ago, Masatoshi Kimura [:emk], 641 bytes, application/hta
Compare iso-8859-9 encoder vs. windows-1254 encoder, 13 years ago, Masatoshi Kimura [:emk], 641 bytes, application/hta
patch, 13 years ago, Masatoshi Kimura [:emk], 6.67 KB, patch, smontagu: review+
patch for check in. r=smontagu, 13 years ago, Masatoshi Kimura [:emk], 6.76 KB, patch, emk: review+

Gordon P. Hemsley [:GPHemsley]

Reporter

Description

13 years ago

According to the recently-spun-off Encoding Standard [1], Gecko does not currently support the full list of aliases for the windows-1254 encoding, which are as follows: "csisolatin5", "iso-8859-9", "iso-ir-148", "l5", "latin5", and "windows-1254".

It is noted in [1] that these aliases should already be supported per the HTML(5| Living) Standard. For the most recent version of the Encoding Standard, see [2].

I don't know the implementation details of such a thing, but this seems to me to be a candidate Good First Bug.

[1] http://dvcs.w3.org/hg/encoding/raw-file/8cafea8b65f9/Overview.html#windows-1254
[2] http://dvcs.w3.org/hg/encoding/raw-file/tip/Overview.html#windows-1254
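For illustration only, the alias list in the description can be sketched as a label lookup. This is not Gecko code: the names WINDOWS_1254_LABELS and resolve_label are hypothetical, and the whitespace trimming and case-insensitive matching are assumptions made for this sketch.

```python
# Labels quoted in the bug description for windows-1254.
# WINDOWS_1254_LABELS and resolve_label are hypothetical names for this sketch.
WINDOWS_1254_LABELS = frozenset({
    "csisolatin5",
    "iso-8859-9",
    "iso-ir-148",
    "l5",
    "latin5",
    "windows-1254",
})


def resolve_label(label):
    """Return "windows-1254" if label is one of the listed aliases, else None.

    Assumption: labels are matched after trimming whitespace and lowercasing.
    """
    normalized = label.strip().lower()
    if normalized in WINDOWS_1254_LABELS:
        return "windows-1254"
    return None
```

Under these assumptions, resolve_label("Latin5") and resolve_label("ISO-8859-9") would both resolve to "windows-1254", while a label outside the list returns None.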

Boris Zbarsky [:bzbarsky]

Comment 1

13 years ago

> I don't know the implementation details of such a thing

You add the relevant entries to intl/locale/src/charsetalias.properties

Gordon P. Hemsley [:GPHemsley]

Reporter

Comment 2

13 years ago

Oh, perhaps the problem here is that they're all mapped to ISO-8859-9 instead of windows-1254:

http://hg.mozilla.org/mozilla-central/file/ed47a41ba26a/intl/locale/src/charsetalias.properties#l270

#
# Aliases for ISO-8859-9
#
latin5=ISO-8859-9
iso_8859-9=ISO-8859-9
# Currently .properties cannot handle : in key
#iso_8859-9:1989=ISO-8859-9
iso-ir-148=ISO-8859-9
l5=ISO-8859-9
csisolatin5=ISO-8859-9
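To see why the target of these aliases matters, here is a minimal sketch that uses Python's built-in iso8859_9 and cp1254 codecs as stand-ins for the two decoders. It is not Gecko's implementation, and the helper name differing_bytes is hypothetical. It lists the single-byte values that the two decoders turn into different characters.

```python
def differing_bytes():
    """Return the byte values that decode differently in the two encodings.

    Python's "iso8859_9" and "cp1254" codecs stand in for the ISO-8859-9 and
    windows-1254 decoders. Bytes that cp1254 leaves undefined are decoded as
    U+FFFD so that they still show up as a difference.
    """
    diffs = []
    for value in range(0x100):
        raw = bytes([value])
        as_8859_9 = raw.decode("iso8859_9")
        as_1254 = raw.decode("cp1254", errors="replace")
        if as_8859_9 != as_1254:
            diffs.append(value)
    return diffs


if __name__ == "__main__":
    # Print each differing byte with the code point each decoder produces.
    for value in differing_bytes():
        raw = bytes([value])
        left = ord(raw.decode("iso8859_9"))
        right = ord(raw.decode("cp1254", errors="replace"))
        print(f"0x{value:02X}: ISO-8859-9 U+{left:04X}, windows-1254 U+{right:04X}")
```

With these codecs, the differences fall in the 0x80-0x9F range: ISO-8859-9 yields C1 control characters there, while windows-1254 yields printable characters such as the euro sign at 0x80. The Turkish letters in the upper half decode the same way in both.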