127755 - ISO-8859-11 (Latin/Thai) Support

Reporter

Description

•

23 years ago

At present time (Mozilla 0.9.8 & 20020224xx nightly build), Mozilla supports only one Thai char encoding, TIS-620. ISO-8859-11 is not supported yet. ISO-8859-11 Information technology -- 8-bit single-byte coded graphic character sets -- Part 11: Latin/Thai alphabet http://www.iso.ch/iso/en/CatalogueDetailPage.CatalogueDetail? CSNUMBER=28263&ICS1=35&ICS2=40&ICS3= most of the ISO-8859-11 is taken from TIS-620, you may use TIS-620 standard as co-reference. TIS-620 http://www.nectec.or.th/it-standards/std620/std620.htm ---- for testing, Thai language webpage that use ISO-8859-11 http://www.bababorbor.com/

Christopher Hoess (gone)

Comment 1

•

23 years ago

->il8n

Assignee: asa → yokoyama

Component: Browser-General → Internationalization

QA Contact: doronr → ruixu

Rui Xu

Updated

•

23 years ago

Keywords: intl

QA Contact: ruixu → ylong

Boris Zbarsky [:bzbarsky]

Updated

•

23 years ago

Status: UNCONFIRMED → NEW

Ever confirmed: true

Roy Yokoyama

Comment 2

•

23 years ago

-> ftang

Assignee: yokoyama → ftang

Frank Tang

Assignee

Comment 3

•

23 years ago

can you tell me what is the difference betwen TIS-620 and ISO-8859-11 ? Does the final version of ISO-8859-11 the same as what they specified in http://www.nectec.or.th/it-standards/iso8859-11/ ?

Status: NEW → ASSIGNED

Arthit Suriyawongkul

Comment 4

•

23 years ago

the document at link http://www.nectec.or.th/it-standards/iso8859-11/ is quite old - it just a draft.

Roland Mainz

Comment 5

•

23 years ago

Does someone here know where we can get ISO-8859-11-encoded fonts from (PS Type1, TrueType etc.) ?

Arthit Suriyawongkul

Reporter

Comment 6

•

23 years ago

(taken from OpenOffice.org mailing list) Theppitak <thep@links.nectec.or.th> has answered questions about differences of TIS-620 vs ISO-8859-11 ---- Q: about TIS-620 0xA0, should it be mapped to U+00A0 (NBSP) or considered as UNASSIGNED ? Actually, 0xA0 is _unassigned_ in TIS-620. Although NBSP becomes well-known when TIS-620 is put in row with encoding tables of ISO-8859 series, not every legacy system under TIS-620 standards recognizes and handles this character. So, IMO, it should be treated _unassigned_ as such. But the distiction seems to become more relaxed as time goes by, anyway. :) Q: The differences among MS874, TIS-620, ISO-8859-11. MS874 = TIS-620 + { NBSP, ellipsis, quote_left, quote_right, doublequote_left, doublequote_right, bullet, en_dash, em_dash } ISO-8859-11 = TIS-620 + { NBSP } -Thep. ----

Frank Tang

Assignee

Updated

•

23 years ago

Blocks: thai

Arthit Suriyawongkul

Reporter

Comment 7

•

23 years ago

changed severity to MAJOR, since we are about to migrate from TIS-620 to ISO-8859-11. ftang: i changed only the severity -- i understand your team situation, a priority is still up to your team :)

Severity: normal → major

Masaki Katakai

Comment 8

•

23 years ago

Frank, Will this go to 1.0?

Roland Mainz

Comment 9

•

23 years ago

ToDo list (Unix/Linux only): - We need a ISO-8859-11 converter - We need to hook-up entries for ISO-8859-11 in mozilla/gfx/src/gtk/nsFontMetricsGTK.cpp and mozilla/gfx/src/xlib/nsFontMetricsXlib.cpp Is this list complete ?

Frank Tang

Assignee

Comment 10

•

23 years ago

ok, so http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP874.TXT is MS874, right ? if I take out the 0x80 0x20AC #EURO SIGN 0x85 0x2026 #HORIZONTAL ELLIPSIS 0x91 0x2018 #LEFT SINGLE QUOTATION MARK 0x92 0x2019 #RIGHT SINGLE QUOTATION MARK 0x93 0x201C #LEFT DOUBLE QUOTATION MARK 0x94 0x201D #RIGHT DOUBLE QUOTATION MARK 0x95 0x2022 #BULLET 0x96 0x2013 #EN DASH 0x97 0x2014 #EM DASH then it become ISO-8859-11 and if I then take out 0xA0 0x00A0 #NO-BREAK SPACE then it become TIS-620 ? Is that the case? You didn't mention euro, assume TIS-620 do not have euro neither ISO-8859-11

Frank Tang

Assignee

Comment 11

•

23 years ago

Attached file iso885911.uf — Details

Frank Tang

Assignee

Comment 12

•

23 years ago

Attached file iso885911.ut — Details

Frank Tang

Assignee

Comment 13

•

23 years ago

Attached file tis620.uf — Details

Frank Tang

Assignee

Comment 14

•

23 years ago

Attached file tis620.ut — Details

Frank Tang

Assignee

Comment 15

•

23 years ago

ok, I make the uf and ut by the following way 1. copy http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP874.TXT into intl/uconv/tools 2. in intl/uconv/tools , make umaptable 3. vi cp874.txt and remove those line said undefine 4. umaptable -uf < cp874.txt > cp874.uf 5. umaptable -ut < cp874.txt > cp874.ut the result of 4 and 5 are identical to the current cp874.uf and cp874.ut if I use cvs -b . the difference is in the comment section of licensing now I copy the cp874.txt to iso885911.txt and remove those entreis I menetion: 0x80 0x20AC #EURO SIGN 0x85 0x2026 #HORIZONTAL ELLIPSIS 0x91 0x2018 #LEFT SINGLE QUOTATION MARK 0x92 0x2019 #RIGHT SINGLE QUOTATION MARK 0x93 0x201C #LEFT DOUBLE QUOTATION MARK 0x94 0x201D #RIGHT DOUBLE QUOTATION MARK 0x95 0x2022 #BULLET 0x96 0x2013 #EN DASH 0x97 0x2014 #EM DASH and run the umaptable to generate iso885911.uf and ut then I copy iso885911.txt to tis620.txt but remove 0xa0 and then run umaptable to generate tis620.uf and ut roland or masaki, if you guy really care to support, you can take these .uf and ut and copy the code we do for cp874 to make convert for it after that you do need to change charsetData.properties and charsetName.properties and probably also navigator.properties alecf is changing how we create the converter. He is moving away from the one class per converter to pass in table in the base contructor. I don't want to generate any stuff which might conflict with his work untill he finish. I have consider subclass the cp874 converter and add if code for those characters. But after look at the size of the table it seems not worthy because each of these tables are 32 bytes only. I don't think any if statmenet is less than 32 bytes in machine code :) roland, do you want to take this ?

iso885911.uf 23 years ago Frank Tang 2.62 KB, text/plain		Details
iso885911.ut 23 years ago Frank Tang 2.62 KB, text/plain		Details
tis620.uf 23 years ago Frank Tang 2.62 KB, text/plain		Details
tis620.ut 23 years ago Frank Tang 2.62 KB, text/plain		Details
a patch(not yet fully tested) 22 years ago Jungshik Shin 37.74 KB, patch		Details \| Diff \| Splinter Review
a new patch with gtk/xlib changes added 22 years ago Jungshik Shin 35.02 KB, patch		Details \| Diff \| Splinter Review
the same patch with the license change and the 'pollution' removed 22 years ago Jungshik Shin 34.97 KB, patch	smontagu : review+ rbs : superreview+	Details \| Diff \| Splinter Review