Open Bug 793881 Opened 12 years ago Updated 2 years ago

Komi-Permyak language (koi) is confused for Korean (ko)

Tracking

()

Status:

NEW

People

(Reporter: amir.aharoni, Unassigned)

References

(Depends on 1 open bug)

Details

Attachments

(1 file)

an HTML file with a demostration 12 years ago Amir Aharoni 398 bytes, text/html		Details

Amir Aharoni

Reporter

Description

•

12 years ago

Attached file an HTML file with a demostration — Details

The Komi-Permyak language has the language code "koi". If the lang attribute of an element is defined to this code, a font is applied to it as if it was Korean ("ko"). This language is written in the Cyrillic script, so forcing a Korean font on it is weird and wrong.

This doesn't happen in Chromium.

This affects, among other sites, the Wikipedia in that language ( https://koi.wikipedia.org ), and all other Wikipedias that link to it.

Simon Montagu :smontagu

Comment 1

•

12 years ago

I don't understand what is happening here: the equivalent error doesn't seem to occur for other three-letter language codes

Component: Internationalization → Layout: Text

Simon Montagu :smontagu

Comment 2

•

12 years ago

Hmm, I also can't reproduce if I select an explicit font for "Other languages" in fonts prefs, rather than the default "serif", "sans-serif" etc.

Jonathan Kew [:jfkthame]

Comment 3

•

12 years ago

I think this may be a fontconfig issue. On my Linux VM, it looks like fontconfig is defaulting to a Korean font for language codes it doesn't recognize, as well as to some (but not all) languages that would be expected to use Cyrillic.

jkew@jkew-vb:~$ fc-match 
DejaVuSans.ttf: "DejaVu Sans" "Book"

jkew@jkew-vb:~$ fc-match :lang=en
DejaVuSans.ttf: "DejaVu Sans" "Book"

jkew@jkew-vb:~$ fc-match :lang=ja
fonts-japanese-gothic.ttf: "TakaoPGothic" "Regular"

jkew@jkew-vb:~$ fc-match :lang=zh
wqy-microhei.ttc: "文泉驛微米黑" "Regular"

jkew@jkew-vb:~$ fc-match :lang=ko
NanumGothic.ttf: "NanumGothic" "Regular"

jkew@jkew-vb:~$ fc-match :lang=uk
DejaVuSans.ttf: "DejaVu Sans" "Book"    # seems reasonable for Ukrainian

jkew@jkew-vb:~$ fc-match :lang=ru
NanumGothic.ttf: "NanumGothic" "Regular"    # surprising

jkew@jkew-vb:~$ fc-match :lang=koi
NanumGothic.ttf: "NanumGothic" "Regular"    # maybe FC doesn't recognize "koi"?

jkew@jkew-vb:~$ fc-match :lang=xxx
NanumGothic.ttf: "NanumGothic" "Regular"

Amir Aharoni

Reporter

Comment 4

•

12 years ago

Oh BTW, forgot to mention that I'm trying this on Linux. On other systems the behavior may be different.

There is a somewhat similar bug in the Android OS: If you ask to see the interface of your device in the Sakha language (sah), you'll get it in Sanskrit (sa) instead. I guess that in both cases somebody was checking just the first two letters instead of parsing the language code properly.

Simon Montagu :smontagu

Comment 5

•

12 years ago

(In reply to Amir Aharoni from comment #4)
> I guess that in both cases somebody was checking just
> the first two letters instead of parsing the language code properly.

That was my first assumption too, but from my and Jonathan's test results, it looks as if something different is going on here, and that this is probably a fontconfig bug.

Simon Montagu :smontagu

Comment 6

•

12 years ago

Bug 835074 may fix this, since it assigns Cyrillic script to koi and a bunch of other languages that were previously undefined, though judging by comment 3 the problem may still occur with other languages.

Depends on: 835074

Andre Klapper

Updated

•

11 years ago

See Also: → https://bugzilla.wikimedia.org/show_bug.cgi?id=40492

BMO Automation

Updated

•

2 years ago

Severity: normal → S3

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Quick Search

Komi-Permyak language (koi) is confused for Korean (ko)

Categories

(Core :: Layout: Text and Fonts, defect)

Tracking

()

People

(Reporter: amir.aharoni, Unassigned)

References

(Depends on 1 open bug)

Details

Crash Data

Security

(public)

User Story

Attachments

(1 file)

Description

Comment 1

Comment 2

Comment 3

Comment 4

Comment 5

Comment 6

Updated

Updated

Attachment

General

Description

File Name

Content Type