Closed Bug 210798 Opened 21 years ago Closed 13 years ago

stop.hu - Doesn't handle unicode 4 Hungarian characters in forms (o" u" O" U").

Categories

(Tech Evangelism Graveyard :: Other, defect)

x86
Windows XP
defect
Not set
normal

Tracking

(Not tracked)

RESOLVED INCOMPLETE

People

(Reporter: kio77, Unassigned)

References

()

Details

User-Agent:       Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.4b) Gecko/20030516 Mozilla Firebird/0.6
Build Identifier: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.4b) Gecko/20030516 Mozilla Firebird/0.6

In forums, and in webmails, when I type in Hungarian accented text, in the form
it shows up well. However, when I load the page, only the unicode style encoding
is present instead of the character itself. These are the doubleacute o u O and
U. Like here, I enter
o" - ő
u" - ű
O" - Ő
U" - Ű

Sometimes, software developers simply replace o" with o~ and u" with u^. This
solution is not nice, but is better than seeing the Unicode codes. I think this
is because all other Hungarian letters can be find in the 255 ASCII table,
however ő ű Ő Ű are in the Unicode region.

Many programs have these problem, like Microsoft Office X for the Macintosh
simply displays _ (underline characters) for ő ű Ő Ű, or Opera 6.0 for Windows
had the same problem when it came out (they have fixed this in later versions).
Or Corel Wordperfect 2003 is completely incapable to handle well these characters.

Interestingly, not all sites have this problem. For example, the forum of
hwsw.hu (the biggest site for IT news in Hungary), does not have this problem,
but in Squirrel mail at https://inno.bme.hu/webmail this is a problem.

Reproducible: Always

Steps to Reproduce:
1. Go to http://www.stop.hu/apronet/felvitel.tdp to post an ad.
2. Select the first category in the drop down menu *Rovat:
3. Type in the text into the field *A hírdetés szövege (max 500 karakter):
I have entered ő ű Ő Ű
4. Type in a name in the *Neved: field.
5. Typen in your e-mail address where the activation link will be sent at the
*e-mail címed field.
6. Click on Tovább >>.
7. Click on Tovább >> once again.
8. You can see the preview of your ad.
9. If you want to post your ad with the broken text, click on Tovább >>.
10. In your e-mail box click on the long link under the text "A regisztraciot
ezen a linken tudod elesiteni:" that will publish your ad.
11. To see your ad, go to
http://www.stop.hu/apronet/lista.tdp?alrovat=19&submit=++OK+ (this is the first
category in the drop down list where we published or ad).
Actual Results:  
The characters o" u" O" U" are shown as o" - ő O" - Ő u" - ű U" -
Ű in the text body of the ad, with their respective Unicode codes.

Expected Results:  
The characters should have been displayed normally, not as Unicode codes.
Changing the View\Character coding to UTF-8 does not fix the problem.

The page http://www.inf.elte.hu/valerie/speci/html/htmltananyag/gyak1/alapok.htm
contains a list with the html and unicode encodings used in Hungarian.

You can contact the website admin at
http://www.stop.hu/stuff/otlet.tdp
You might install the Hungarian keyboard in Windows to be able to type in o" u".
If you install the Hungarian keyboard, o" is next to P to the right, and u" is
usually on the backslash key on US keyboard.
Does change the charset to ISO-8859-2 could get display properly?
When entering text, if the encoding is ISO-8859-1
When viewing text, independently of the encoding
The display is incorrect

When entering text, if the encoding is ISO-8859-2
When viewing the resulting page, if the encoding is ISO-8859-2
The display is correct, both for the entered text and for the rest of the page.

When entering text, if the encoding is UTF-8
When viewing the resulting page, if the encoding is UTF-8
The display of the entered text is correct, but elswhere the page is full of ?
marks.

So it seems, the setting the default encoding to ISO-8859-2 partly solves the
problem. Only partly, because most of the websites in Hungary are set up for
ISO-8859-1 encoding, as this is the most common, and usually browser default to
this. And I have to remember personally which site is in which encoding, and
other browsers (Internet Explorer, Opera) handle this automatically.

The remedy would be to have FireBird correctly detect the page encoding at the
beginning, and submit the form in the corresponding encoding.
>The remedy would be to have FireBird correctly detect the page encoding at the
>beginning, and submit the form in the corresponding encoding.

I thought that was what we always did. If not, it's a bug in form handling.

If I look on the html source of the page that defines the forms, it tells the
code page to be ISO-8859-2 (http://www.stop.hu/apronet/felvitel.tdp).
meta HTTP-EQUIV="CHARSET" CONTENT="text/html; charset="ISO-8859-2
If the display is correctly with charset set to iso-8859-2, then we are doing fine:

The problem is the meta-tag charset of the page has an error:
<meta HTTP-EQUIV="CHARSET" CONTENT="text/html; charset="ISO-8859-2>
instead of:
<meta HTTP-EQUIV="CHARSET" CONTENT="text/html; charset="ISO-8859-2">

So, it's an evangelism bug rather browser bug.
You are true - so I have changed it for Tech Evangelism. I just don't
understand, why IE 6.0 can handle the same situation well even when there is a
small error.
Component: Internationalization → English Other
Product: Browser → Tech Evangelism
Version: Trunk → unspecified
Component: English Other → Other
An e-mail reporting the error has been sent to the webmaster.
Summary: Doesn't handle unicode 4 Hungarian characters in forms (o" u" O" U"). → stop.hu - Doesn't handle unicode 4 Hungarian characters in forms (o" u" O" U").
Assignee: smontagu → other
QA Contact: amyy → other
Status: UNCONFIRMED → NEW
Ever confirmed: true
INCOMPLETE due to lack of activity since the end of 2009.

If someone is willing to investigate the issues raised in this bug to determine whether they still exist, *and* work with the site in question to fix any existing issues, please feel free to re-open and assign to yourself.

Sorry for the bugspam; filter on "NO MORE PRE-2010 TE BUGS" to remove.
Status: NEW → RESOLVED
Closed: 13 years ago
Resolution: --- → INCOMPLETE
Product: Tech Evangelism → Tech Evangelism Graveyard
You need to log in before you can comment on or make changes to this bug.