pages are sometimes interpreted as windows-1252 instead of UTF-8 until a reload is done
Categories
(Core :: DOM: Core & HTML, defect)
Tracking
()
People
(Reporter: vincent-moz, Unassigned)
Details
Attachments
(4 files)
| Reporter | ||
Comment 1•9 years ago
|
||
| Reporter | ||
Comment 2•9 years ago
|
||
| Reporter | ||
Comment 4•9 years ago
|
||
| Reporter | ||
Comment 5•9 years ago
|
||
| Reporter | ||
Comment 6•9 years ago
|
||
| Reporter | ||
Comment 8•9 years ago
|
||
| Reporter | ||
Comment 9•9 years ago
|
||
Comment 10•9 years ago
|
||
| Assignee | ||
Updated•6 years ago
|
Comment 11•3 years ago
|
||
Looks like this problem has reappeared in 100.0.1 mint001 on Linux Mint, 64 Bit. Also seen on Windows 7, 64 Bit.
Visit http://biblio.aktionsgruppe.de/obiblio/opac/index.php and search for Marek. The site is an instance of Obiblio library software.
The pages that it generates are clearly marked <META http-equiv="content-type" content="text/html; charset=iso-8859-1"> which Firefox also reports when accessing Ctrl-I. There Firefox states
content-type text/html; charset=iso-8859-1
but above that, it reads
Text encoding Windows1252
and all accented characters are wrong. They are correct in the database (checked with phpMyAdmin), the web page is correctly marked, but still...
Possibly linked to this behaviour is some odd problem I see on various systems, Obiblio one of them, but also instances of Tiki Wiki, since a couple of weeks:
On sites that worked flawlessly for years, which did NOT get a software update of the PHP software or a change of collation in the database.
When a user fills in a text entry field that the PHP program will use to perform a search: Is there a collation information sent from Firefox to the website?
When you enter text to be searched, the (PHP) programs now throw an error "illegal mix of collations". Looks like in text entry fields, firefox does not pass the pages own encoding as collation (which would be latin1_german2_ci for iso8859), but utf8_general_ci, if accented characters are present in the user input... It does not happen when there are no accented characters in the user input...
Thanks
hman
| Reporter | ||
Comment 12•3 years ago
|
||
The HTTP response headers contain:
Content-Type: text/html; charset=OBIB_CHARSET
OBIB_CHARSET is not a valid charset. So the issue would be the configuration of the web server, not Firefox.
Comment 13•3 years ago
|
||
In HTML the meta statement is <META http-equiv="content-type" content="text/html; charset=iso-8859-1">
Comment 14•3 years ago
|
||
And Firefox detects ISO 8859-1, but does not use it, see screenshot. Hm, cannot attach a screenshot here?
Comment 15•3 years ago
|
||
Codepage detection info page.
Comment 16•3 years ago
|
||
Comment 17•3 years ago
|
||
Uh, yes, despite the META, network analysis showed indeed Content-Type: text/html; charset=OBIB_CHARSET. Thanks for pointing me to that.
And I got lucky. I did not write Open Biblio software, but this was the result of a bug that was quickly found and corrected, thank you.
Now the correct Content-Type text/html; charset=iso-8859-1 is written in the HTTP request header.
But - Firefox still doesn't do it right. Firefox still correctly recognizes iso-8859-1, but still uses windows-1252 and produces dysfunctional diacritics...
| Reporter | ||
Comment 18•3 years ago
|
||
Concerning windows-1252 while the page is declared as iso-8859-1 is related to this bug. This is bug 897302 / bug 890478.
| Reporter | ||
Comment 19•3 years ago
|
||
I meant is unrelated to this bug.
Comment 20•3 years ago
|
||
That bug was closed as invalid. Declaring windows1252 can never be correct on a Linux machine, because that code page does not exist. Also, the actual text rendering is obviously wrong and NOT iso8859-1, just look at it.
Comment 21•3 years ago
|
||
Table is set to biblio_status_dm InnoDB 10 Dynamic 9 1820 16384 0 0 0 NULL 2022-05-29 11:57:30 NULL NULL latin1_german2_ci NULL row_format=DYNAMIC
latin1_german2_ci = iso-8859-1 character set. And this is the rendered result:
Comment 22•3 years ago
|
||
Render result.
Comment 23•3 years ago
|
||
Text content in the table .
| Reporter | ||
Comment 24•3 years ago
|
||
(In reply to Oliver Kluge from comment #22)
Bildschirmfoto vom 2022-06-11 02-22-06.png
This looks incorrect because the characters are encoded in UTF-8 while the server declares iso-8859-1. You need to fix the configuration of the server so that it declares utf-8 (at the same time, this will avoid the windows-1252 nonsense).
Description
•