The URL is a random example: the same problem occurs in almost any l10n diff in a non-ASCII script. Notice the "illegal character" symbols (�) at the beginnings and ends of lines on the page at the URL, and that lines with Japanese wrap much too early. This is because the lines are wrapped at 80 bytes, not at 80 characters of UTF-8 text, so wrapped lines have fewer than 80 characters and a UTF-8 multibyte sequence is sometimes split between lines.
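To make the difference concrete, here is a minimal Python sketch of the two behaviours. It is not Bonsai's code; the function names `wrap_bytes` and `wrap_chars` and the sample string are made up for illustration. Cutting every 80 bytes can land in the middle of a multibyte sequence, while decoding first and cutting every 80 characters cannot.

```python
def wrap_bytes(data: bytes, width: int = 80) -> list[bytes]:
    """Naive wrap: cut every `width` bytes, ignoring character boundaries."""
    return [data[i:i + width] for i in range(0, len(data), width)]


def wrap_chars(data: bytes, width: int = 80) -> list[str]:
    """Decode the UTF-8 input first, then cut every `width` characters."""
    text = data.decode("utf-8")
    return [text[i:i + width] for i in range(0, len(text), width)]


# Hypothetical sample: 90 Japanese characters, 3 bytes each in UTF-8.
sample = ("日本語" * 30).encode("utf-8")

# The first byte-wrapped chunk ends inside a character, so decoding it
# yields a U+FFFD replacement character at the end of the line.
broken = [chunk.decode("utf-8", errors="replace") for chunk in wrap_bytes(sample)]

# Character-wrapped lines decode cleanly and hold a full 80 characters.
clean = wrap_chars(sample)
```

With three-byte characters, an 80-byte line also holds only 26 whole characters, which matches the "wrapping much too short" symptom.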
arg. who voided the warranty on bonsai by claiming that content was utf8? wrapping to 80 characters is stupid for multilingual systems because they can be of different widths. for mxr diff, i recently bumped the size from 50 to 80. but i'm inclined to just bump it again...
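The point about differing widths can be sketched as well. The following is only an illustration, assuming that characters whose East Asian Width property is Wide or Fullwidth occupy two terminal columns and everything else occupies one; it ignores combining marks and ambiguous-width characters, and `display_width` and `wrap_columns` are hypothetical names, not anything in Bonsai or MXR.

```python
import unicodedata


def display_width(ch: str) -> int:
    """Assumed column count: 2 for Wide/Fullwidth characters, else 1."""
    return 2 if unicodedata.east_asian_width(ch) in ("W", "F") else 1


def wrap_columns(text: str, columns: int = 80) -> list[str]:
    """Wrap so that no line exceeds `columns` display columns."""
    lines, current, used = [], [], 0
    for ch in text:
        w = display_width(ch)
        # Start a new line if this character would overflow the current one.
        if current and used + w > columns:
            lines.append("".join(current))
            current, used = [], 0
        current.append(ch)
        used += w
    if current:
        lines.append("".join(current))
    return lines
```

Under that assumption, an 80-column line fits 80 ASCII characters but only 40 Japanese ones, so "80 characters" and "80 columns" are different limits.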
(In reply to comment #1)
> wrapping to 80 characters is stupid for multilingual systems because they can
> be of different widths.
True enough, but even for other values of "80", the more important issue here is the invalid characters. Editing summary to focus attention on that.
Summary: cvsview.cgi should linewrap at 80 characters, not bytes → cvsview.cgi should linewrap at character boundaries
Bonsai was decommissioned; closing all remaining bugs as "wontfix".
Status: NEW → RESOLVED
Last Resolved: 2 years ago
Resolution: --- → WONTFIX
Product: Webtools → Webtools Graveyard