Closed Bug 285435 Opened 20 years ago Closed 9 years ago

ISO-8859-1 page detected as BIG-5 by universal auto-detect

Categories

(Core :: Internationalization, defect)

defect
Not set
normal

Tracking

()

RESOLVED WONTFIX

People

(Reporter: jmdesp, Assigned: jmdesp)

References

Details

Attachments

(1 file)

This case shows the problem is not only with SJIS and GB18030. Will need to check if the problem is more similar to bug 168526 (after filtering, the content looks too much like big5), or to bug 181344 (we hit a key sequence that is directly reported as big5)
Attached file Reproduction sample
Yes, the reproduction sample is the content of a bugzilla page. Saved as attachement because modifications to the original page could change the behaviour.
Summary: ISO-8859-1 page detected as BIG-5 → ISO-8859-1 page detected as BIG-5 by universal auto-detect
The content is being detected as being syntactically valid Big5. Some of the character sequences, particularly a6 61, are high frequency characters, causing the Big5 prober to return a high confidence.
QA Contact: amyy → i18n
Attachment #176876 - Attachment mime type: text/html → text/html; charset=
Chinese detector is gone.
Status: NEW → RESOLVED
Closed: 9 years ago
Resolution: --- → WONTFIX
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: