Closed Bug 906032 Opened 11 years ago Closed 11 years ago

Export Telemetry data CHARSET_OVERRIDE_USED by locale

Categories

(Mozilla Metrics :: Data/Backend Reports, defect)

defect
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED
Unreviewed

People

(Reporter: mreid, Assigned: mreid)

References

Details

Attachments

(4 files)

Provide a by-locale breakdown of the telemetry flag CHARSET_OVERRIDE_USED on the release channel since Firefox 22 in a) desktop Firefox and b) Firefox for Android to inform the inclusion of the character encoding over a feature in a) Australis and b) B2G.

Data to be provided in a form like:

date, locale, app, on_count, off_count
2013-08-16, pt-BR, Firefox, 4, 500
Assignee: nobody → mreid
Group: metrics-private
OS: Linux → All
Hardware: x86_64 → All
See Also: → 847919
Here is one day's exported data.  I will export more days if this looks OK.  I added in the Firefox version - if you don't want/need that, I can take it out.
Attachment #792831 - Flags: feedback?(hsivonen)
Flags: needinfo?(hsivonen)
This looks great. Thank you.
Flags: needinfo?(hsivonen)
Comment on attachment 792831 [details]
bug906032.20130818.clean.txt

Oops. I didn't realize there was another pending flag as well. Sorry.
Attachment #792831 - Flags: feedback?(hsivonen) → feedback+
Here is an export of the data going back to August 8th.  I will export more data and attach it here as it becomes available.

Please let me know once you've seen enough :)
Usage:
cat *.txt | python count_charset.py > result.html
(In reply to Mark Reid [:mreid] from comment #4)
> Please let me know once you've seen enough :)

I think the data you attached is sufficient. Thank you!
Status: NEW → RESOLVED
Closed: 11 years ago
Resolution: --- → FIXED
Comment on attachment 796536 [details]
Summary data as HTML

Some observations: Unsurprisingly, locales that don't heed to the advice given in https://developer.mozilla.org/en-US/docs/Localizations_and_character_encodings are the locales that sort to the top by character encoding overwrite usage. Next up is Traditional Chinese. My guess is that people who use Traditional Chinese browser UI read Simplified Chinese content often enough that the difference in the legacy encodings between the two explains the need to use the menu.

I guess for starters, I should file bugs on the locales that default to UTF-8 and are high up on the list and on the locales whose fall back is listed as "(broken value)".
Blocks: 910163
Blocks: 910165
Blocks: 910169
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: