Closed
Bug 816082
Opened 12 years ago
Closed 12 years ago
B2G SMS: UCS2 chars mapping to 7 bits default alphabet
Categories
(Core :: DOM: Device Interfaces, defect, P2)
Tracking
()
People
(Reporter: carlosmartinez, Assigned: vicamo)
References
Details
Attachments
(7 files, 8 obsolete files)
|
50.91 KB,
application/pdf
|
Details | |
|
438.06 KB,
application/pdf
|
Details | |
|
5.62 KB,
patch
|
vicamo
:
review+
|
Details | Diff | Splinter Review |
|
10.95 KB,
patch
|
vicamo
:
review+
|
Details | Diff | Splinter Review |
|
2.45 KB,
patch
|
vicamo
:
review+
|
Details | Diff | Splinter Review |
|
10.29 KB,
patch
|
vicamo
:
review+
|
Details | Diff | Splinter Review |
|
57.36 KB,
application/vnd.oasis.opendocument.text
|
Details |
Related to: https://bugzilla.mozilla.org/show_bug.cgi?id=815564
Attached you can find a document describing the desired mapping between UCS2 and 7 bits default alphabet regarding SMS sending.
| Reporter | ||
Updated•12 years ago
|
blocking-basecamp: --- → ?
| Reporter | ||
Updated•12 years ago
|
OS: Windows 7 → Gonk (Firefox OS)
Hardware: x86_64 → ARM
| Reporter | ||
Comment 1•12 years ago
|
||
| Reporter | ||
Comment 2•12 years ago
|
||
First attached document doesn´t have the correct format, please use the 2nd one.
Comment 4•12 years ago
|
||
This is critical for product certification in latin countries.
| Assignee | ||
Updated•12 years ago
|
Attachment #686095 -
Attachment is obsolete: true
| Assignee | ||
Comment 5•12 years ago
|
||
Character "₤"(U+20a4) is not part of default 7-bit alphabet.
| Reporter | ||
Comment 6•12 years ago
|
||
Hi Vicamo,
According to the documentation we have, is part of the default alphabet, you can found it in: http://www.turkcell.com.tr/downloads/hakkimizda/pdf/23038-820.pdf or attached to the bug. The table we have used to create the list is in page 18.
If this document is not correct or you have a better one, please share it with us so we can correct the list.
Thank you.
| Reporter | ||
Comment 7•12 years ago
|
||
| Assignee | ||
Comment 8•12 years ago
|
||
(In reply to Carlos Martínez Toral [:carlosmartinez] from comment #6)
> If this document is not correct or you have a better one, please share it
> with us so we can correct the list.
http://www.3gpp.org/ftp/Specs/html-info/23038.htm
v11.0.0 released on 2012-09-21, "₤"(U+20a4) is not included.
| Assignee | ||
Comment 9•12 years ago
|
||
Won't request review because we still have problem with the list.
| Assignee | ||
Comment 10•12 years ago
|
||
| Assignee | ||
Comment 11•12 years ago
|
||
Depends on bug 759529 because test script needs it.
Depends on: 759529
| Assignee | ||
Comment 12•12 years ago
|
||
(In reply to Carlos Martínez Toral [:carlosmartinez] from comment #6)
> Hi Vicamo,
>
> According to the documentation we have, is part of the default alphabet, you
> can found it in:
> http://www.turkcell.com.tr/downloads/hakkimizda/pdf/23038-820.pdf or
> attached to the bug. The table we have used to create the list is in page 18.
Actually, I can't find "₤"(U+20a4) in that whole document, either. Are you sure that you don't mean "£"(U+ffe1)? The updated list contains "₤" => "₤", but "₤" is not part of default alphabet, so it's actually ineffective and will still be sent with UCS2 encoding.
| Assignee | ||
Comment 13•12 years ago
|
||
Comment on attachment 686649 [details] [diff] [review]
Part 1: more entries in GSM_SMS_STRICT_7BIT_CHARMAP
Review of attachment 686649 [details] [diff] [review]:
-----------------------------------------------------------------
::: dom/system/gonk/ril_consts.js
@@ +1811,5 @@
> + "\u00cc": "\u0049", // "Ì" => "I"
> + "\u00cd": "\u0049", // "Í" => "I"
> + "\u00ce": "\u0049", // "Î" => "I"
> + "\u00cf": "\u0049", // "Ï" => "I"
> + "\u00d1": "\u00d1", // "Ñ" => "Ñ"
already in default alphabet
@@ +1889,5 @@
> + "\u017c": "\u007a", // "ż" => "z"
> + "\u017d": "\u005a", // "Ž" => "Z"
> + "\u017e": "\u007a", // "ž" => "z"
> + "\u025b": "\u0045", // "ɛ" => "E"
> + "\u0398": "\u0398", // "Θ" => "Θ"
ditto
| Assignee | ||
Updated•12 years ago
|
Assignee: nobody → vyang
Comment 14•12 years ago
|
||
(In reply to Vicamo Yang [:vicamo][:vyang] from comment #12)
> (In reply to Carlos Martínez Toral [:carlosmartinez] from comment #6)
>
> Actually, I can't find "₤"(U+20a4) in that whole document, either. Are you
> sure that you don't mean "£"(U+ffe1)? The updated list contains "₤" => "₤",
> but "₤" is not part of default alphabet, so it's actually ineffective and
> will still be sent with UCS2 encoding.
If 7bit code scheme is set, the sms should be codified using it even when the user introduce a character that it is not covered in the conversion table. Please use a wildcard character when it happens to avoid the codification in any other system.
| Reporter | ||
Comment 15•12 years ago
|
||
Hi Vicamo,
You´re right, we have confused "₤"(U+20a4) and "£"(U+ffe1), please use the correct one "£"(U+ffe1), and sorry for the typo.
Thank you.
Carlos
Comment 16•12 years ago
|
||
Looks like this is required for certification in market. Marking blocking.
blocking-basecamp: ? → +
Flags: needinfo?(clee)
| Assignee | ||
Comment 17•12 years ago
|
||
(In reply to Carlos Martínez Toral [:carlosmartinez] from comment #15)
> You´re right, we have confused "₤"(U+20a4) and "£"(U+ffe1), please use the
> correct one "£"(U+ffe1), and sorry for the typo.
And there is also problem with "€"(U+20ac) => "E". "€"(U+20ac) is already in default 7-Bit shift table, which means you don't have to map it into "E" and it will still be encoded in 7-Bit alphabet. Is this really what you want?
(In reply to Beatriz Rodríguez [:brg] from comment #14)
> If 7bit code scheme is set, the sms should be codified using it even when
> the user introduce a character that it is not covered in the conversion
> table. Please use a wildcard character when it happens to avoid the
> codification in any other system.
You really have to make up your mind. What's the FINAL version of this feature? That's exactly why I don't want to enable it in Gecko by default. And maybe we shouldn't have implemented it at first place at all.
blocking-basecamp: + → ?
| Assignee | ||
Comment 18•12 years ago
|
||
(In reply to Vicamo Yang [:vicamo][:vyang] from comment #17)
> blocking-basecamp: + → ?
Oops, sorry! I don't mean it.
| Reporter | ||
Comment 20•12 years ago
|
||
(In reply to Vicamo Yang [:vicamo][:vyang] from comment #17)
> (In reply to Carlos Martínez Toral [:carlosmartinez] from comment #15)
> > You´re right, we have confused "₤"(U+20a4) and "£"(U+ffe1), please use the
> > correct one "£"(U+ffe1), and sorry for the typo.
>
> And there is also problem with "€"(U+20ac) => "E". "€"(U+20ac) is already in
> default 7-Bit shift table, which means you don't have to map it into "E" and
> it will still be encoded in 7-Bit alphabet. Is this really what you want?
>
> (In reply to Beatriz Rodríguez [:brg] from comment #14)
> > If 7bit code scheme is set, the sms should be codified using it even when
> > the user introduce a character that it is not covered in the conversion
> > table. Please use a wildcard character when it happens to avoid the
> > codification in any other system.
>
> You really have to make up your mind. What's the FINAL version of this
> feature? That's exactly why I don't want to enable it in Gecko by default.
> And maybe we shouldn't have implemented it at first place at all.
You´re right again, "€"(U+20ac) is included in the GSM 7 bit default alphabet extension table but I didn´t know that this is included by default, please use the € instead of E in this case, and sorry again.
Comment 21•12 years ago
|
||
(In reply to Vicamo Yang [:vicamo][:vyang] from comment #17)
>
> (In reply to Beatriz Rodríguez [:brg] from comment #14)
> > If 7bit code scheme is set, the sms should be codified using it even when
> > the user introduce a character that it is not covered in the conversion
> > table. Please use a wildcard character when it happens to avoid the
> > codification in any other system.
>
> You really have to make up your mind. What's the FINAL version of this
> feature? That's exactly why I don't want to enable it in Gecko by default.
> And maybe we shouldn't have implemented it at first place at all.
We need to support 7bit code scheme when sending sms with latin languages.If code scheme is set to 7bit, the platform should use it. Platform cannot change it automatically when a character not listed is typed by the user, that is why I suggest to put a wildcard character.
Comment 22•12 years ago
|
||
Mass Modify: All un-milestoned, unresolved blocking-basecamp+ bugs are being moved into the C3 milestone. Note that the target milestone does not mean that these bugs can't be resolved prior to 12/10, rather C2 bugs should be prioritized ahead of C3 bugs.
Target Milestone: --- → B2G C3 (12dec-1jan)
Updated•12 years ago
|
Priority: -- → P2
Updated•12 years ago
|
Blocks: b2g-v1-certification
| Assignee | ||
Comment 23•12 years ago
|
||
comment out another three already in default alphabet entries.
Attachment #686649 -
Attachment is obsolete: true
Attachment #691260 -
Flags: review?(htsai)
| Assignee | ||
Comment 24•12 years ago
|
||
options passed to ril_worker should contain full per segment info even it contains only one segment. Overridden text strings are included now, so duplicated logic in ril_worker can be removed.
Attachment #691262 -
Flags: review?(htsai)
| Assignee | ||
Comment 25•12 years ago
|
||
Attachment #691263 -
Flags: review?(htsai)
| Assignee | ||
Comment 26•12 years ago
|
||
Add both xpcshell & marionette test cases. Note that marionette test cases depend on https://github.com/mozilla-b2g/platform_external_qemu/pull/10 .
Attachment #686650 -
Attachment is obsolete: true
Attachment #691296 -
Flags: review?(htsai)
Updated•12 years ago
|
Attachment #691260 -
Flags: review?(htsai) → review+
Updated•12 years ago
|
Attachment #691262 -
Flags: review?(htsai) → review+
Comment 27•12 years ago
|
||
Comment on attachment 691263 [details] [diff] [review]
Part 3/4: replace non-7bit-alphabet chars with '*' when strict7BitEncoding is enabled
Review of attachment 691263 [details] [diff] [review]:
-----------------------------------------------------------------
Please see my comment below, thank you :)
::: dom/system/gonk/RadioInterfaceLayer.js
@@ +2042,5 @@
> + c = '*';
> + if (langTable.indexOf(c) >= 0) {
> + length++;
> + } else {
> + length += 2;
Would you please comment why 'length += 2' here? Thanks!
Attachment #691263 -
Flags: review?(htsai) → review+
Comment 28•12 years ago
|
||
Comment on attachment 691296 [details] [diff] [review]
Part 4/4: add test scripts : v2
Review of attachment 691296 [details] [diff] [review]:
-----------------------------------------------------------------
Nice!
::: dom/sms/tests/marionette/test_strict_7bit_encoding.js
@@ +178,5 @@
> + }
> +
> + // When strict7BitEncoding is enabled, we should replace characters that
> + // can't be encoded with GSM 7-Bit alphabets with '*'.
> + add_case("\u65b0\u5e74\u5feb\u6a02", "****"); // "Happy New Year" in Chinese.
:D
Attachment #691296 -
Flags: review?(htsai) → review+
| Assignee | ||
Comment 29•12 years ago
|
||
Add r=hsinyi
Attachment #691260 -
Attachment is obsolete: true
Attachment #692829 -
Flags: review+
| Assignee | ||
Comment 30•12 years ago
|
||
Add r=hsinyi
Attachment #691262 -
Attachment is obsolete: true
Attachment #692831 -
Flags: review+
| Assignee | ||
Comment 31•12 years ago
|
||
1) Address review comment #27.
In previous revision, I used the fact that '*' is either in locking shift table or in single shift table. It's true for current released specifications but there is no guarantee in the future.
In this revision, I increase 'length' by 2 only when we can find '*' in single shift table and return -1 otherwise.
2) Add r=hsinyi
Attachment #691263 -
Attachment is obsolete: true
Attachment #692833 -
Flags: review+
| Assignee | ||
Comment 32•12 years ago
|
||
Add r=hsinyi
Attachment #691296 -
Attachment is obsolete: true
Attachment #692835 -
Flags: review+
| Assignee | ||
Comment 33•12 years ago
|
||
In stressed marionette runs, I find the marionette test script can fail following ones. It inserts a lot (200+) unfinished database transactions, quits successfully, and leaves unfinished tasks counted on the next test script. When the next test script happens to have a short MARIONETTE_TIMEOUT value set, it will probably starve.
This issue doesn't affect the integrity and correctness of the SMS databases, but is an implementation weakness for timed automation tests. I remove the marionette part in this revision.
Attachment #692835 -
Attachment is obsolete: true
Attachment #692892 -
Flags: review+
| Assignee | ||
Comment 34•12 years ago
|
||
| Assignee | ||
Comment 35•12 years ago
|
||
(In reply to Vicamo Yang [:vicamo][:vyang] from comment #33)
> This issue doesn't affect the integrity and correctness of the SMS
> databases, but is an implementation weakness for timed automation tests. I
> remove the marionette part in this revision.
Bug 822207 filed to address this problem.
Comment 36•12 years ago
|
||
https://hg.mozilla.org/mozilla-central/rev/609a53b92cee
https://hg.mozilla.org/mozilla-central/rev/07baed38f016
https://hg.mozilla.org/mozilla-central/rev/2a616878a5b1
https://hg.mozilla.org/mozilla-central/rev/f0d7097db421
Status: NEW → RESOLVED
Closed: 12 years ago
Flags: in-testsuite+
Resolution: --- → FIXED
Comment 37•12 years ago
|
||
https://hg.mozilla.org/releases/mozilla-aurora/rev/176672e2e40b
https://hg.mozilla.org/releases/mozilla-aurora/rev/88c7005b6a29
https://hg.mozilla.org/releases/mozilla-aurora/rev/7cd948d04ace
https://hg.mozilla.org/releases/mozilla-aurora/rev/0adc3bb27f5b
https://hg.mozilla.org/releases/mozilla-b2g18/rev/ef9e5c5703ce
https://hg.mozilla.org/releases/mozilla-b2g18/rev/6d856daeb583
https://hg.mozilla.org/releases/mozilla-b2g18/rev/40f01f3d6860
https://hg.mozilla.org/releases/mozilla-b2g18/rev/3128ce43bf1a
Comment 38•12 years ago
|
||
With today build on Unagi there is a problem with the € symbol:
Gaia 73270c3
Geckonb8d7310
Using a VIVO SIM Card from Brazil, that characther '€' is sent like '?e'
According to the pdf attached, it should be sent as 'E'
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
| Reporter | ||
Comment 39•12 years ago
|
||
Sorry for the confusion, € should be sent as €, as it was included in the default extension table, as per comment 17.
Comment 40•12 years ago
|
||
Vicamo, should the problem in comment 39 be handled in a followup bug or is it a major issue with the patches that landed here?
| Assignee | ||
Comment 41•12 years ago
|
||
(In reply to Dietrich Ayala (:dietrich) from comment #40)
> Vicamo, should the problem in comment 39 be handled in a
> followup bug or is it a major issue with the patches that
> landed here?
It's not even a problem. Please update that PDF.
Status: REOPENED → RESOLVED
Closed: 12 years ago → 12 years ago
Resolution: --- → FIXED
| Assignee | ||
Comment 42•12 years ago
|
||
Bug 824612 is the marionette test cases for this issue. As you can see in https://tbpl.mozilla.org/?tree=Try&rev=64003f2f846f , all test cases, inclusive of '\u20ac', passed with all greens. However, this is based on emulator. I'll test it on Unagi next morning.
Comment 43•12 years ago
|
||
With Unagi and latest build 01/014/2012 the '€' symbol sent from a VIVO SIM Card is received as '?e'. Please correct us if we are not right but our understanding is that is should be received as '€'
| Assignee | ||
Comment 44•12 years ago
|
||
(In reply to Isabel Rios [:isabel_rios] from comment #43)
> With Unagi and latest build 01/014/2012 the '€' symbol sent
> from a VIVO SIM Card is received as '?e'. Please correct us if
> we are not right but our understanding is that is should be
> received as '€'
Could you help me to capture some debug log messages? I just solved bug 828633 today. The test cases run correctly with expected results. And last time you reported the same thing, I was then told it's actually a false alarm.
Anyway, detailed steps for enable debug messages are:
1) Connect usb cable
2) Execute `adb pull /system/b2g/defaults/pref/user.js .`
3) Edit pulled file user.js. Change value of "ril.debugging.enabled" to "true"
4) Execute `adb push user.js /system/b2g/defaults/pref`
5) reboot Unagi
6) Execute `adb logcat | tee filename.log` and send a SMS message that contains '€' in GUI. Press Ctrl-C.
7) Attach captured log.
Thanks.
Comment 45•12 years ago
|
||
Comment 46•12 years ago
|
||
(In reply to Vicamo Yang [:vicamo][:vyang] from comment #44)
> (In reply to Isabel Rios [:isabel_rios] from comment #43)
> > With Unagi and latest build 01/014/2012 the '€' symbol sent
> > from a VIVO SIM Card is received as '?e'. Please correct us if
> > we are not right but our understanding is that is should be
> > received as '€'
>
> Could you help me to capture some debug log messages? I just solved bug
> 828633 today. The test cases run correctly with expected results. And last
> time you reported the same thing, I was then told it's actually a false
> alarm.
>
> Anyway, detailed steps for enable debug messages are:
> 1) Connect usb cable
> 2) Execute `adb pull /system/b2g/defaults/pref/user.js .`
> 3) Edit pulled file user.js. Change value of "ril.debugging.enabled" to
> "true"
> 4) Execute `adb push user.js /system/b2g/defaults/pref`
> 5) reboot Unagi
> 6) Execute `adb logcat | tee filename.log` and send a SMS message that
> contains '€' in GUI. Press Ctrl-C.
> 7) Attach captured log.
>
> Thanks.
Hi Vivamo,
Attached you can find the log file you asked for. Hope it helps. In case it is not correct or whatever, please let me know so I will attach another one.
Thanks
Comment 47•12 years ago
|
||
There is a new version of the table of characters to convert in:
https://bug887745.bugzilla.mozilla.org/attachment.cgi?id=776220
Comment 48•9 years ago
|
||
This is critical for product certification in latin countries, i'm getting error from this page http://www.ucakbileti.com.tr but there is no problem when i'm using chrome browser
You need to log in
before you can comment on or make changes to this bug.
Description
•