Closed Bug 752780 Opened 14 years ago Closed 13 years ago

Convert invalid UTF-16 to UTF-8 in websocket send()

Tracking

()

Status:

RESOLVED FIXED

Milestone:

mozilla16

People

(Reporter: jduell.mcbugs, Assigned: jduell.mcbugs)

Details

Attachments

(1 file)

v1 fix 13 years ago Jason Duell 9.48 KB, patch	sicking : review+	Details \| Diff \| Splinter Review

Jason Duell

Assignee

Description

•

14 years ago

The W3C Candidate Recommendation 08 December 2011 says we should throw an exception if text with unpaired surrogates is passed to send(). The latest editor's draft has changed that to say that we should convert the incoming argument to Unicode if it's not already (I'm an ignoramus about Unicode: apparently this involves inserting replacement characters). Not sure when we should ship a fix: now, or wait until the Editor's draft becomes another W3C recommendation?

Jonas Sicking (:sicking) No longer reading bugmail consistently

Updated

•

14 years ago

Summary: Convert invalid UTF-8 to UTF-8 in websocket send() → Convert invalid UTF-16 to UTF-8 in websocket send()

Jonas Sicking (:sicking) No longer reading bugmail consistently

Comment 1

•

14 years ago

Consensus in the working group was to take this change (though it came down to a close vote if we should make the change now or wait until v2 of the protocol). So I feel pretty confident that this change will stick, so I see no reason to wait.

Jason Duell

Assignee

Comment 2

•

13 years ago

Attached patch v1 fix — Details — Splinter Review

Why do I ever think little things like this will take "just 15 minutes"? Anyway: here's what seems to be a working fix. It turns out that the existing, custom conversion code (nsWebSocket::ConvertTextToUTF8) doesn't insert replacement characters correctly (instead of replacing with '0xef 0xbf 0xbd', aka '\ufffd', it replaced with '0xed 0xa0 0x80'. Which seems odd, because the code explicitly tells the converter to use UTF_8_REPLACEMENT_CHAR. But I didn't spent too much time worrying about it, because it seems like we're fine just using CopyUTF16toUTF8(), which is what we're already using everywhere else in the logic.

Assignee: nobody → jduell.mcbugs

Status: NEW → ASSIGNED

Attachment #638996 - Flags: review?(jonas)

Jonas Sicking (:sicking) No longer reading bugmail consistently

Updated

•

13 years ago

Attachment #638996 - Flags: review?(jonas) → review+

Jason Duell

Assignee

Comment 3

•

13 years ago

https://hg.mozilla.org/integration/mozilla-inbound/rev/5a104d7e8e7c

Ryan VanderMeulen [:RyanVM]

Comment 4

•

13 years ago

https://hg.mozilla.org/mozilla-central/rev/5a104d7e8e7c

Status: ASSIGNED → RESOLVED

Closed: 13 years ago

Resolution: --- → FIXED

Target Milestone: --- → mozilla16

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Convert invalid UTF-16 to UTF-8 in websocket send()

Categories

(Core :: Networking: WebSockets, defect)

Tracking

()

People

(Reporter: jduell.mcbugs, Assigned: jduell.mcbugs)

References

Details

Crash Data

Security

(public)

User Story

Attachments

(1 file)

Description

Updated

Comment 1

Comment 2

Updated

Comment 3

Comment 4

Attachment

General

Description

File Name

Content Type