Open Bug 306627 Opened 19 years ago Updated 2 years ago

Make sure nsIDOMKeyEvent::GetCharCode () works fine with non-0-plane characters

Tracking

()

Status:

NEW

People

(Reporter: jonitis, Unassigned)

Details

Attachments

(1 file, 2 obsolete files)

Possible solution 19 years ago Dainis Jonitis 13.52 KB, patch		Details \| Diff \| Splinter Review
Possible solution 19 years ago Dainis Jonitis 14.22 KB, patch		Details \| Diff \| Splinter Review
Possible solution 19 years ago Dainis Jonitis 14.09 KB, patch		Details \| Diff \| Splinter Review

Dainis Jonitis

Reporter

Description

•

19 years ago

This is a followup bug 295228.

The problem is that current code throws away Unicode plane information by
casting the UCS4 character code that is returned by GetCharCode () function to
PRUnichar.

Boris sugests (295228#c18 and 295228#c20) that it is not a good idea to extend
the nsIDomNSUIEvent interface with new methods. Instead I created new methods
that perform conversion from UCS4 character to UTF16 string and back. This
solution is endian independent.

At least on Windows there is also a problem in nsWindow.cpp widget code that is
not ready to receive the high surogate character codes from WM_CHAR message. It
is easy to fix and I will later attach patch.

Now I want to hear some feadback whether changes that I've made are ok. I've put
UCS4<->UTF16 conversion routines in intl/unicharutil, but probably they better
fit in xpcom/string.

Dainis Jonitis

Reporter

Comment 1

•

19 years ago

Attached patch Possible solution (obsolete) — Details — Splinter Review

Simon Montagu :smontagu

Comment 2

•

19 years ago

See bug 297943. It looks as if we have been approaching the problem from
different ends :)

Dainis Jonitis

Reporter

Comment 3

•

19 years ago

Great, I will use your version of Win32 nsWindow::OnChar.

Dainis Jonitis

Reporter

Comment 4

•

19 years ago

Note to self:
Boris in bug 316394 added function AppendUCS4ToUTF16 () that can be used instead of UCS4CharToUTF16 (). Look if there is equivalent for UTF16ToUCS4Char (). Need to update the patch with new functions.

Boris Zbarsky [:bzbarsky]

Comment 5

•

19 years ago

> Look if there is equivalent for UTF16ToUCS4Char

Yes, I moved those in the same patch -- see macros in nsCharTraits.h.

Dainis Jonitis

Reporter

Comment 6

•

19 years ago

Attached patch Possible solution (obsolete) — Details — Splinter Review

Updated the patch to use the new function AppendUCS4ToUTF16 ().
I do not see how using only macros from nsCharTraits.h extract UCS4 code from nsAString (so that caller is only one line). So I moved my UTF16ToUCS4Char () helper function from nsUniCharUtil.cpp to nsReadableUtils.cpp.
I think this function can be helpful. I searched Mozilla source code for users of SURROGATE_TO_UCS4. Majority of them work with PRUnichar* params, but there are a couple of users that take nsAString as a source:
  nsUnicodeNormalizer.cpp, mdn_normalize () 
  nsTextBoxFrame.cpp, nsTextBoxFrame::CalculateTitleForWidth ()
  nsIDNService.cpp, utf16ToUcs4
Some of callers use it in loop and for them it could be helpful to also return the number of UTF16 codepoints that were used to construct the returned UCS4 character.
Probably it could be more useful instead of:
  NS_COM PRUint32 UTF16ToUCS4Char (const nsAString& aSource);
have something like:
  NS_COM PRUint32 UTF16ToUCS4Char (const nsAString& aSource, PRUint32& aDest)

This will return number of UTF16 codepoints consumed, but result will be in aDest. It will better match the AppendUCS4ToUTF16 (). Still not sure about best name for this function.

Boris, what do you think?

Attachment #194479 - Attachment is obsolete: true

Boris Zbarsky [:bzbarsky]

Comment 7

•

19 years ago

Before I start trying to dig into this, does the UTF16CharEnumerator enumerator class in nsUTF8Utils.h (yeah, kinda odd place for it to be) do what you're trying to do?  Seems to me like it might...

Dainis Jonitis

Reporter

Comment 8

•

19 years ago

Yeah, seems that PRUint32 UTF16CharEnumerator::NextChar(nsAString::const_iterator& iter, const nsAString::const_iterator& end, PRBool *err = nsnull) is doing what I need. The only problem that each caller then should use four lines instead of one. 

 nsAString::const_iterator start, end;
 aSource.BeginReading(start);
 aSource.EndReading(end);
 PRUint32 ucs4 = UTF16CharEnumerator::NextChar(start, end);

Not a big deal, but on the onther hand we can add third implementation of NextChar to UTF16CharEnumerator that takes "const nsAString&" as a param for those callers who do no care about updated iterator values.

Something like that:

static PRUint32 NextChar(const nsAString& aSource)
{
    nsAString::const_iterator start, end;
    aSource.BeginReading(start);
    aSource.EndReading(end);
    return NextChar (start, end);
}

Is it worth adding a new method?

Boris Zbarsky [:bzbarsky]

Comment 9

•

19 years ago

Probably worth it, yeah.  Might want to call it FirstChar() (or FirstUCS4Char()?), though, since that's what it returns.

Dainis Jonitis

Reporter

Comment 10

•

19 years ago

Attached patch Possible solution — Details — Splinter Review

Add function FirstUCS4Char () to nsReadableUtils.cpp that internally calls UTF16CharEnumerator::NextChar (). It avoids the need to callers to adittionally include nsUTF8Utils.h that is not an obvious place where to expect this function. I've intentionally have not made it inline in nsReadableUtils.h to avoid every user of nsReadableUtils.h implicitly including nsUTF8Utils.h and thus reducing build time.

Attachment #203309 - Attachment is obsolete: true

Phil Ringnalda (:philor)

Updated

•

15 years ago

Assignee: events → nobody

QA Contact: ian → events

Firefox Bug Husbandry Bot

Comment 11

•

6 years ago

https://bugzilla.mozilla.org/show_bug.cgi?id=1472046

Move all DOM bugs that haven’t been updated in more than 3 years and has no one currently assigned to P5.

If you have questions, please contact :mdaly.

Priority: -- → P5

BMO Automation

Updated

•

2 years ago

Severity: normal → S3

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Quick Search

Make sure nsIDOMKeyEvent::GetCharCode () works fine with non-0-plane characters

Categories

(Core :: DOM: Events, defect, P5)

Tracking

()

People

(Reporter: jonitis, Unassigned)

References

Details

Crash Data

Security

(public)

User Story

Attachments

(1 file, 2 obsolete files)

Description

Comment 1

Comment 2

Comment 3

Comment 4

Comment 5

Comment 6

Comment 7

Comment 8

Comment 9

Comment 10

Updated

Comment 11

Updated

Attachment

General

Description

File Name

Content Type