Closed Bug 125077 Opened 23 years ago Closed 23 years ago

Change nsITextTransform::Change to take const PRUnichar* as an input

Tracking

()

Status:

VERIFIED FIXED

Milestone:

mozilla0.9.9

People

(Reporter: nhottanscp, Assigned: nhottanscp)

Details

(Keywords: intl)

Attachments

(2 files, 1 obsolete file)

Added an method to take PRUnichar* as an input string. 23 years ago nhottanscp 1.59 KB, patch		Details \| Diff \| Splinter Review
Changed to pass a string length. 23 years ago nhottanscp 1.65 KB, patch	ftang : review+ blizzard : superreview+	Details \| Diff \| Splinter Review
Client side change, in mail code, changed to use the non nsString function, plus clean up. 23 years ago nhottanscp 6.39 KB, patch	bugzilla : review+	Details \| Diff \| Splinter Review

nhottanscp

Assignee

Description

•

23 years ago

nsITextTransform::Change takes an input as nsString& which may require the
client an extra allocation.
I think the input does not need to change, so we can use const PRUnichar* as an
input.

http://lxr.mozilla.org/seamonkey/source/intl/unicharutil/public/nsITextTransform.h

Takayuki Tei

Comment 1

•

23 years ago

add dependency.  set TM to 0.9.9.

Blocks: 73403

Target Milestone: --- → mozilla0.9.9

nhottanscp

Assignee

Comment 2

•

23 years ago

Attached patch Added an method to take PRUnichar* as an input string. (obsolete) — Details — Splinter Review

Frank Tang

Comment 3

•

23 years ago

Comment on attachment 69112 [details] [diff] [review]
Added an method to take PRUnichar* as an input string.

I think we should pass in the
length. so in case the caller
already know the length, we don't
need to call strlen again here.

nhottanscp

Assignee

Comment 4

•

23 years ago

Attached patch Changed to pass a string length. — Details — Splinter Review

Attachment #69112 - Attachment is obsolete: true

Frank Tang

Comment 5

•

23 years ago

Comment on attachment 69181 [details] [diff] [review]
Changed to pass a string length.

r=ftang

Attachment #69181 - Flags: review+

Christopher Blizzard (:blizzard)

Comment 6

•

23 years ago

Comment on attachment 69181 [details] [diff] [review]
Changed to pass a string length.

sr=blizzard

Attachment #69181 - Flags: superreview+

nhottanscp

Assignee

Comment 7

•

23 years ago

The patch was checked in, remove the dependency.
Keep the bug open for the client side change.

No longer blocks: 73403

Status: NEW → ASSIGNED

Rui Xu

Updated

•

23 years ago

Keywords: intl

nhottanscp

Assignee

Comment 8

•

23 years ago

Attached patch Client side change, in mail code, changed to use the non nsString function, plus clean up. — Details — Splinter Review

Jean-Francois Ducarroz

Comment 9

•

23 years ago

Comment on attachment 69684 [details] [diff] [review]
Client side change, in mail code, changed to use the non nsString function, plus clean up.

is nsIAtom::GetUnicode create a new unicode string? if yes, you are leaking
charsetName! in that case I would propose you to use an nsXPIDLCString to solve
the problem. Else, R=ducarroz

Attachment #69684 - Flags: review+

nhottanscp

Assignee

Comment 10

•

23 years ago

 51 void GetUnicode([shared, retval] out wstring aResult);
http://lxr.mozilla.org/seamonkey/source/xpcom/ds/nsIAtom.idl#51

The implementation does not allocate memory, it returns a pointer of the
internal buffer.
http://lxr.mozilla.org/seamonkey/source/xpcom/ds/nsAtomTable.cpp#241

So, charsetName is not leaked.

(not reading, please use seth@sspitzer.org instead)

Comment 11

•

23 years ago

1)

+ !nsCRT::strcmp(charsetName, NS_LITERAL_STRING("ISO-8859-1").get()) ?

why not do something like:

!strcmp(NS_LossyConvertUCS2toASCII(charsetName).get(), "ISO-8859-1") ?

2)

+  if (!nsCRT::strcmp(charsetName, NS_LITERAL_STRING("ISO-2022-JP").get())) {

why not do something like:

if (!strcmp(NS_LossyConvertUCS2toASCII(charsetName).get(), "ISO-2022-JP")) {

standard libc strcmp of two c strings is faster than nsCRT::strcmp() of two UCS2
strings, right?

3)

+        res = textTransform->Change(inString, nsCRT::strlen(inString), mapped);

can't you do:

inString.Length()?

address those 3 items, and then sr=sspitzer

nhottanscp

Assignee

Comment 12

•

23 years ago

1) & 2)
I thought the conversion by NS_LossyConvertUCS2toASCII takes more time than
strlen for PRUnichar*, note charset names are usually short like "ISO-8859-1".

3) inString in PRUnichar*, so cannot do that.

(not reading, please use seth@sspitzer.org instead)

Comment 13

•

23 years ago

> I thought the conversion by NS_LossyConvertUCS2toASCII takes more time than
> strlen for PRUnichar*, note charset names are usually short like "ISO-8859-1".

I'm not sure it matters how slow strlen is, maybe you mean strcmp.

NS_LossyConvertUCS2toASCII() does look to be kind of heavy weight.  I guess the
question is which is worse:  

NS_LossyConvertUCS2toASCII() and native strcmp() 
or a hand rolled nsCRT::strcmp().

looking at NS_LossyConvertUCS2toASCII() and nsCRT::strcmp(), you are right. 
your way is faster.

(general question, why are charsets not char strings?)

> inString in PRUnichar*, so cannot do that.

oops, I was looking at other inStrings in nsMsgI18N.cpp

sr=sspitzer

nhottanscp

Assignee

Comment 14

•

23 years ago

checked in

>(general question, why are charsets not char strings?)
uconv uses nsIAtom for charset and its nsIAtom is PRUnichar* base.
Because charset name is always ASCII, we can use char* until we come to the
point where we need unicode conversion (or need a canonicalized name). After
that, we can use the charset name in const PRUnichar* from nsIAtom which needs
no memory allocation and no need to strcasecmp (because of the canonicalization).

Status: ASSIGNED → RESOLVED

Closed: 23 years ago

Resolution: --- → FIXED

Rui Xu

Comment 15

•

23 years ago

Should IQA be involved in this bug?

nhottanscp

Assignee

Comment 16

•

23 years ago

verified by lxr

Status: RESOLVED → VERIFIED

QA Contact: ruixu → nhotta

You need to log in before you can comment on or make changes to this bug.