Closed Bug 200984 Opened 22 years ago Closed 22 years ago

[FIX]Arabic text in Javascript "unescape" function returns the wrong output

Tracking

()

Status:

VERIFIED FIXED

Milestone:

mozilla1.4beta

People

(Reporter: neokuwait, Assigned: bzbarsky)

References

Details

Attachments

(3 files, 2 obsolete files)

unescape testcase 22 years ago Thamer Mahmoud 562 bytes, text/html		Details
Proposed patch 22 years ago Boris Zbarsky [:bzbarsky] 4.93 KB, patch		Details \| Diff \| Splinter Review
Oops, that had an extra chunk in it... 22 years ago Boris Zbarsky [:bzbarsky] 3.91 KB, patch	jst : superreview+	Details \| Diff \| Splinter Review
UTF-16 test case 22 years ago Waldemar Horwat 1.44 KB, text/html		Details
Updated to tip 22 years ago Boris Zbarsky [:bzbarsky] 3.92 KB, patch	waldemar : review+	Details \| Diff \| Splinter Review

Thamer Mahmoud

Reporter

Description

•

22 years ago

Gecko/20030328 TO REPRODUCE: 1. view testcase ACTUAL: the unescape function produces Latin characters instead of Arabic EXPECTED: only characters in their %xx.. hexadecimal form should be affected? (see IE6)

Thamer Mahmoud

Reporter

Comment 1

•

22 years ago

Attached file unescape testcase — Details

Boris Zbarsky [:bzbarsky]

Assignee

Comment 2

•

22 years ago

Intl -- not arabic specific.

Assignee: mkaply → smontagu

Status: UNCONFIRMED → NEW

Component: BiDi Hebrew & Arabic → Internationalization

Ever confirmed: true

OS: Windows XP → All

QA Contact: zach → ylong

Hardware: PC → All

Boris Zbarsky [:bzbarsky]

Assignee

Comment 3

•

22 years ago

Attached patch Proposed patch (obsolete) — Details — Splinter Review

Boris Zbarsky [:bzbarsky]

Assignee

Comment 4

•

22 years ago

Attached patch Oops, that had an extra chunk in it... (obsolete) — Details — Splinter Review

Attachment #119646 - Attachment is obsolete: true

Boris Zbarsky [:bzbarsky]

Assignee

Comment 5

•

22 years ago

Comment on attachment 119647 [details] [diff] [review] Oops, that had an extra chunk in it... Would you review? The key here is that ToNewCString does lossy "conversion" to a CString.... This patch makes the conversion use the proper charset encoder, does a bit of cleanup, and removes the unnecessary call to Reset() (new decoders already come blank).

Attachment #119647 - Flags: superreview?(jst)

Attachment #119647 - Flags: review?(smontagu)

Boris Zbarsky [:bzbarsky]

Assignee

Comment 6

•

22 years ago

mine.

Assignee: smontagu → bzbarsky

Priority: -- → P1

Summary: Arabic text in Javascript "unescape" function returns the wrong output → [FIX]Arabic text in Javascript "unescape" function returns the wrong output

Target Milestone: --- → mozilla1.4beta

Johnny Stenback (:jst)

Comment 7

•

22 years ago

Comment on attachment 119647 [details] [diff] [review] Oops, that had an extra chunk in it... // Allocate a buffer of the maximum length PRUnichar *dest = (PRUnichar*)nsMemory::Alloc(sizeof(PRUnichar) * maxLength); + NS_ENSURE_TRUE(dest, NS_ERROR_OUT_OF_MEMORY); + ... + rv = decoder->Convert(encodedData, &unescapedByteCount, dest, &destLen); + if (NS_FAILED(rv)) { nsMemory::Free(dest); - return result; + return rv; } aReturn.Assign(dest, destLen); nsMemory::Free(dest); Could you make dest an nsAutoPtr<PRUnichar *> here? If not, you may want to flip the code around so that you need only one call to nsMemory::Free(dest)... sr=jst

Attachment #119647 - Flags: superreview?(jst) → superreview+

Boris Zbarsky [:bzbarsky]

Assignee

Comment 8

•

22 years ago

nsAutoPtr will call "delete" on the pointer, so that's no good here (unless I actually do a "new PRUnichar[]" and cast or something like that.... I'll flip the test around; good catch.

Simon Montagu :smontagu

Comment 9

•

22 years ago

Before I review, can you quiet my suspicions that this is a non-problem? Specifically, is non-ASCII text legal input to unescape(), and does calling unescape() with such text have defined output?

Simon Montagu :smontagu

Comment 10

•

22 years ago

... and another question (possibly for another bug) : is %DA%D1%C8%ED the correct escaping of the string in the testcase? Why not %0639%0631%0628%064A? Or possibly %D8%B9%D8%B1%D8%A8%D9%89?

Simon Montagu :smontagu

Comment 11

•

22 years ago

Comment on attachment 119647 [details] [diff] [review] Oops, that had an extra chunk in it... OK, I accept that we need this for compatibility, although we need a new bug to consider what the correct behaviour is, especially of escape(). Opera, IE, and Konqueror all render the second line of the testcase as: escaped: %u0639%u0631%u0628%u064A%20%u0639%u0631%u0628%u064A >+ // To gracefully deal with encoding issues, we have to do the following: Nit: don't split infinitives r=smontagu

Attachment #119647 - Flags: review?(smontagu) → review+

Boris Zbarsky [:bzbarsky]

Assignee

Comment 12

•

22 years ago

Comment on attachment 119647 [details] [diff] [review] Oops, that had an extra chunk in it... Waldemar, Simon asked me to make sure you were OK with this patch.. could you take a look, please?

Attachment #119647 - Flags: review+ → review?(waldemar)

Simon Montagu :smontagu

Comment 13

•

22 years ago

The "new bug" I requested in comment 11 would be a dupe of bug 44272 :-)

•

22 years ago

Attached patch Updated to tip — Details — Splinter Review

Attachment #119647 - Attachment is obsolete: true

Boris Zbarsky [:bzbarsky]

Assignee

Updated

•

22 years ago

Attachment #119647 - Flags: review?(waldemar)

Boris Zbarsky [:bzbarsky]

Assignee

Updated

•

22 years ago

Attachment #125691 - Attachment filename: 琀攀猀琀⸀瀀愀琀挀栀 → test.patch

Attachment #125691 - Flags: review?(waldemar)

Waldemar Horwat

Updated

•

22 years ago

Attachment #125691 - Flags: review?(waldemar) → review+

Boris Zbarsky [:bzbarsky]

Assignee

Comment 22

•

22 years ago

Checked in.

Status: NEW → RESOLVED

Closed: 22 years ago

Resolution: --- → FIXED

Yuying Long

Comment 23

•

22 years ago

Verified fixed in 06-17 trunk build on WinXP except the problem in bug 44272.

Status: RESOLVED → VERIFIED

You need to log in before you can comment on or make changes to this bug.