Closed Bug 172699 Opened 22 years ago Closed 22 years ago

JS UTF-8 decoder accepts overlong sequences

Tracking

()

Status:

VERIFIED FIXED

People

(Reporter: jgmyers, Assigned: rogerl)

References

Details

(Keywords: js1.5, Whiteboard: [Have filed bug 173180 against Rhino for the same issue])

Attachments

(1 file)

Proposed fix 22 years ago John G. Myers 1.25 KB, patch	rogerl : review+	Details \| Diff \| Splinter Review

John G. Myers

Reporter

Description

•

22 years ago

Another UTF-8 decoder, another instance of the same botch as bug 50702 and bug 74198. This one is in Utf8ToOneUCS4Char() in js/src/jsstr.c

John G. Myers

Reporter

Comment 1

•

22 years ago

Attached patch Proposed fix — Details — Splinter Review

Phil Schwartau

Comment 2

•

22 years ago

cc'ing reviewers for the provided patch -

Assignee: rogerl → khanson

Brendan Eich [:brendan]

Comment 3

•

22 years ago

rogerl, this looks like your code originally (jsstr.c rev 3.20) -- can you r= and get the patch in? Thanks, and thanks to jgmyers for the find and fix. /be

Assignee: khanson → rogerl

Brendan Eich [:brendan]

Updated

•

22 years ago

Keywords: js1.5, mozilla1.2

rogerl (gone)

Assignee

Comment 4

•

22 years ago

Comment on attachment 101774 [details] [diff] [review] Proposed fix r=rogerl. Phil - a test case is decodeURI("%C0%AF").charCodeAt(0) which should result in 65533

Attachment #101774 - Flags: review+

rogerl (gone)

Assignee

Comment 5

•

22 years ago

Fix checked in (inferring sr from Brendan). Waldemar - the algorithm in 15.1.3.1 doesn't address this issue, should this be raised at ECMA or leave this as a Netscape security 'extension'?

Status: NEW → RESOLVED

Closed: 22 years ago

Resolution: --- → FIXED

John G. Myers

Reporter

Comment 6

•

22 years ago

Unicode 3.1 prohibits UTF-8 decoders from accepting overlong sequences. Also see http://www.unicode.org/versions/corrigendum1.html I suggest bringing this up with ECMA, as this would appear to be an inconsistency with the Unicode standard.

rogerl (gone)

Assignee

Comment 7

•

22 years ago

Adding Waldemar for ECMA comments.

Phil Schwartau

Comment 8

•

22 years ago

Testcase added to JS testsuite: mozilla/js/tests/js1_5/Regress/regress-172699.js

Phil Schwartau

Comment 9

•

22 years ago

Marking Verified FIXED. The above testcase now passes. It used to fail as follows: *-* Testcase js1_5/Regress/regress-172699.js failed: Bug Number 172699 STATUS: UTF-8 decoder should not accept overlong sequences Failure messages were: FAILED!: Section 1 of test - FAILED!: Expected value '65533', Actual value '37' The test is currently failing in Rhino in exactly this way. I have filed bug 173180 against Rhino for this issue -

Status: RESOLVED → VERIFIED

Summary: js UTF-8 decoder accepts overlong sequences → JS UTF-8 decoder accepts overlong sequences

Whiteboard: [Have filed bug 173180 against Rhino for the same issue]

Igor Bukanov

Comment 10

•

22 years ago

But why the fix treats overlongs by replacing them by 0xFFFD and not throwing an exception like any other invalid UTF-8 would do? My understanding of http://www.unicode.org/unicode/uni2errata/UTF-8_Corrigendum.html is that overlongs are as broken as any other invalid UTF-8 sequences and should not be treated in a different way.

John G. Myers

Reporter

Comment 11

•

22 years ago

Throwing an exception would be acceptable, though it doesn't look like UTF8ToOneUCS4Char() can do that directly.

Bob Clary [:bc] (inactive)

Updated

•

20 years ago

Flags: testcase+

Masahiro YAMADA

Updated

•

15 years ago

Depends on: 511859

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Quick Search

JS UTF-8 decoder accepts overlong sequences

Categories

(Core :: JavaScript Engine, defect)

Tracking

()

People

(Reporter: jgmyers, Assigned: rogerl)

References

Details

(Keywords: js1.5, Whiteboard: [Have filed bug 173180 against Rhino for the same issue])

Crash Data

Security

(public)

User Story

Attachments

(1 file)

Description

Comment 1

Comment 2

Comment 3

Updated

Comment 4

Comment 5

Comment 6

Comment 7

Comment 8

Comment 9

Comment 10

Comment 11

Updated

Updated

Attachment

General

Description

File Name

Content Type