Closed Bug 61107 Opened 24 years ago Closed 23 years ago

bad link interpretation in utf-8 document (Build ID: 2000101014)

Tracking

()

Status:

VERIFIED INVALID

Milestone:

Future

People

(Reporter: st.pecha, Assigned: harishd)

Details

(Keywords: testcase)

Attachments

(1 file)

reduced testcase 24 years ago David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla) 338 bytes, text/html		Details

Stanislav Pecha

Reporter

Description

•

24 years ago

Page: http://gw.internet.pb.cz/noconv/spoluzaci/index2.php?lang=cs two links in menu: 1.http://gw.internet.pb.cz/noconv/spoluzaci/vyber.php?did=[some number]%E2%8C%A9=en 2.http://gw.internet.pb.cz/noconv/spoluzaci/nova.php?lang=en&did=[some number] Both links are same but 1st link has switch did first and lang 2nd. Second link has lang 1st and did 2nd. When is lang 2nd mozilla change string '&lang' to '%E2%8C%A9' It do only with switch named 'lang' if it is not used like first switch in anchor All the page is in UTF-8 charcode. When i look on page source, all is fine. It is critical for my web (I must rebuild it) but not for Mozilla. It should be something worse in Mozilla code and it's better to check it. IE 5.5 has no problem with this.

Keyser Sose

Comment 1

•

24 years ago

Reporter is this still a problem in the latest nightlies?

Keyser Sose

Comment 2

•

24 years ago

WORKSFORME Platform: PC OS: Linux 2.2.16 Mozilla Build: 20002808 M18 Trunk Build Marking as such.

Keyser Sose

Comment 3

•

24 years ago

Really marking WORKSFORME.

Status: UNCONFIRMED → RESOLVED

Closed: 24 years ago

Resolution: --- → WORKSFORME

Stanislav Pecha

Reporter

Comment 4

•

24 years ago

Build ID: 2000122820 still do it. http://gw.internet.pb.cz/spoluzaci/index2.php?lang=en

Status: RESOLVED → UNCONFIRMED

Resolution: WORKSFORME → ---

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Comment 5

•

24 years ago

Confirming in Linux build 2001-01-02-11-Mtrunk. I'm not sure that we follow the links incorrectly but they certainly show up wrong in the toolbar.

Status: UNCONFIRMED → NEW

Component: HTML Element → Internationalization

Ever confirmed: true

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Comment 6

•

24 years ago

Attached file reduced testcase — Details

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Comment 7

•

24 years ago

Oops. I see the problem now. This has nothing to do with Layout's mucking about with URI character encodings... This bug is probably invalid, but if it's not it's a parser issue. The problem is that you need to escape & in URLs within HTML to &. Otherwise we interpret &lang as the HTML entity for a left angle bracket. Harish - you should probably check what the correct set of termination characters for entities is. I'm not sure we implement that correctly.

Assignee: clayton → harishd

Component: Internationalization → Parser

QA Contact: lorca → janc

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Comment 8

•

24 years ago

er, entity references, not entities

harishd

Assignee

Comment 9

•

24 years ago

David, the bug ( not sure if it's a bug! ) is confined only to the strict documents. Removing the DOCTYPE should exhibit the same behavior as in Nav 4.x and IE. Btw, entity parsing, for attributes, happens in the content sink.

Status: NEW → ASSIGNED

harishd

Assignee

Updated

•

24 years ago

Target Milestone: --- → mozilla0.9.1

harishd

Assignee

Comment 10

•

24 years ago

This bug has been marked "future" because the original netscape engineer working on this is over-burdened. If you feel this is an error, that you or another known resource will be working on this bug,or if it blocks your work in some way -- please attach your concern to the bug for reconsideration -----

Target Milestone: mozilla0.9.1 → Future

Stanislav Pecha

Reporter

Updated

•

24 years ago

URL: http://gw.internet.pb.cz/noconv/spolu...

bsharma

Comment 11

•

24 years ago

updated qa contact.

QA Contact: janc → bsharma

Moied

Updated

•

23 years ago

QA Contact: bsharma → moied

Christopher Hoess (gone)

Comment 12

•

23 years ago

Per SGML, the part of the general entity reference between the "&" and the reference end must be name characters (the actual production is a bit more complicated, but this is all we have to deal with). The reference end is normally ; or a record end, but it may be omitted "if the reference is not followed by a character that could occur in the reference". This is indeed the case (since "=" cannot appear in names), the entity is terminated, and our parsing is correct. INVALID.

Status: ASSIGNED → RESOLVED

Closed: 24 years ago → 23 years ago

Keywords: testcase

Resolution: --- → INVALID

Moied

Comment 13

•

23 years ago

Verified invalid

Status: RESOLVED → VERIFIED

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Quick Search

bad link interpretation in utf-8 document (Build ID: 2000101014)

Categories

(Core :: DOM: HTML Parser, defect, P3)

Tracking

()

People

(Reporter: st.pecha, Assigned: harishd)

References

Details

(Keywords: testcase)

Crash Data

Security

(public)

User Story

Attachments

(1 file)

Description

Comment 1

Comment 2

Comment 3

Comment 4

Comment 5

Comment 6

Comment 7

Comment 8

Comment 9

Updated

Comment 10

Updated

Comment 11

Updated

Comment 12

Comment 13

Attachment

General

Description

File Name

Content Type