Closed Bug 665706 Opened 14 years ago Closed 13 years ago

semicolon in URI path handled incorrectly

Status:

RESOLVED FIXED

Milestone:

mozilla9

People

(Reporter: julian.reschke, Assigned: julian.reschke)

References

(Depends on 1 open bug, URL)

Details

(Keywords: addon-compat, dev-doc-complete)

Attachments

(1 file, 1 obsolete file)

rough patch, work in progress (obsolete), 13 years ago, Julian Reschke, 27.84 KB, patch
remove special (historic) treatment of ";" in URIs, 13 years ago, Julian Reschke, 18.97 KB, patch, jesup: review+

Julian Reschke

Assignee

Description

14 years ago

User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:2.0.1) Gecko/20100101 Firefox/4.0.1
Build Identifier:

Given a URI such as http://example.org/foo/bar;pathparam?query#frag, the URL decomposition DOM attributes return /foo/bar as pathname. It should be /foo/bar;pathparam.

See http://lists.w3.org/Archives/Public/public-iri/2011Apr/0035.html for Boris' explanation of why this happens.

Reproducible: Always
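The expected decomposition can be illustrated outside Gecko. The following is a minimal sketch that uses Python's standard `urllib.parse` as a stand-in (an assumption for illustration only, it is not the code in question): `urlsplit` has no separate "params" component, while the older `urlparse` splits `;params` off the last path segment, which resembles the behaviour reported here.

```python
from urllib.parse import urlparse, urlsplit

uri = "http://example.org/foo/bar;pathparam?query#frag"

# urlsplit has no "params" component, so ";pathparam" stays in the path.
modern = urlsplit(uri)

# urlparse splits ";params" off the last path segment (historic behaviour).
legacy = urlparse(uri)

print(modern.path)                  # /foo/bar;pathparam
print(legacy.path, legacy.params)   # /foo/bar pathparam
```

The first result is what this report asks the pathname attribute to return.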

Boris Zbarsky [:bzbarsky]

Comment 1

14 years ago

This has nothing to do with the DOM.

Status: UNCONFIRMED → NEW

Component: DOM → Networking

Ever confirmed: true

QA Contact: general → networking

Julian Reschke

Assignee

Comment 2

14 years ago

(In reply to comment #1)
> This has nothing to do with the DOM.

...but it does behave correctly network-wise...

That being said, any idea where the code causing this is located?

Julian Reschke

Assignee

Comment 3

14 years ago

OK, the relevant code is in nsURLParsers.cpp, it seems. The clean approach would be to wipe out any special treatment of ";" even in the interfaces, but I'm not sure how disruptive this would be...

Mounir Lamouri (:mounir)

Comment 4

14 years ago

(In reply to comment #3)
> OK, the relevant code is in nsURLParsers.cpp, it seems.
>
> The clean approach would be to wipe out any special treatment of ";" even in
> the interfaces, but I'm not sure how disruptive this would be...

Do not hesitate to give it a try: if you are able to write a patch that seems correct to you, attach it here for review. Don't be scared of doing something wrong: the reviewer will catch it.

OS: Windows 7 → All

Hardware: x86 → All

Version: unspecified → Trunk

Julian Reschke

Assignee

Comment 5

13 years ago

Attached patch rough patch, work in progress (obsolete)

This compiles, and seems to work.

TODO: escapes ";" in paths while it doesn't have to
TODO: update test cases

Mounir Lamouri (:mounir)

Updated

13 years ago

Assignee: nobody → julian.reschke

Julian Reschke

Assignee

Comment 6

13 years ago

Attached patch remove special (historic) treatment of ";" in URIs

- removes "param" component from URL interfaces
- updates nsEscape not to escape ";"
- fixes test case that assumed that http://example.com/; and http://example.com; should be the same
- fixes two unmaintained CPP based test cases to compile, but did not fix them otherwise (see related bug 677248)
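The escaping change in the second item can be sketched by analogy. The snippet below uses Python's `urllib.parse.quote` as a stand-in for nsEscape (an assumption for illustration; the two are unrelated implementations) to show the difference between escaping ";" in a path and leaving it alone.

```python
from urllib.parse import quote

path = "/foo/bar;pathparam"

# With the default safe set ("/" only), ";" is percent-encoded.
escaped = quote(path)

# Adding ";" to the safe set leaves it untouched in the path.
preserved = quote(path, safe="/;")

print(escaped)    # /foo/bar%3Bpathparam
print(preserved)  # /foo/bar;pathparam
```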

Attachment #551346 - Attachment is obsolete: true

Attachment #551508 - Flags: review?(rjesup)

Julian Reschke

Assignee

Updated

13 years ago

Target Milestone: --- → mozilla9

Randell Jesup [:jesup] (needinfo me)

Comment 7

13 years ago

Comment on attachment 551508
remove special (historic) treatment of ";" in URIs

r=me on all but the http://example.com; issue

Reading RFC 3986, it doesn't appear valid to end an authority section with ';' - does the uri1.spec = "http://example.com;" fail? (Of course, we may not force compliance with the BNF.)

Do we fail if you put other not-allowed characters in that position? (Or rather, does it now do the same with ';' as it does with other incorrect authority characters?) If it is the same handling as other incorrect ones, then r=me on the entire thing. If we're not handling it the same as other incorrect characters, then r-. If we're handling incorrect characters in authority incorrectly but consistently, then please file a new bug on that and r=me on this.

I'll mark it r+ for now

Attachment #551508 - Flags: review?(rjesup) → review+

Masatoshi Kimura [:emk]

Comment 8

13 years ago

> authority = [ userinfo "@" ] host [ ":" port ]
> host = IP-literal / IPv4address / reg-name
> reg-name = *( unreserved / pct-encoded / sub-delims )
> sub-delims = "!" / "$" / "&" / "'" / "(" / ")"
>            / "*" / "+" / "," / ";" / "="

";" looks to be allowed for the authority part.
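The quoted productions can be checked mechanically. This is a minimal sketch, assuming a direct transcription of the RFC 3986 `reg-name` rule into a Python regular expression; the helper name `is_reg_name` is hypothetical and covers only `reg-name`, not IP literals or IPv4 addresses.

```python
import re

# Character classes transcribed from RFC 3986 (sections 2.1 to 2.3).
UNRESERVED = r"A-Za-z0-9\-._~"
SUB_DELIMS = r"!$&'()*+,;="
PCT_ENCODED = r"%[0-9A-Fa-f]{2}"

# reg-name = *( unreserved / pct-encoded / sub-delims )
REG_NAME = re.compile(rf"(?:[{UNRESERVED}{SUB_DELIMS}]|{PCT_ENCODED})*")


def is_reg_name(host: str) -> bool:
    """Return True if host matches the reg-name production (hypothetical helper)."""
    return REG_NAME.fullmatch(host) is not None


print(is_reg_name("example.com;"))  # True: ";" is a sub-delim
print(is_reg_name("example.com{"))  # False: "{" is not allowed
```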

Julian Reschke

Assignee

Comment 9

13 years ago

(In reply to Randell Jesup [:jesup] from comment #7)
> Reading RFC 3986, it doesn't appear valid to end an authority section with
> ';' - does the uri1.spec = "http://example.com;" fail? (Of course, we may
> not force compliance with the BNF.)

Yes, ";" is allowed in the authority component.

But the primary reason why this change should be ok is that ";" has no special meaning in the URI; it doesn't affect the way it's split into components. Thus, it's handled just like "a", for instance. This makes it different from the other things that were tested in that test class; those were about characters that are indeed special, such as "?" and "#".

> Do we fail if you put other not-allowed characters in that position? (Or
> rather, does it now do the same with ';' as it does with other incorrect
> authority characters?) If it is the same handling as other incorrect ones,
> then r=me on the entire thing. If we're not handling it the same as other
> incorrect characters, then r-. If we're handling incorrect characters in
> authority incorrectly but consistently, then please file a new bug on that
> and r=me on this.
>
> I'll mark it r+ for now

As stated above, it *is* allowed. That being said, for the purpose of URI parsing it is handled exactly like something that is not allowed, such as "{"; it's just not special, and thus ends up in the component it appears in.

Does this clarify?
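To make the point about component splitting concrete, here is a minimal sketch that uses the generic splitting regular expression from RFC 3986, Appendix B. It is not Gecko's parser; the function name `split_uri` is hypothetical. Only ":", "/", "?" and "#" act as delimiters, so ";" and "{" both stay in whichever component they appear in.

```python
import re

# Component-splitting regular expression from RFC 3986, Appendix B.
URI_RE = re.compile(r"^(([^:/?#]+):)?(//([^/?#]*))?([^?#]*)(\?([^#]*))?(#(.*))?")


def split_uri(uri: str) -> dict:
    """Split a URI reference into its five generic components (hypothetical helper)."""
    m = URI_RE.match(uri)  # every group is optional, so this always matches
    return {
        "scheme": m.group(2),
        "authority": m.group(4),
        "path": m.group(5),
        "query": m.group(7),
        "fragment": m.group(9),
    }


# ";" is not a delimiter: it lands in the authority or the path as written.
print(split_uri("http://example.com;")["authority"])   # example.com;
print(split_uri("http://example.com/;")["path"])       # /;
# "{" is treated the same way, even though the grammar does not allow it.
print(split_uri("http://example.com{")["authority"])   # example.com{
# "?" is special: it ends the authority and starts the query.
print(split_uri("http://example.com?x")["authority"])  # example.com
```

This is why http://example.com; and http://example.com/; are different URIs: in the first, ";" is part of the authority, in the second, part of the path.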

Julian Reschke

Assignee

Updated

13 years ago

Status: NEW → ASSIGNED

Keywords: checkin-needed

Mounir Lamouri (:mounir)

Updated

13 years ago

Flags: in-testsuite+

Keywords: checkin-needed (removed)

Whiteboard: [inbound]

Mounir Lamouri (:mounir)

Comment 10

13 years ago

Pushed: http://hg.mozilla.org/mozilla-central/rev/1a09781a5480

Status: ASSIGNED → RESOLVED

Closed: 13 years ago

Resolution: --- → FIXED

Whiteboard: [inbound]

Masatoshi Kimura [:emk]

Updated

13 years ago

Blocks: 682762

Boris Zbarsky [:bzbarsky]

Updated

13 years ago

No longer blocks: 682762

Depends on: 682762

Boris Zbarsky [:bzbarsky]

Comment 11

13 years ago

Why was this checked in without super-review? It's got API changes! I filed bug 682845 on the obvious sr-level issues a cursory glance shows.

Boris Zbarsky [:bzbarsky]

Updated

13 years ago

Depends on: 682845

Boris Zbarsky [:bzbarsky]

Comment 12

13 years ago

We probably need to document the API changes here, at least.

Keywords: dev-doc-needed

Jorge Villalobos [:jorgev] (he/him)

Updated

13 years ago

Keywords: addon-compat

Eric Shepherd [:sheppy]

Comment 13

13 years ago

Documentation updated:

https://developer.mozilla.org/en/nsIURL
https://developer.mozilla.org/en/XPCOM_Interface_Reference/nsIURLParser

Also mentioned on Firefox 9 for developers and Updating add-ons for Firefox 9.

Keywords: dev-doc-needed → dev-doc-complete

Daniel Stenberg [:bagder]

Comment 14

13 years ago

Ok, isn't this so that with this change now Firefox accepts this URI?

"www.example.com;foo=bar"

Is that really intended? It seems to me that Firefox is now rather lonely in doing this interpretation and I think it hurts us all to have this different opinion on how to parse URIs...

Julian Reschke

Assignee

Comment 15

13 years ago

(In reply to Daniel Stenberg from comment #14)
> Ok, isn't this so that with this change now Firefox accepts this URI?
>
> "www.example.com;foo=bar"

That's not the intent, and I believe it accepted those before; it just treated them differently.

> Is that really intended? It seems to me that Firefox is now rather lonely in
> doing this interpretation and I think it hurts us all to have this different
> opinion on how to parse URIs...

If you think that ";" in authorities should get special treatment, you may be right; maybe open a separate bug?

Daniel Stenberg [:bagder]

Comment 16

13 years ago

Thanks, I've submitted #713472 as a new bug for this.

:Gavin Sharp [email: gavin@gavinsharp.com]

Updated

13 years ago

Depends on: 732567

Ryan VanderMeulen [:RyanVM]

Updated

11 years ago

Blocks: 713472
