Closed Bug 146441 Opened 23 years ago Closed 21 years ago

Anchors and BASE conflict

Status:

RESOLVED DUPLICATE of bug 275689

People

(Reporter: donguana, Assigned: harishd)

Details

(Keywords: testcase)

Attachments

(1 file)

Sample of the base and anchor conflict (text/html, 744 bytes), attached by Ian 23 years ago

Ian

Reporter

Description

23 years ago

I'm not sure whether this is a bug; I checked the W3C specs and could not find any information on it. Suppose I create an HTML page containing images and want to send it to someone. I add a <BASE> tag so that the recipient sees the page as intended, with the images loaded from my server. However, the BASE tag breaks the anchors that link to content within the page. In this case only the images are stored on the server, not the HTML itself, so when the user clicks an anchor, the browser goes to my server and gets a "page not found" error. I tested this with other browsers and they all behave the same way, but I don't think that behavior is correct. I will send a sample as an attachment.
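
The behavior described above can be reproduced outside a browser. The following is a minimal sketch using Python's `urllib.parse.urljoin`, which resolves a relative reference against a base URL. The host `images.example.com` and the file names are hypothetical stand-ins for the reporter's server and page.

```python
from urllib.parse import urljoin

# Hypothetical value of <BASE href="..."> in the mailed page.
BASE_HREF = "http://images.example.com/newsletter/"

# The image resolves against the base, which is what the author wants.
image_url = urljoin(BASE_HREF, "logo.gif")

# A fragment-only link resolves against the same base, so it points at the
# server (where the page itself is not stored) instead of the open document.
anchor_url = urljoin(BASE_HREF, "#section2")

print(image_url)
print(anchor_url)
```

The second URL is the one that would produce the "page not found" error, since only the images live under the base.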

Ian

Reporter

Comment 1

23 years ago

Attached file: Sample of the base and anchor conflict

Christopher Hoess (gone)

Comment 2

23 years ago

<URL:http://www.w3.org/TR/html401/struct/links.html#h-12.4> Almost certainly invalid.

Christopher Hoess (gone)

Comment 3

23 years ago

RFC 1808 explicitly states the current behavior to be correct, but RFC 2396 is less clear. bz?

Christopher Hoess (gone)

Comment 4

23 years ago

Scratch that, this is valid. <URL:http://lists.w3.org/Archives/Public/www-html/2001Dec/0045.html> ->Networking

Assignee: harishd → new-network-bugs

Status: UNCONFIRMED → NEW

Component: Parser → Networking

Ever confirmed: true

QA Contact: moied → benc

Ian

Reporter

Comment 5

23 years ago

On the page specified above there is the following text: "When the link is defined in the same document: #anchor-one" If the link is in the same document then why should it append the base-source to the anchor?

Boris Zbarsky [:bzbarsky]

Comment 6

23 years ago

Ugh. RFC 2396 makes this suck... <a href="#foo"> and <form action="#foo"> should load different(!) things, according to that, if the base href and the document href differ.

Severity: normal → major

OS: Windows 2000 → All

Hardware: PC → All
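
One reading of the distinction drawn in Comment 6, as a hedged sketch: under RFC 2396 section 4.2, an empty or fragment-only reference stays within the current document unless its context always triggers a new request (as a form submission does). The function name `resolve_reference`, its parameters, and the example URLs are assumptions made for illustration; none of this is Mozilla code.

```python
from urllib.parse import urldefrag, urljoin


def resolve_reference(ref, document_url, base_url, always_new_request):
    """Resolve ref following one reading of RFC 2396 section 4.2."""
    same_document = ref == "" or ref.startswith("#")
    if same_document and not always_new_request:
        # Same-document reference: keep the document's own URL and only
        # swap in the new fragment (if any).
        return urldefrag(document_url).url + ref
    # Everything else, including form actions, resolves against the base.
    return urljoin(base_url, ref)


DOCUMENT = "http://mail.example.org/inbox/message.html"  # hypothetical
BASE = "http://images.example.com/newsletter/"           # hypothetical

# <a href="#foo">: stays in the current document.
link_target = resolve_reference("#foo", DOCUMENT, BASE, always_new_request=False)

# <form action="#foo">: always a new request, so the base URL is used.
form_target = resolve_reference("#foo", DOCUMENT, BASE, always_new_request=True)

print(link_target)
print(form_target)
```

With a base href that differs from the document URL, the same string "#foo" ends up at two different targets, which is the inconsistency the comment objects to.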

Christopher Hoess (gone)

Comment 7

23 years ago

EXPN? (And action="#foo" isn't a very common idiom, is it?)

Boris Zbarsky [:bzbarsky]

Comment 8

23 years ago

What does EXPN mean? And action="" also means a different thing from href="" according to that RFC. The former (action="") _is_ pretty common.

Christopher Hoess (gone)

Comment 9

23 years ago

EXPN == "explain", an SMTP-ism. Having re-read RFC 2396, I can only assume there's a good reason for section 4.2...

benc

Comment 10

23 years ago

+cc andreas.

Andreas Otte

Comment 11

23 years ago

We should go with RFC 2396; we follow it nearly everywhere else in necko. However, this cannot be solved within the scope of networking/necko. Necko takes a base URL and a string and resolves them into a new URL (::Resolve). Which URL is passed as the base is up to the caller, which in this case I think is the htmlparser or layout/content. This part of RFC 2396 has to be implemented there.

Christopher Hoess (gone)

Comment 12

23 years ago

*** Bug 136811 has been marked as a duplicate of this bug. ***

Christopher Hoess (gone)

Comment 13

23 years ago

->Parser

Assignee: new-network-bugs → harishd

Component: Networking → Parser

Keywords: testcase

QA Contact: benc → moied

Ian

Reporter

Comment 14

23 years ago

I did some more testing, and this is also related to the MIME header "Content-Base:" in an email. I believe "Content-Base:" is similar to using the <BASE> tag, though I'm not sure this assumption is correct. Here is a test to demonstrate the problem:

1 - Go to the following URL: http://www.mozilla.org/quality/browser/standards/html/a_name.html#anchor
2 - Choose File -> Send Page.
3 - Send the page to yourself.
4 - Check your mail.
5 - Put your browser in offline mode.
6 - Look at the page that you just sent to yourself and follow the instructions in the test.
7 - You will not be able to follow the link because of the anchor.

Is there any way around this bug?

dsirnapalli

Updated

23 years ago

Priority: -- → P2

QA Contact: moied → dsirnapalli

Bob Dingman

Comment 15

22 years ago

As another test case for the original bug (not the MIME-header-related phenomenon), have a look at the "<>" hypertext link in the "Abnormal Examples" section of http://www.ics.uci.edu/%7Efielding/url/test1.html

In this example, <base href="http://a/b/c/d;p?q"> is embedded in the document, yet Mozilla 1.3 uses it to resolve the URI. Sections 4.2, 5.2 step 2), and C.2 of RFC 2396 clearly state that "An empty reference refers to the start of the current document." Presumably, this behaviour holds irrespective of whatever base URI exists. This contradicts RFC 1808, which states: "An empty reference resolves to the complete base URL . . ." W3C HTML 4.01 states that RFC 2396 "updates" RFC 1808. If we assume "updates" means that RFC 2396 takes precedence over RFC 1808, then Mozilla 1.3 is non-compliant with the newer spec.

What confuses the issue for me, however, is the "however" caveat in section 4.2 of RFC 2396. The "however" does not appear limited to HTML FORMs:

[from RFC 2396]
4.2. Same-document References
A URI reference that does not contain a URI is a reference to the current document. In other words, an empty URI reference within a document is interpreted as a reference to the start of that document, and a reference containing only a fragment identifier is a reference to the identified fragment of that document. Traversal of such a reference should not result in an additional retrieval action. However, if the URI reference occurs in a context that is always intended to result in a new request, as in the case of HTML's FORM element, then an empty URI reference represents the base URI of the current document and should be replaced by that URI when transformed into a request.

If one were to interpret this to mean that any URI reference occurs "in a context that is always intended to result in a new request", then the empty URI should always be replaced by the base URI of the current document. I don't think that is what was intended by this caveat, but it raises the question of how an HTML parser is supposed to know that level of detail about a URI's context. This material is new to me, so maybe the context is more apparent than I realize, but the spec seems a little ambiguous on this point. Any thoughts?
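
The two readings of an empty reference contrasted above can be put side by side. In this sketch, Python's `urllib.parse.urljoin` returns the base URL unchanged for an empty reference, which matches the RFC 1808 wording quoted above. The RFC 2396 reading is modeled by a hypothetical helper, `empty_reference_rfc2396`, and the document URL is made up for illustration.

```python
from urllib.parse import urldefrag, urljoin

BASE = "http://a/b/c/d;p?q"                       # base href from the test page
DOCUMENT = "http://www.example.org/test1.html#x"  # hypothetical document URL

# RFC 1808 reading: an empty reference resolves to the complete base URL.
rfc1808_target = urljoin(BASE, "")


def empty_reference_rfc2396(document_url):
    """RFC 2396 reading: an empty reference is the start of the current document."""
    return urldefrag(document_url).url


rfc2396_target = empty_reference_rfc2396(DOCUMENT)

print(rfc1808_target)
print(rfc2396_target)
```

Which of the two applies depends on whether the link's context counts as one "always intended to result in a new request", which is the ambiguity raised in this comment.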

Anne (:annevk)

Comment 16

21 years ago

*** Bug 221915 has been marked as a duplicate of this bug. ***

Sjoerd Visscher

Comment 17

21 years ago

*** This bug has been marked as a duplicate of 275689 ***

Status: NEW → RESOLVED

Closed: 21 years ago

Resolution: --- → DUPLICATE
