Closed Bug 87996 Opened 24 years ago Closed 24 years ago

Anchors with special characters do not work properly

Tracking

Status:

RESOLVED FIXED

Milestone:

mozilla0.9.7

People

(Reporter: eh, Assigned: adamlock)

Attachments

(6 files)

Testcase showing space problems | 24 years ago | John Keiser (jkeiser) | 1.02 KB, text/html
Fixed test case | 24 years ago | Adam Lock | 1.16 KB, patch
Patch fixes case 1, notes follow | 24 years ago | Adam Lock | 1.41 KB, patch
How about this instead? (untested) | 24 years ago | Johnny Stenback (:jst) | 2.13 KB, patch
Same as above, but only unescape name for anchors, as jkeiser pointed out to me. | 24 years ago | Johnny Stenback (:jst) | 2.34 KB, patch | john: review+, rpotts: superreview+
Ok, we can do that too :-) | 24 years ago | Johnny Stenback (:jst) | 2.41 KB, patch

Reporter

Description

24 years ago

From Bugzilla Helper:
User-Agent: Mozilla/5.0 (X11; U; Linux 2.2.16-22 i586; en-US; rv:0.9.1) Gecko/20010608
BuildID: 2001060810

The standards (at least HTML 4.01) limit what can be included in a NAME attribute: "ID and NAME tokens must begin with a letter ([A-Za-z]) and may be followed by any number of letters, digits ([0-9]), hyphens ("-"), underscores ("_"), colons (":"), and periods (".")." (http://www.w3.org/TR/html4/types.html#type-cdata) My example doesn't quite fit those limitations, but it's strange that if I open a link with special characters in the fragment identifier in a new window, it works just fine (goes where it's supposed to), whereas if I try to use a link like this within the page, it doesn't do anything at all. So either these special characters in the fragment identifier (#foo) should be supported or they shouldn't, but the way it works (and doesn't work) now is completely illogical.

Reproducible: Always

Steps to Reproduce:
1. Go to a page with an anchor with special characters (and a link to that anchor)
2. Try to press the link (doesn't do anything)
3. Try to open the link in a new window (opens the page and scrolls straight to the anchor)

Actual Results: Described above

Expected Results: Either it should ignore anchors with special characters all the time or it should fully support them (the better idea, IMHO).
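As a rough illustration of the HTML 4.01 token rule quoted above, the following Python sketch checks candidate anchor names against that grammar. The function name `is_valid_name_token` is hypothetical and the check is not taken from any browser; it only shows that names containing `%` or spaces fall outside the quoted rule.

```python
import re

# HTML 4.01 ID/NAME token rule as quoted in the description: a letter,
# then any number of letters, digits, hyphens, underscores, colons, periods.
NAME_TOKEN = re.compile(r"[A-Za-z][A-Za-z0-9\-_:.]*")


def is_valid_name_token(value: str) -> bool:
    """Return True if value matches the quoted ID/NAME token grammar.

    Hypothetical helper for illustration only.
    """
    return NAME_TOKEN.fullmatch(value) is not None


if __name__ == "__main__":
    for candidate in ("section-1.2", "make%20install", "make install"):
        print(candidate, is_valid_name_token(candidate))
```

Only the first candidate conforms; the other two contain a character (`%` or a space) that the quoted rule does not allow.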

Asa Dotzler [:asa]

Comment 1

24 years ago

-> HTML Element

Assignee: asa → clayton

Status: UNCONFIRMED → NEW

Component: Browser-General → HTML Element

Ever confirmed: true

QA Contact: doronr → bsharma

Boris Zbarsky [:bzbarsky]

Comment 2

24 years ago

Linux build 2001-06-26-06

I see this with the link to http://www.ifsociety.com/herodishonest/lyrics.php#Don%27t+you+fucking+get+it%3F

Now, the name attribute of the target anchor is invalid (apart from using weird chars): it should not be URL-escaped. Replacing %27 with ' and %3F with ? in the name attribute (but not the href attribute, which _should_ be escaped) makes this link work fine in Mozilla in both cases...

We should still behave consistently, however. Over to docshell.

Assignee: clayton → adamlock

Component: HTML Element → Embedding: Docshell

QA Contact: bsharma → adamlock

Peter Davis

Comment 3

24 years ago

This works in things like Sun's javadoc http://java.sun.com/j2se/1.4/docs/api/index.html (where there are weird chars (parens, spaces) that are not URL-encoded) and completely breaks in, for example, the New Hacker Dictionary, where anchors often have spaces. Weird.

John Keiser (jkeiser)

Comment 4

24 years ago

The Perl manpages all show this problem. What's happening there is that the link says http://www.perldoc.com/perl5.6/lib/ExtUtils/MakeMaker.html#make%20install and the anchor says <a NAME="make%20install">. (I checked the source.) Not being able to jump around in the Perl manpages makes this a larger problem, IMO, since anything *anywhere* generated by Perldoc will exhibit this problem, and that means all Perl packages.

While these are identical, I suspect Mozilla is translating one to "make install" and not the other, and thus is unable to find it. It seems like there would be zero negative impact and 100% positive if we translated %20 to space (among others), since right now it's *impossible* to find these links with Mozilla (if I understand the problem right).

The only case where it could affect browser behavior in an unexpected way is if someone had *both* <A NAME="make install"> and <A NAME="make%20install"> in their page. IMO, the way to fix this problem is to politely shoot the HTML developer in question.
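The matching behaviour proposed in this comment can be sketched in Python as follows. This is an assumption-laden illustration, not Mozilla's code: `find_anchor` is a hypothetical helper, and it uses the standard library's `urllib.parse.unquote`, which turns `%20` into a space but leaves `+` alone.

```python
from urllib.parse import unquote


def find_anchor(fragment, anchor_names):
    """Return the first anchor name matching fragment, or None.

    Hypothetical helper: tries an exact match first, then compares with
    BOTH sides unescaped, so "#make%20install" finds NAME="make%20install"
    as well as NAME="make install".
    """
    # Pass 1: literal comparison keeps exact matches unambiguous.
    for name in anchor_names:
        if name == fragment:
            return name
    # Pass 2: unescape the fragment and each anchor name the same way.
    target = unquote(fragment)
    for name in anchor_names:
        if unquote(name) == target:
            return name
    return None


if __name__ == "__main__":
    names = ["top", "make%20install", "make_clean"]
    print(find_anchor("make%20install", names))
    print(find_anchor("make install", names))
```

Trying the exact match first means a page that really does contain both `NAME="make install"` and `NAME="make%20install"` still resolves each link to its literal target.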

John Keiser (jkeiser)

Comment 5

24 years ago

I just made a test page that includes all three variants of link ("#make%20install", "#make+install", "#make install") and all three variants of <A NAME=...> (same three variants). None of the three links does anything at all. Specifying them on the browser address bar doesn't work either.

So now the question is: how do you put a space into <A NAME> and have it work? My answer is: read A NAME the same way you read URIs. I think the spec intends the same, but I can't even find the place in the spec where it says %20 is legal in a URI; maybe you can. It does seem to be implied by the excerpted text below, however.

http://www.w3.org/TR/html4/appendix/notes.html#non-ascii-chars , which admittedly describes handling an illegal case, seems to indicate that both URIs and the NAME attribute of anchors should be escaped in the same way:

1. Represent each character in UTF-8 (see [RFC2279]) as one or more bytes.
2. Escape these bytes with the URI escaping mechanism (i.e., by converting each byte to %HH, where HH is the hexadecimal notation of the byte value).

They even include a special note: "The same conversion based on UTF-8 should be applied to values of the name attribute for the A element."

If they were being escaped in the same way in Mozilla, I doubt this bug would be here.
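The two conversion steps quoted above from the HTML 4.01 notes can be sketched in Python as below. The function name `escape_non_ascii` is hypothetical, and the sketch assumes that only non-ASCII characters are converted, since that is the case the quoted passage addresses; ASCII characters, including spaces, pass through unchanged.

```python
def escape_non_ascii(value: str) -> str:
    """Apply the quoted two-step conversion to non-ASCII characters.

    Hypothetical helper: step 1 encodes the character as UTF-8 bytes,
    step 2 writes each byte as %HH. ASCII characters are left as they are.
    """
    out = []
    for ch in value:
        if ord(ch) < 128:
            out.append(ch)
        else:
            # Step 1: UTF-8 bytes; step 2: %HH for each byte.
            out.extend(f"%{byte:02X}" for byte in ch.encode("utf-8"))
    return "".join(out)


if __name__ == "__main__":
    print(escape_non_ascii("caf\u00e9"))  # prints caf%C3%A9
```

Applying the same function to both the href fragment and the anchor's name value is one way to read the spec's note that the two should be converted identically.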

John Keiser (jkeiser)

Comment 6

24 years ago

Attached file: Testcase showing space problems

Alfonso Martinez

Comment 7

24 years ago

*** Bug 94361 has been marked as a duplicate of this bug. ***

Sebastian Biallas

Comment 8

24 years ago

Urlencoded anchors often appear in texinfo documents converted to HTML, as stated by John Keiser (I currently have this problem). A node like "Some node" will become #some%20node in an HTML document; these nodes work properly in Netscape 4.x, so I'd recommend adding the 4xp keyword.

Suomi Hasler, Ayni AG

Comment 9

24 years ago

Check out your own last two examples and you will see that it still does not work:

<li><a href="lyrics.php#We+give%2C+they+take">We give, they take</a>
<li><a href="lyrics.php#Won%27t+be+defeated">Won't be defeated</a>

suomi

Jörg Heinicke

Comment 10

24 years ago

*** Bug 96218 has been marked as a duplicate of this bug. ***

David Illsley

Comment 11

24 years ago

*** Bug 100531 has been marked as a duplicate of this bug. ***