Closed Bug 22942 (entities) Opened 25 years ago Closed 13 years ago

Load external DTDs (entity/entities) (local and remote) if a pref is set

Categories

(Core :: XML, defect, P3)

Tracking

RESOLVED WONTFIX

People

(Reporter: nisheeth_mozilla, Unassigned)

References

Details

Attachments

(2 files, 5 obsolete files)

Currently, XML DTDs are loaded if they are pointed to by a chrome URL or if they
are placed in a special local directory.  We need to extend this functionality
so that all local and remote DTDs get loaded if the user sets a pref.
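For illustration, a minimal pair of files that such a pref would affect (the
file names, URL and entity name here are hypothetical):

   x.xml: <?xml version="1.0"?>
          <!DOCTYPE doc SYSTEM "http://example.org/dtds/doc.dtd">
          <doc>&greeting;</doc>
   doc.dtd: <!ENTITY greeting "Hello">

Today &greeting; only resolves if doc.dtd is reachable via a chrome URL or sits
in the special directory; with the pref set, it would also be fetched from the
remote URL.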
Status: NEW → ASSIGNED
Target Milestone: M14
Setting milestone to M14...
spam: added self to cc list as this might affect my realm.
*** Bug 11538 has been marked as a duplicate of this bug. ***
Not a beta blocker.  Setting milestone to M15...
Target Milestone: M14 → M15
...and the pref should be "on" by default.

The "off" mode is mostly useful to Mozilla components like MathML (or other
vendors) that can get their "corporate" DTD auto-installed in the client's
special dtd directory. Web designers out there, by contrast, will not have the
privilege of getting their DTD auto-installed on the user's side.
Moving bugs out by one milestone...
Target Milestone: M15 → M16
Will look at this post beta 2...
Target Milestone: M16 → M17
Marking M18...
Target Milestone: M17 → M18
This bug has been marked "future" because the original Netscape engineer working 
on this is over-burdened. If you feel this is an error, that you or another
known resource will be working on this bug, or if it blocks your work in some way 
-- please attach your concern to the bug for reconsideration.
Target Milestone: M18 → Future
In bug 11538 this problem was proposed to be fixed by a modification to the 
expat glue.

Is this still the way to fix it?

I want to be able to develop multi-lingual apps using XPFE, and since XUL loads 
over http nicely I'd like to load the language-specific DTD stuff over http 
too.  I still don't understand how to get chrome://myfirstxulapp/locale/file.dtd 
to resolve to an http: address rather than a local address.

Since I have a vested interest in this bug, if it is the 'preventer', and if I 
can understand the solution, I can probably work on it ...

HELP!
added myself to cc:
Vidur Apparao is exploring the possibility of implementing synchronous XML 
document loading over HTTP in his XML Extras component.  Once he's done, his 
code could serve as a resource for implementing synchronous DTD loading over 
HTTP.  I suggest that you sync up with him.  Please feel free to take ownership 
of this bug if you want to retarget the milestone to something earlier than 
"Future", which means post Netscape 6.0.
QA Contact: chrisd → petersen
Suggest: all/all  for platform/OS.
OS: Windows NT → All
Hardware: PC → All
adding myself to cc:
*** Bug 68615 has been marked as a duplicate of this bug. ***
Anything new on this issue? DTD over the wire would be great.
*** Bug 69799 has been marked as a duplicate of this bug. ***
For the record, what is the "special local directory"?
bin/dtd
If the external DTD isn't loaded then Mozilla won't be able to navigate to
elements by id (e.g. http://someserver/somedir/somefile.xml#someid) for that file.
The workaround is to move the external DTD into the internal subset.
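A minimal sketch of that workaround (the element and attribute names are
hypothetical): the ID declaration that would normally live in the external DTD
is inlined in the DOCTYPE instead:

   <!DOCTYPE doc [
     <!ATTLIST section name ID #REQUIRED>
   ]>
   <doc>
     <section name="intro">...</section>
   </doc>

With the declaration in the internal subset, somefile.xml#intro navigation
works even though no external DTD is ever fetched.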
I agree that is the current workaround and I thank you for pointing it out here
and on n.p.p.browser. However, that isn't really an acceptable workaround if the
number of element declarations with ID attributes is large and/or the person
linking to the document doesn't have write access to the document.
The code for mangling dtd URLs is in
http://lxr.mozilla.org/seamonkey/source/htmlparser/src/nsExpatTokenizer.cpp#815,
right?
Could we somehow fix this to make it support XML for file: URLs?
Maybe leave the code as is, and if it can't find the file in dtd, check for 
IsScheme("file", isLoadable)?
We XSLT folks would really like to have most of Mozilla's XML support, at least
for local files, so we can run tests and benchmarks from third parties.
Like the DocBook XSLT stylesheets, for example.
For XUL files it would also be great to load external DTDs, especially via http
to enable remote XUL apps.

The behavior for XUL should IMHO not be bound to a pref. If you think it should,
consider two flags: one for XUL (remote app) and one for arbitrary XML.
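A sketch of what two such flags might look like in a Mozilla prefs file (the
pref names here are hypothetical, nothing like them exists yet):

   pref("xml.load_external_dtds", false);   // arbitrary XML documents
   pref("xul.load_remote_dtds", true);      // remote XUL apps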
Mitch, per comment #24, am I right that security considerations would emerge
when enabling the loading of external DTDs for XUL files via http?
Considering security for DTD via http for XUL (comments #24 and #25):

The access could be restricted (as usual) to DTDs served from the originating
server.

Though this might be a problem for "standard" DTDs, e.g. from the W3C. A remote
XUL app would have to rely on the availability of a local copy.

PS: What is the status of synchronous loading mentioned in comment #12?
Blocks: remote-xul
*** Bug 145507 has been marked as a duplicate of this bug. ***
*** Bug 153603 has been marked as a duplicate of this bug. ***
Re: comment #22

The workaround is for users to copy the DTD file into Mozilla's special res/dtd
directory. More precisely, if the URI of the original DTD file is
"protocol://long-path/to/filename.ext", then save the DTD file as "filename.ext"
(i.e., the same basename) in the local res/dtd directory. From there on, Mozilla
will re-associate the external DTD with the local copy it has whenever a DOCTYPE
with that DTD is encountered.
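For example (URL and file name hypothetical), given a document starting with

   <!DOCTYPE doc SYSTEM "http://example.org/dtds/mydoc.dtd">

saving a copy of that DTD as res/dtd/mydoc.dtd makes Mozilla load the local
file in place of the remote one.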
*** Bug 161096 has been marked as a duplicate of this bug. ***
*** Bug 181231 has been marked as a duplicate of this bug. ***
When can we work this out?
QA Contact: petersen → rakeshmishra
Blocks: 114376
Changing summary to make it easier to find this bug.
Summary: Load external DTDs (local and remote) if a pref is set → Load external DTDs (local and remote) if a pref is set/implement validating XML parser
*** Bug 178308 has been marked as a duplicate of this bug. ***
So just a question, and please, I don't intend to sound arrogant, since I have not contributed 
anything to Mozilla (except for some bug reports), but when will this bug be solved? Is it really 
so difficult to solve this little bug? 
Solving this bug requires either switching to a different XML parser completely
or rewriting the existing XML parser to be validating.  Which part of that is a
"little bug"?
bz, IIRC, this is not as bad as you indicate it is.
The main problem is a good strategy for performance.
Marking a dependency on XML catalogs, which should get rid of the requirement
to load dtds for some xml files, and getting that list to be extensible.

About validation, being non-validating just says that we are not required to
load external DTDs, not that we must not.

On the expat side of things, we may have to block the parser in the external
entity ref handler, or even cache the results of it.

All I can say is, DTDs work fine from chrome, and with a little patch, from file:// too.

(oops, just noticed that heikki made this bug a bit about validating parsers,
which is something completely different, IMHO. Shouldn't that be a futured bug
with a dependency on this one in some way, so folks finding one can get to
this one if they just look for DTDs?)
Depends on: xmlcatalog
*** Bug 196188 has been marked as a duplicate of this bug. ***
bz, Christian asked if it is a difficult bug to fix. He didn't imply he thinks it is.

Anyway, I don't get what loading of external DTDs has to do with validating them.
I can see issues with blocking I/O over the net, but no requirement to validate
the DTDs. Mozilla does not validate them for chrome, so why validate them when
they come over the net?

Suggest dropping the suffix from the subject.
Broke validation into bug 196355.
Summary: Load external DTDs (local and remote) if a pref is set/implement validating XML parser → Load external DTDs (local and remote) if a pref is set
*** Bug 201352 has been marked as a duplicate of this bug. ***
Now this patch looks horrible, and it violates most of our coding conventions
etc. This is just the first patch that seemed to work with trivially simple
testcase:

   x.xml: <!DOCTYPE doc SYSTEM "x.dtd"><doc>&hello;</doc>
   x.dtd: <!ENTITY hello "Hello There">

I fully expect this to not work if there are several XML files being loaded at
the same time, or the DTD includes other DTDs. It probably messes up
internalSubset etc. Also this approach blocks the UI completely until the DTD
load finishes. There is not even a pref to set, this tries to load remote DTDs
unconditionally.
Taking, but don't expect anything soon.
Assignee: nisheeth → heikki
Status: ASSIGNED → NEW
Target Milestone: Future → ---
Cc:ing alecf & darin, maybe it should be possible these days to use a
"background" stream that doesn't block the UI.
Not that I know of (I would have done it with XMLHttpRequest and document.load()
synchronous portions otherwise, and the XBL syncloader would do it as well).

However, I thought of another way for this case that should work without
blocking the UI. Our Expat can be blocked without blocking the UI. Therefore, it
should be possible to block the parser when we start to load external DTD
asynchronously, and unblock it once the asynchronous load finishes. Something to
keep in mind is the fact that DTDs can also load other DTDs, so we would need to
block & unblock the parser(s) loading DTDs as necessary.
Some pointers for the "background" stream: bug 11232, bug 190730.

Another thing to consider with the other alternative of blocking the parser is
that some DTDs can be huge.
Personally I'm strongly against having a pref. On any site that depends on
external DTDs, this pref would act as a "make Mozilla fail" pref, which seems
pretty useless. Which user would ever _not_ want to load external DTDs?

We could possibly have a compile-time switch for embedders
We can KIND OF do this in the background by at least letting the DTD load on a
background thread, but I don't think that buys us much since we'd still have to
block the XML parser. Using NS_BackgroundInputStream in combination with fixing
bug 197114 would get you a minor win...
Blocks: 69799
Just a question: how hard would it be to implement XML catalogs in Mozilla? 
From reading all the discussion it seems that catalogs would be the way to go, 
and that it would be much easier to teach people about adding to a catalog than 
about some of the other things suggested in this discussion.
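For context, a catalog is just a mapping file; a minimal OASIS XML catalog
sketch (the system identifier and local path are hypothetical) that redirects a
remote DTD to a local copy would look like:

   <?xml version="1.0"?>
   <catalog xmlns="urn:oasis:names:tc:entity:xmlns:xml:catalog">
     <system systemId="http://example.org/dtds/doc.dtd"
             uri="file:///usr/share/dtds/doc.dtd"/>
   </catalog>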
XML catalogs are bug 98413.
Alias: entities
*** Bug 207874 has been marked as a duplicate of this bug. ***
I agree with comment #48, being able to load external DTDs should not depend on
a preference setting. Is there any estimate of when this might eventually get
fixed? I am still unable to view
<http://www.w3.org/2000/xp/Group/2/06/LC/soap12-part1.xml>, which I think is a
bit embarrassing for Mozilla (what, Mozilla cannot load a W3C document? ;-)).
QA Contact: rakeshmishra → ashishbhatt
I also agree with comment #48 and comment #53.  

Most users who have the most to gain from XML will not understand what a DTD is
or why they should have to make a change to their preferences in order to
support it.

It is now more important than ever that Mozilla is seen to be totally W3C
compliant as regards XML.  
*** Bug 202291 has been marked as a duplicate of this bug. ***
*** Bug 225949 has been marked as a duplicate of this bug. ***
(In reply to comment #29)
> Re: comment #22
> 
> Workaround is for users to copy the DTD file in the special Mozilla's res/dtd
> directory...

If I have a DTD which itself includes an external parameter entity file it
seems that only the DTD file and not the subsequent parameter entity file
can be loaded using this workaround. Does anyone have a better understanding
of what is supposed to happen in this case?
(In reply to comment #57)
> (In reply to comment #29)
> If I have a DTD which itself includes an external parameter entity file it
> seems that only the DTD file and not the subsequent parameter entity file
> can be loaded using this workaround. Does anyone have a better understanding
> of what is supposed to happen in this case?

Put the entities as well in the special local directory?
*** Bug 225487 has been marked as a duplicate of this bug. ***
Summary: Load external DTDs (local and remote) if a pref is set → Load external DTDs (entity/entities) (local and remote) if a pref is set
This blocks RSS files containing entities defined in an external DTD (RSS 0.91).
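For reference, such feeds start with a DOCTYPE along these lines (this is the
customary RSS 0.91 declaration; the DTD defines character entities like &deg;):

   <!DOCTYPE rss PUBLIC "-//Netscape Communications//DTD RSS 0.91//EN"
             "http://my.netscape.com/publish/formats/rss-0.91.dtd">

Because the external DTD is never fetched, every entity it defines is unknown
to the parser and loading the feed aborts with an error.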
Blocks: 267375
Blocks: 267350
I'm a total n00b in the field of asynchronous XML parsing and UI blocking
stuff. But would it be possible to have the XML document display while the
external DTDs and entities are loaded, and when an entity is loaded it replaces
the previously displayed entity 'placeholder'? Like an <img ..> tag that is
still loading?
No, because to check for well-formedness (which must be done) entities have to
be expanded as they are found. For example,

   <test>
     &foo;
   </test>

...with:

   <!ENTITY foo "<test>">

...needs to trigger a well-formedness error when &foo; is expanded. AIUI.
*** Bug 267375 has been marked as a duplicate of this bug. ***
Ian, according to the XML spec it is possible to load the xml doc without
loading the external dtd. You would just have unresolved entityrefs in the DOM.
Of course, it would require mozilla DOM support for entityrefs.
And <!ENTITY foo "<test>"> is invalid, because entities must be well-formed
independent XML.
I don't think that is true. According to
http://w3c.org/TR/2004/REC-xml11-20040204/#NT-EntityValue
entity values just can't contain % and &; for the rest anything is valid, and
Ian is right that referencing &foo; would trigger a well-formedness error if
there is no &slashfoo; reference either (</foo>).

Isn't it possible to defer well-formedness and validity checking until after the
loading, while still displaying some placeholder or just &foo; while the
document is still loading? If after loading all entities there are still
undefined references or the resulting document is not well-formed, then throw an
error.
It might be confusing though for users to see their document load and suddenly
get an error. Maybe XML wfness and undefined entity errors should be
displayed in the same way that javascript errors are. Although wfness errors are
usually more critical than undefined entities.
Wouldn't rule 3 from http://w3c.org/TR/2004/REC-xml11-20040204/#sec-well-formed
mean they have to be well-formed? As I understand it, unparsed entities are not
expanded, but just made available via the entity reference. (if this is wrong,
then I'm sorry, but reading too much of the XML spec gives me a headache ;) )
Silver is correct, each parsed entity must be well-formed. If this were not the
case the DOM entityrefs spec would be useless.

It is not possible to delay well-formedness checking, because it is a question
of parsing and affects document structure. We use a non-validating parser, so we
don't have to worry about validity. We *can* use non-resolved DOM entityrefs to
do delayed loading of external DTDs, but the expat parser does not currently
support this approach, and it would require significant changes in the mozilla
core DOM. Not impossible, but a serious undertaking.
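A minimal sketch of the distinction being discussed (entity names are
hypothetical): both declarations below parse, but rule 3 of the
well-formedness rules cited above only bites when the entity is referenced:

   <!ENTITY para "<p>hello</p>">  <!-- OK: replacement text is well-formed on its own -->
   <!ENTITY open "<p>">           <!-- parses, but referencing &open; in content is a fatal error -->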
I stand corrected.
*** Bug 288767 has been marked as a duplicate of this bug. ***
See bug #299682 for my suggestion about preferences unification for different
kinds of external resources, including DTDs.
*** Bug 305877 has been marked as a duplicate of this bug. ***
Assignee: hjtoi-bugzilla → peterv
Depends on: 274777
Attached patch wip (obsolete) — Splinter Review
Some edge cases probably don't work yet.
Need to extract some of the parser changes in smaller patches.
Need to add a pref.
Personally I'm against having a pref for this. Authors need to be able to depend on this feature being there otherwise it's pretty useless. Additionally, no user is going to know what the heck that pref is.

I guess we could make it a hidden pref, but I don't really see the point in that.
What's the status of this? Did it get any easier to fix after the landing of Expat 1.95.8? The successful fix of this bug would make some XML work I'm doing much easier :)
This bug still exists in Firefox 1.5.0.1.
See for example the W3C XML Namespace specification (XML version):
http://www.w3.org/TR/2004/REC-xml-names11-20040204/REC-xml-names11-20040204.xml
It's a pity to have to resort to IE.
The status is that some pieces of the patch have been landed (bug 323299) and I need to sync it with trunk.
And please, no more comments like comment 76, they're not useful.
Status: NEW → ASSIGNED
Would it be possible to make the DTD search path configurable for a single XMLHttpRequest?  The reason I ask this is regarding the Newsfox plugin http://newsfox.mozdev.org/ which breaks when an entity such as &deg; is included with a proper RSS 0.91 feed.  Being able to tell it to search in chrome://newsfox/DTDs or some such would allow the problem to be overcome.

While on Win32 you can install to the res/dtd dir, on *nix you would generally require root access to install there.
I'm not sure exactly what you mean, but in any event that's a different bug.
Re: comment 79

Basically, because external DTDs aren't loaded, and we can't reliably install new ones to the res/dtd dir, some way of working around this bug would be very handy (will open a new feature request).
Any news on this bug yet?

C'mon guys, this was first logged in Jan 2000! There's plenty of votes, plenty of people commenting, and there's been plenty of patience. Can't someone change the milestone and put some effort in this?

(I know I'm going to get flamed and told that this comment "is not useful", but so far this bug has stagnated and NO comment seems to have helped)
What's the main reason this is not yet resolved?  I admit I know little about Mozilla's/expat's parsing of XML, but as *some* external entities are loaded, why can't this restriction simply be lifted so *all* external entities will be recognized?

If you all finally agree this should get in, why can't we simply load any DTDs the way chrome ones and those in res/dtd are loaded?  While there are already some patches, I'd volunteer to work on this if that's all that is needed here.
(In reply to comment #82)
> What's the main reason this is not yet resolved?  I admit I know little about
> Mozilla's/expat's parsing of XML, but as *some* external entities are loaded,
> why can't this restriction simply be lifted so *all* external entities will be
> recognized?

External DTDs need to be loaded asynchronously, so you need to block Expat while loading the DTD. Expat does not support that currently, mainly because it's non-trivial. It's unclear whether the patch I have is complete.
So the difference here between chrome/res DTDs and general external DTDs is that for the local ones you can assume they load quickly and blocking is OK, while for the others you cannot?
Attached patch v1 (obsolete) — Splinter Review
This probably regresses bug 61630 and bug 191482, so need to figure out a solution for that.
Attachment #120224 - Attachment is obsolete: true
Attachment #202535 - Attachment is obsolete: true
Attached patch v1.1 (obsolete) — Splinter Review
Fix the two issues mentioned in comment 85.
Attachment #267628 - Attachment is obsolete: true
Attached patch v1.2 (obsolete) — Splinter Review
I've used a same-origin policy for loading the DTDs for now; chrome and the known local DTDs can still be loaded by anyone as before. If a DTD can't be loaded (because of a different origin, a denied redirect, or failed authentication) we continue parsing, though you'll probably still see an error because of missing entities then. Errors in the DTD, or recursively loading the same entity, do get reported and stop parsing.
I'll ask mrbkap to take a look at the nsParser changes too.
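To illustrate the same-origin rule described above (URLs hypothetical):

   http://example.com/doc.xml -> SYSTEM "http://example.com/doc.dtd"    loaded
   http://example.com/doc.xml -> SYSTEM "http://other.example/doc.dtd"  refused; parsing continues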
Attachment #267712 - Attachment is obsolete: true
Attachment #268004 - Flags: superreview?(jst)
Attachment #268004 - Flags: review?(jst)
Comment on attachment 268004 [details] [diff] [review]
v1.2

Will post a new patch that doesn't recurse for DTDs loading DTDs.
Attachment #268004 - Flags: superreview?(jst)
Attachment #268004 - Flags: review?(jst)
Attached patch v1.3Splinter Review
Attachment #268004 - Attachment is obsolete: true
Attachment #269859 - Flags: review?(mrbkap)
Attachment #269859 - Flags: review?(mrbkap) → review+
Quite a while ago, I submitted this bug report.  The original problem I reported was that an error occurs when viewing an XML document with an entity reference that is defined in a DTD referenced by the DTD attached to the XML document.  Secondly, if you use a workaround for the entity reference problem, the application of the XSLT stylesheet fails to produce the desired result.  I'm concerned that some of the issues raised by the initial report have been forgotten.

After expanding the attachment, point your browser at simpdoc/simpdoc.xml.

The README in this attachment includes an extended description of the problem.  The contents of the README follow.

Simpdoc is a stylesheet for using XML to produce XHTML documents with
automatically numbered sections and references, and an automatically
generated table of contents.  It was designed with the idea that the
stylesheet would be applied within a browser, and one would publish
content by making the XML document, its DTD, and the stylesheets,
accessible.  Browsers would be given the URL for the XML document.

For Firefox, there are currently two problems when attempting to view
the XML document.  The XML reader doesn't understand the &copy; entity
reference even though it is defined in files referenced by the DTD.
If one replaces the &copy; entity reference with &#0169;, one can view
the document, but the XSLT stylesheet's transformations fail to
produce the numbered sections, references, and table of contents.

The enclosed Java program validates the document and correctly applies
the transformation.  See the GNUmakefile for instructions on how to
run the program behind a proxy.
Attachment #269859 - Flags: superreview?(bzbarsky)
Comment on attachment 269859 [details] [diff] [review]
v1.3

>Index: content/base/src/nsContentSink.cpp
>   if (mCanInterruptParser) {
>-    mDocument->UnblockOnload(PR_TRUE);
>+    UnblockOnload();

So to be honest... I think we can just nuke this code, no?

nsDocument::BeginLoad and nsDocument::EndLoad block and unblock onload respectively.  These are called for everything except XUL.  XUL does its own load blocking in PrepareToWalk and EndLoad.  So I don't think you need these even there.

I meant to file a bug on removing it some time back and forgot to.

>Index: parser/expat/lib/expat.h

The new members could use some serious documenting.  What do they actually mean?  Where are the magic "2" values coming from?  This is very hard to review as-is.

I'd really like those docs before I look at this further.
Comment on attachment 269859 [details] [diff] [review]
v1.3

sr- pending that documentation
Attachment #269859 - Flags: superreview?(bzbarsky) → superreview-
Comment on attachment 269859 [details] [diff] [review]
v1.3

added approval1.9 request
Attachment #269859 - Flags: approval1.9?
Uh... you generally want to have reviews before doing that.
Blocks: tomtom
Am I right in thinking that this won't land for Fx3? If it won't, is it something that could make it into a point release, or is it something that won't be seen in the wild until Fx4?

I'm about to start work on a large remote-XUL application in the XUL dark matter world, for which Fx3 or above (or Prism) will be a requirement. This issue affects the way in which I'll handle localisation, so even a vague idea of a timescale would let me know whether external DTDs will be a practical solution, or if I'll have to do something server-side instead.
I hate to make a comment without having the ability to do something to help, but if there is one bug fix that would make a whole lot of people happy (if the 74 votes weren't a good indication), I think it must be this one. (that and https://bugzilla.mozilla.org/show_bug.cgi?id=267350 ) There is so much data we're missing out on in our favorite browser because of this... 
Please don't WONTFIX this - or at least not without defining some useful way of handling localisation for Remote XUL (running on an intranet, in our case).

Presently our Remote XUL app deals with translations by using DTD files for entities which are inserted inline into each page on the server-side. Basically every page carries the whole DTD for the language with it, which has a huge effect on the amount of data we transfer. If this bug ever gets fixed, we'll be able to instead insert a link to the DTD, significantly reducing this burden (assuming the DTD itself gets cached).

There may be issues with DTDs on the web at large, but as far as I know there's presently no recommended way to deal with localisation of Remote XUL. Fixing this issue (for XUL, anyway) at least provides a viable option, whereas a WONTFIX would leave Remote XUL even further out in the cold than it already is.
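A sketch of what a fix would allow for this localisation use case (the host,
file and entity names here are hypothetical):

   <?xml version="1.0"?>
   <!DOCTYPE window SYSTEM "http://intranet.example/locale/de/app.dtd">
   <window xmlns="http://www.mozilla.org/keymaster/gatekeeper/there.is.only.xul"
           title="&app.title;">
     <label value="&app.greeting;"/>
   </window>

with app.dtd containing only the translations:

   <!ENTITY app.title "Beispielanwendung">
   <!ENTITY app.greeting "Hallo">

so each page carries a short DTD reference instead of a full inline copy of
every entity.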
Henri, the articles are irrelevant, because they talk about validation (and fetching the DTD instead of using built-in, local versions of the DTDs).
This bug is mainly needed for remote XUL, desperately so. We need to be able to use the same XUL locally and remotely and not entirely rewrite the XUL source when putting it on a server. Thus, in practical effect, this bug has prevented usage of remote XUL in several bigger commercial projects.
The articles don't talk about validation. They talk about loading DTDs in a Web context (mainly for character entities).

For the localization use case, the server could be made aware of the user's locale (e.g. via Accept-Language) and the server could parse the document with the locale-specific DTD and reserialize the resulting infoset as DTDless XML.

(Using XUL box model with HTML5 markup seems like an approach that would be more compatible with other HTML user agents than using XUL markup remotely.)
Well, your article does talk about validation. Usually, each remote XUL document
would have its own DTD file, located on the same server - so what's the point of
talking about a single point of failure?

And while everybody here is aware of alternative approaches to localization
(which all have their advantages and disadvantages), localizing remote XUL in
the same way local XUL can be localized would still be a great improvement.
The single point of failure issue isn't about XUL. It's about enabling DTD loading for XML in general (which would include XHTML, SVG and MathML).
Note that currently the patch enforces the same origin policy on the DTD loading, making single point of failure less of an issue.
By such logic (about having no external DTDs), FF should not support stylesheets or external scripts because some browsers might not use them or know how to use them, or because some people link to the stylesheets used on external sites.

This is also relevant for allowing document creators to work within their document applications using convenient shortcuts (entities) in a language-neutral way. To give an example, here is a page I created in my own translation of XHTML code into Chinese equivalent (i.e., a "Chinese XHTML", where the tags themselves have a one-to-one correspondence with XHTML (besides allowing CSS-as-XML to allow CSS to be internationalized as well) but which use the Chinese script): http://bahai-library.com/zamir/chintest9.xml . This works with a stylesheet to convert it into the "English" XHTML which browsers can render. (If CSS were comprehensive enough to cover things like forcing a tag to display as a form, CSS could be used instead of XSL.)  If DTD parsing were supported, one could conveniently redefine XHTML entities such that entities such as &nbsp; could instead be represented with Chinese character equivalents (or other language equivalents)--without having to add all possible entities to the internal subset.

A solution for entities wishing to make DTD's available (as opposed to DTD's making entities available!) without facing undue burdens on their servers, is simply not to make their DTD(s) available as files which can be linked to (or serve them differently perhaps). And couldn't the PUBLIC identifier be relied on to preclude or limit external loading for well known dialects? Isn't that its purpose?

But in some cases, sites such as Yahoo have even actually encouraged people to point to scripts for reuse at their site, so no doubt some sites (with deep pockets or small communities) would similarly want to even encourage reuse of their own DTDs without users needing to save and define them locally.

As far as there being a problem with external DTD parsing being optional, while obviously using one will break things for some people, since people are already using them anyways, why not set the bar higher instead of lower and try to implement it so that the incentive for others to implement becomes higher?

As far as Tim Bray's statement you cite, I happen to disagree with that. Why couldn't browsers cache entity files that it finds referenced? Anyhow, if document creators put documents with DTDs on the web (which they are already using offline), it is certainly better to be able to load a document in some form than to get an error message! If the users will feel burned by having choppy rendering, they can urge the document creator to let a script preprocess and render it dynamically with entities resolved, but I don't see how it's our business to tell people (including myself and apparently a good number of others based on the popularity of this bug who'd just like to put their XML documents out there without having to rewrite them) how they must use XML and prevent them from sharing documents already in wide use off of the web. If DTD's are cached, this could even SPEED UP rendering in some cases, especially if external entities could be used.

As others have also said, I hope this will be enabled by default, and only be DISABLE-able by a preference.

This is not only about XUL either, while that is one good argument. This is about the EXTENSIBLE Markup Language that allows people to have the freedom to conveniently make their own applications and share them. This is also relevant to XML languages which are already standards, like TEI or DocBook, as well as new applications.

And why do people need to make a false dichotomy between web and non-web? If I have a document that works offline, why not be able to share it online? Respectfully, what's so difficult to understand about that? I also think that it is a very limiting assumption to say that web users only wish to use X/HTML/SVG/MathML. Let a hundred standards bloom!
(In reply to comment #97)
> I suggest WONTFIXing this.
> 
> See
> http://hsivonen.iki.fi/no-dtd/
> http://groups.google.com/group/mozilla.dev.tech.mathml/browse_thread/thread/e7f7efbb5e161348/8d64a935fe730de7

The “single point of failure” argument doesn’t seem to hold water in light of comment #103 and DTD catalogs. You could also deal with new DTDs by implementing a (DTD) update service similar to the ones used for Firefox add‐ons, browser updates, or Live Bookmarks.

The fact that the feature is optional is a problem for documents that must be used in multiple browsers, but having the feature is certainly better than nothing and implementing it may encourage other vendors to do the same. This is pretty much the same process for /anything/ a browser vendor is the first to implement (even when the feature is required, not optional, per a given specification).

Anyway, I think this feature is more useful than for just character references which can already be dealt with via numeric character references or UTF-8. One example of where I’d find this feature useful is in reducing the verbosity of repeated code; e.g., repeated occurrences of |<abbr title="Extensible Hypertext Markup Language">XHTML</abbr>| could be changed to the much less verbose and more human‐readable |&XHTML;| via |<!ENTITY XHTML "<abbr title='Extensible Hypertext Markup Language'>XHTML</abbr>">|. Another example: |<a href="&YTV;jGUQDdfr2ZQ">&YTV;jGUQDdfr2ZQ</a>| via |<!ENTITY YTV "http://www.youtube.com/watch?v=">|. I think that you could also use this feature to implement external CSS style sheets with constants/variables (until something like CSS Variables are implemented, at least).
While comment #104 is the real reason why this should be implemented, few people will use that. Comment #105 is what most web authors will use this feature for, me included, and it has awesome potential.
(In reply to comment #103)
> Note that currently the patch enforces the same origin policy on the DTD
> loading, making single point of failure less of an issue.

I didn't realize that when first commenting. Yeah, same origin takes away the DDoS and single point of failure issues. However, it also precludes the use of well-known DTDs for character entities. (Relaxing the same origin policy using Access-Control would reintroduce the DDoS and the single point of failure problems.)

Enforcing the same origin policy (and it indeed needs to be enforced in the general case to avoid data leakage) means that authors can only use DTDs as a macro mechanism where they themselves host the expansions. When the entity references and the entity definitions come from the same origin, the usefulness of late expansion in the browser is greatly diminished. It seems to me that what is left is a rather small gain at the cost of introducing an incompatibility of Fatal Error proportions with previous Gecko versions and other browsers and making it prohibitively harder to kill DTDs in a future version of XML.

When the remote XUL and the DTDs come from the same origin, the entities could be expanded by the server at the cost of breaking the cacheability of the XUL document. Doing so wouldn't cause a "Fatal Error" level of incompatibility with previous Gecko versions, though, and wouldn't make the Web dependent on external DTD processing.

(In reply to comment #104)
> By such logic (about having no external DTDs), FF should not support
> stylesheets or external scripts because some browsers might not use them or
> know how to use them, or because some people link to the stylesheets used on
> external sites.

Style sheets are optional by design. Style sheets also provide more usefulness to the interoperable Web platform than external DTD processing would.

> This is also relevant for allowing document creators to work within their
> document applications using convenient shortcuts (entities) in a language
> neutral way. To give an example, here is a page I created in my own translation
> of XHTML code into Chinese equivalent (i.e., a "Chinese XHTML", where the tags
> themselves have a one-to-one correspondence with XHTML (besides allowing
> CSS-as-XML to allow CSS to be internationalized as well) but which use the
> Chinese script): http://bahai-library.com/zamir/chintest9.xml . This works with
> a stylesheet to convert it into the "English" XHTML which browsers can render.
> (If CSS were comprehensive enough to cover things like forcing a tag to display
> as a form, CSS could be used instead of XSL.).  If DTD parsing were supported,
> one could conveniently redefine XHTML entities such that
> entities such as &nbsp; could instead be represented with Chinese character
> equivalents (or other language equivalents)--without having to add all possible
> entities to the internal subset.

You can do the substitutions on the server side. Sending home-grown vocabularies without well-known semantics over the public Web breaks processing based on well-known semantics, which leads to bad Babelization of markup.

> A solution for entities wishing to make DTD's available (as opposed to DTD's
> making entities available!) without facing undue burdens on their servers, is
> simply not to make their DTD(s) available as files which can be linked to (or
> serve them differently perhaps). 

I fail to see how it is an undue burden to expand entities on the server. Servers hosting Web apps do much more complex tasks all the time.

> And couldn't the PUBLIC identifier be relied
> on to preclude or limit external loading for well known dialects? Isn't that
> its purpose?

The public id is a legacy construct from pre-URI SGML era.

> But in some cases, sites such as Yahoo have even actually encouraged people to
> point to scripts for reuse at their site, so no doubt some sites (with deep
> pockets or small communities) would similarly want to even encourage reuse of
> their own DTDs without users needing to save and define them locally.

Preventing data leakage would require the use of Access-Control, which would reintroduce the DDoS on www.w3.org.
http://dev.w3.org/2006/waf/access-control/

> As far as there being a problem with external DTD parsing being optional, while
> obviously using one will break things for some people, since people are already
> using them anyways, why not set the bar higher instead of lower and try to
> implement it so that the incentive for others to implement becomes higher?

People can't be already relying on external DTDs in Web content (even if they refer to external DTDs due to copying and pasting from a W3C example) as the top three browsers that support XHTML and SVG don't load external DTDs.

If fetching external DTDs is introduced to the Web platform, we can never get rid of the feature once people start relying on it. It would be a shame not to be able to kill DTDs in a future version of XML, because DTDs represent the vast majority of complexity of XML (both spec and implementations) but DTDs represent the tiny minority of usefulness of XML. (Indeed, the trend is clearly away from DTDs in XML vocabulary design just about everywhere outside the XHTML2 WG.)

Moreover, for a couple of years now, just about every proposal of what a new major revision of XML should be like makes killing DTDs or killing external DTDs a point of improvement. Clearly, the way the wind is blowing is away from DTDs.

> As far as Tim Bray's statement you cite, I happen to disagree with that. Why
> couldn't browsers cache entity files that it finds referenced?

Caching the bytes doesn't remove the perf hit when parsing. It also doesn't remove the DDoS problem when first fetching a well-known DTD.

> This is not only about XUL either, while that is one good argument. This is
> about the EXTENSIBLE Markup Language that allows people to have the freedom to
> conveniently make their own applications and share them.

Not using well-known vocabularies is bad for semantic-dependent processing e.g. for accessibility and search.

> This is also relevant
> to XML languages which are already standards, like TEI or DocBook, as well as
> new applications.

TEI and DocBook aren't Web languages. XHTML5, SVG and MathML are. New applications of XML are pretty much always DTDless (outside the XHTML2 WG).

> And why do people need to make a false dichotomy between web and non-web?

Non-Web doesn't burden browsers, so it isn't a concern when considering what browsers need to keep supporting for decades if not centuries to come.

> Let a hundred standards bloom!

That goes against the very point of having a standard.

(In reply to comment #105)
> You could also deal with new DTDs by
> implementing a (DTD) update service similar to the ones used for Firefox
> add‐ons, browser updates, or Live Bookmarks.

I think the cost/benefit ratio of introducing an update mechanism to support a sunsetting legacy feature of XML is unfavorable.
 
> The fact that the feature is optional is a problem for documents that must be
> used in multiple browsers, but having the feature is certainly better than
> nothing and implementing it may encourage other vendors to do the same. 

That's part of the problem. If other vendors implement this too, a new version of XML can't remove DTDs.
> When the remote XUL and the DTDs come from the same origin, 
> the entities could be expanded by the server at the cost 
> of breaking the cacheability of the XUL document.

You throw around the idea of expanding entities on the server as though it is a panacea that will make this bug redundant. Unfortunately this is not the case:

1) As you mention, it breaks the cacheability of the XUL document
2) It breaks code compatibility between local and remote XUL
3) It assumes the availability of a server-side processing environment, precluding the use of a static web server
4) It assumes that the developer actually knows how to code such a system
5) Even if they do, it adds an extra burden on them to write the code
6) It introduces non-portable code that needs to be re-written for each supported server-side language
7) It automatically places a larger burden on the server even if it's not always appropriate to do so (see 3, for example)


If the answer to external references is to just "expand them on the server" then why do we have separate CSS files - their content could just be injected into style attributes on the page. Separate Javascript files? Let's just inject them directly into <script> blocks. Heck, while we're at it we may as well convert all image references into their corresponding data URLs and inject them as well.


"Fix it on the server" should be an option, not a requirement.
> (In reply to comment #103)
> > Note that currently the patch enforces the same origin policy on the DTD
> > loading, making single point of failure less of an issue.
>
> I didn't realize that when first commenting. Yeah, same origin takes away the
> DDoS and single point of failure issues. However, it also precludes the use of
> well-known DTDs for character entities. (Relaxing the same origin policy using
> Access-Control would reintroduce the DDoS and the single point of failure
> problems.)
>
> Enforcing the same origin policy (and it indeed needs to be enforced in the
> general case to avoid data leakage) means that authors can only use DTDs as a
> macro mechanism where they themselves host the expansions. When the entity
> references and the entity definitions come from the same origin, the usefulness
> of late expansion in the browser is greatly diminished. It seems to me that
> what is left is a rather small gain at the cost of introducing an
> incompatibility of Fatal Error proportions with previous Gecko versions and
> other browsers and making it prohibitively harder to kill DTDs in a future
> version of XML.

As far as incompatibilities with previous Gecko versions, it seems most Gecko users upgrade eventually anyways ( http://en.wikipedia.org/wiki/Mozilla_Firefox#Market_adoption ), so I don't see this as a very big issue, especially when we're talking about documents which are likely to be a relatively niche interest. While this might affect access to new XUL applications from older Gecko-based browsers, so is also the case with new extensions which only work with the latest Gecko version.

As far as incompatibilities with other browsers, yes they won't work in some of them--until other browsers decide to support the feature. It should be noted that one other important browser, IE, already DOES by default allow parsing of entities in a referenced DTD (for XML), so the "other browsers" argument is, I think, made a little weaker thereby. Also, allowing it also enables people to easily access documents which were created and consumed originally for off-web uses (or for use in some environments with server-side preprocessing but which are shared in other environments); to those who have made such documents--this is hardly a small gain. And, as far as XUL applications which rely on them, these won't work in non-Gecko browsers anyways.

Nevertheless, I do concede there is this significant paragraph from the spec:

"For maximum reliability in interoperating between different XML processors, applications which use non-validating processors SHOULD NOT rely on any behaviors not required of such processors. Applications which require DTD facilities not related to validation (such as the declaration of default attributes and internal entities that are or may be specified in external entities) SHOULD use validating XML processors."

    (Section 5.2 at http://www.w3.org/TR/xml/#safe-behavior )

Some might, however, take this as an argument to also add DTD validation (for XML), as IE can also be made to do... And these are not "MUST"'s either.

The most persuasive argument here against DTD support I think is what you say about it being prohibitively harder to kill DTDs in a future version of XML, or rather the corollary to that, that if they are killed off, document creators who've taken the time to put their documents on the web may suffer, as support for them gets dropped (assuming a new XML would be backwards compatible). However, I tend to think that once people become familiar with the concept of a technology, they may find it EASIER to move to a new way of doing it, if they've had a chance to use it in some form already (and a new XML could still use the same syntax for entities potentially), which is not to speak of the benefits now.

(Hopefully a future version of XML would not preclude external entities in some form--whether integrated within attached schemas or not, and have it be required or at least encouraged by the spec--entities/macros/aliases seem to have little relation to validation while still being very helpful to those preparing documents for quick sharing and who don't like typing a lot.)

> When the remote XUL and the DTDs come from the same origin, the entities could
> be expanded by the server at the cost of breaking the cacheability of the XUL
> document. Doing so wouldn't cause a "Fatal Error" level of incompatibility with
> previous Gecko versions, though, and wouldn't make the Web dependent on
> external DTD processing.

I heartily agree with the commenter for #108 on this one. Whole XML languages like XSL and XForms were designed in part to make things simpler for average people precisely by avoiding the _dependency_ on a server or server-side scripting languages.


> (In reply to comment #104)

>> By such logic (about having no external DTDs), FF should not support
>> stylesheets or external scripts because some browsers might not use them or
>> know how to use them, or because some people link to the stylesheets used on
>> external sites.
>
> Style sheets are optional by design.

Sorry I came off a little stronger in my emphasis than I meant to on this; I do understand your reasoning, even while I still disagree strongly with the conclusion. While  a DTD failure is more serious (though presumably a user of pure XML is going to be more savvy when they see such a failure than the average HTML user) and style sheets are indeed optional by design, please also consider that even stylesheets (not to mention scripts), while they are, per accessibility guidelines, supposed to enhance functionality, often inevitably end up being created by some users in such a way as to make them essential for understanding a page (such as when using absolute positioning); by allowing this technology, it also allows for the possibility of a single point of failure.

While this sometimes dependency may be relatively infrequent with stylesheets (not to mention a practice to be discouraged), potential dependencies on JavaScript being created as a result of it being allowed with XHTML is much harder to avoid. Even accessibility guidelines speak about trying to improve accessibility of scripts and not unconditionally forego them if a dependency may be created. Given that not every user agent can implement every feature one might think suitable for the web, sometimes modularity will mean that some things do not work everywhere. The important thing is open standards for new features and not preventing the possibility of single points of failure at any cost (if that were the case, SVG should never have been added to Firefox or even remote XUL, XForms should not have been introduced, etc.).

> Style sheets also provide more usefulness to
> the interoperable Web platform than external DTD processing would.

For most document viewers (besides those who like to learn from source code or those who could take advantage of browser caching), the benefits would probably be less than for stylesheets, but for document creators, especially those who simply want to put an XML document online which could be viewed as a tree (while being made discoverable and processable via the web by remote applications in the process) or for users of pure XHTML familiar with entities who want to write quickly, they may be more concerned with DTDs than stylesheets.

And besides for XUL localization, I see some admittedly smaller spillover benefits that might occur if this bug were resolved, as DTD-related DOM functions could be meaningfully implemented to allow developers to work with available DTD entities or create or identify entity references in a standard way (e.g., bug 9850).

>> This is also relevant for allowing document creators to work within their
>> document applications using convenient shortcuts (entities) in a language
>> neutral way. To give an example, here is a page I created in my own translation
>> of XHTML code into Chinese equivalent (i.e., a "Chinese XHTML", where the tags
>> themselves have a one-to-one correspondence with XHTML (besides allowing
>> CSS-as-XML to allow CSS to be internationalized as well) but which use the
>> Chinese script): http://bahai-library.com/zamir/chintest9.xml . This works with
>> a stylesheet to convert it into the "English" XHTML which browsers can render.
>> (If CSS were comprehensive enough to cover things like forcing a tag to display
>> as a form, CSS could be used instead of XSL.). If DTD parsing were supported,
>> one could conveniently redefine XHTML entities such that
>> entities such as &nbsp; could instead be represented with Chinese character
>> equivalents (or other language equivalents)--without having to add all possible
>> entities to the internal subset.
>
> You can do the substitutions on the server side. Sending home-grown
> vocabularies without well-known semantics over the public Web breaks processing
> based on well-known semantics, which leads to bad Babelization of markup.

Look--if one does not learn English (or latin scripts) fairly early in life, English code is itself a Babel-ish (or babble, but not babelfish). Home-grown vocabularies can very easily become standardized, as so frequently happens. Someone innovates, and others may revise or take it up. Should we reject all of the features that came out of the Netscape-Explorer wars if they weren't originally designed in a W3C committee and subsequently implemented universally? Yes, now things are mature enough that the benefits of reaching a consensus before adding markup for new FEATURES is more recognized, but that still doesn't hold back subcommunities (including very large ones such as WHATWG) to work on alternative standards which eventually do get adopted across the board. Things happen like this on a smaller scale all the time as well, and good, useful vocabularies come out of this process.

XML can already be styled on the web with CSS. While there is some validity I think to being concerned about a Babelization occurring with people using their own markup, especially when multiple ones for the SAME functionality become more popular (i.e., "Which standard do you pick?"), I think the average XML document creator understands that if they use vocabularies which do not have some following, they are risking some mild interoperability issues, such as a presumably potentially lesser prominence in some search engine results.

As long as the code can be rendered in a standard way, interoperability is not a serious issue (except perhaps where multiple similar standards crop up though I think this problem is inevitable where innovation is allowed to take place). As long as people are not constantly adding STYLING languages, one can choose from a conveniently small number of languages like CSS, XSL, or just XHTML and make any semantic markup work without there being serious problems.

>> A solution for entities wishing to make DTD's available (as opposed to DTD's
>> making entities available!) without facing undue burdens on their servers, is
>> simply not to make their DTD(s) available as files which can be linked to (or
>> serve them differently perhaps).
> I fail to see how it is an undue burden to expand entities on the server.
> Servers hosting Web apps do much more complex tasks all the time.

Sorry, I think I meant to say here that the undue burdens were faced by the individuals, not the servers (though in taking advantage of browser caching, it could also relieve the server).

And as far as the burden on document creators, it is only not a burden for those:
1) who have access to a server
2) who wish to distribute their documents over a server.
3) who are familiar enough with scripting languages to perform entity expansions
4) who consider the benefits great enough to overcome the hassle of coding the expansions if they don't have a library to do it for them
5) who don't desire to give a convenient way to others viewing their output's source code to be able to see or reuse their translations (e.g. to contextually discover useful entity files in the vein of http://www.w3.org/TR/xml-entity-names/ )

>> And couldn't the PUBLIC identifier be relied
>> on to preclude or limit external loading for well known dialects? Isn't that
>> its purpose?
>The public id is a legacy construct from pre-URI SGML era.

Regardless of its origins, it became and is a part of the XML spec.

>> But in some cases, sites such as Yahoo have even actually encouraged people to
>> point to scripts for reuse at their site, so no doubt some sites (with deep
>> pockets or small communities) would similarly want to even encourage reuse of
>> their own DTDs without users needing to save and define them locally.
>
> Preventing data leakage would require the use of Access-Control, which would
> reintroduce the DDoS on www.w3.org.
> http://dev.w3.org/2006/waf/access-control/

As I mentioned, sites can do what they wish--not include DTDs at public URLs at all (telling people how to create a SYSTEM identifier if the users will wish to use one to get access to their DTD's defined entities) or hope user agents rely on PUBLIC identifiers. What I was saying above is that some sites are willing to take the risk (they don't want to prevent data leakage at all), either due to having deep pockets like Yahoo or having a small community base.

>> As far as there being a problem with external DTD parsing being optional, while
>> obviously using one will break things for some people, since people are already
>> using them anyways, why not set the bar higher instead of lower and try to
>> implement it so that the incentive for others to implement becomes higher?
>
> People can't be already relying on external DTDs in Web content (even if they
> refer to external DTDs due to copying and pasting from a W3C example) as the top
> three browsers that support XHTML and SVG don't load external DTDs.

Who says we're only talking about XHTML and SVG? As I mentioned, Explorer (at least as of v7) does grab external DTDs for entity parsing in XML. Anyways, the idea about setting the bar higher still applies to being able to use entities in XHTML and SVG in the future.

>> As far as Tim Bray's statement you cite, I happen to disagree with that. Why
>> couldn't browsers cache entity files that it finds referenced?
>
> Caching the bytes doesn't remove the perf hit when parsing.

No, but transmission over the net is by far the bigger bottleneck to worry about.

> It also doesn't remove the DDoS problem when first fetching a well-known DTD.

Again, let browsers rely on PUBLIC identifiers for those DTDs which are assigned a live URL, and let people choose whether they want to take that risk on making their own DTDs available.

>> This is not only about XUL either, while that is one good argument. This is
>> about the EXTENSIBLE Markup Language that allows people to have the freedom to
>> conveniently make their own applications and share them.
>
> Not using well-known vocabularies is bad for semantic-dependent processing e.g.
> for accessibility and search.

How so for accessibility? Don't accessibility applications already interpret CSS (e.g., that display:none means don't read)? If non-disabled users are unaware of the semantics, why should it bother disabled users (e.g., whether it is <p> or <foo>, it can still be display:block and that's all that will be noticed by most people).

Even for search of an unknown vocabulary, text can still be indexed and even tagging features potentially supported (e.g., let the user specify things in a generic way like inNamespace:"myNS" inelement:"foo" text:"bar").

>> This is also relevant
>> to XML languages which are already standards, like TEI or DocBook, as well as
>> new applications.
>
> TEI and DocBook aren't Web languages. XHTML5, SVG and MathML are. New
> applications of XML are pretty much always DTDless (outside the XHTML2 WG).

TEI and DocBook can already be used on the web, except when used with external DTDs in Firefox (and some other browsers, but not Explorer). They can be used with or without stylesheets that render them into X/HTML. Just because they are not, unaided by stylesheets, directly interpreted into a (non-tree) visual presentation (as you are apparently defining "web languages") does not mean they have no use being shared over the web. If you really don't like browsers supporting generic XML, you can always file a separate bug to try to get Firefox to remove its present support for pure/plain XML (a bug I will not be voting for!). But those of us who create XML documents and enjoy their availability in Firefox (as the numerous others voting for this bug apparently do) also want to be able to take advantage of DTDs!

>> And why do people need to make a false dichotomy between web and non-web?
>
> Non-Web doesn't burden browsers, so it isn't a concern when considering what
> browsers need to keep supporting for decades if not centuries to come.

Pure XML is pretty well supported in FF as it is. This is mostly about one missing feature, for which there is apparently already a working patch here. Since XML is well used off the web (in documents that may be around for decades or centuries to come), I still fail to see why these cannot be made supportable over the web.

To my mind, if I want to put an XML document over the web, it becomes a web document and if XML is supported in browsers, it already is a web language.

>> Let a hundred standards bloom!
>
> That goes against the very point of having a standard.

C'mon now. It only goes against the point of having a standard if the uses are identical (clearly I and others here are not talking about things like OOXML and ODF, assuming that is even a fair example). We're talking about the freedom for entirely new uses (admittedly more semantic than visual) to be devised. Otherwise you are left with the choice of:
1) Freezing the web entirely (except, say, to implement the rest of CSS 2.1)
2) Making one giant spec which encompasses XHTML, SVG, X3D, etc. etc. and every foreseeable general multi-purpose viewing scheme we could have, so we only have one standard.

And even #2 if it were possible would still not meet the specialized (semantic) interests of niche communities.

> (In reply to comment #105)
>> You could also deal with new DTDs by
>> implementing a (DTD) update service similar to the ones used for Firefox
>> add‐ons, browser updates, or Live Bookmarks.
>
> I think the cost/benefit ratio of introducing an update mechanism to support a
> sunsetting legacy feature of XML is unfavorable.

I think the above is an excellent idea, since while the DTD as a validating tool may be on its way out (though some may prefer its relative simplicity, at least compared to W3C XML Schema), there is at present no other way to define entities in an attached external schema (at least for XML as it now stands).
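To make the use case concrete (file names here are hypothetical, just a minimal sketch), an external DTD is currently the only attached-schema mechanism that can supply entity definitions by reference:

entities.dtd:

<!ENTITY copy  "&#169;">
<!ENTITY mdash "&#8212;">

doc.xml:

<?xml version="1.0"?>
<!DOCTYPE doc SYSTEM "entities.dtd">
<doc>&copy; 2008 Example Corp &mdash; all rights reserved</doc>

Neither W3C XML Schema nor RELAX NG can hand those &copy;/&mdash; definitions to the parser.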

>> The fact that the feature is optional is a problem for documents that must be
>> used in multiple browsers, but having the feature is certainly better than
>> nothing and implementing it may encourage other vendors to do the same.
>
> That's part of the problem. If other vendors implement this too, a new version
> of XML can't remove DTDs.

The XML spec, as it stands, requires internal document subsets to work, and such documents (with some exceptions like bug 267350) do already work in FF. Thus if internal document subsets were disallowed, those adding external DTDs to FF (if a patch were applied) would need to update later to stay current, but so would those who rely on the internal subset now. While a future XML spec might conceivably disallow external DTDs and merely deprecate internal document subsets (since it would be very odd to allow the latter perpetually but not the former), we're still talking about internal subsets eventually being dropped anyway (and thus backwards compatibility broken), no?
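For comparison, here is the internal-subset form of the same kind of thing, which already works in FF today (a minimal sketch):

<?xml version="1.0"?>
<!DOCTYPE doc [
  <!ENTITY copy "&#169;">
]>
<doc>&copy; 2008 Example Corp</doc>

The entity machinery is thus already in the parser; this bug is essentially about letting those declarations live in a separate file fetched by the browser.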
I'd also like to make one more set of comments related to the TEI/DocBook discussion.  Let's say I come to a website which shares some XML files as XML. Why allow such direct access to these files if the user may only see them as trees or rendered the same as XHTML? Because one can potentially do specialized processing immediately on such XML over the web which DOES take advantage of rich semantics.

To take one example, my FF extension XqUSEme, to offer just one of my personal practical interests in this bug,
https://addons.mozilla.org/en-US/firefox/addon/5515 , lets you perform XQueries on XML (or HTML) files to which one navigates over the web (admittedly post-stylesheet processing only, at present, though I have plans to fix that). (For those who are not aware, XQuery lets you query XML, similarly to how SQL queries relational databases, but taking advantage of XML's more hierarchic nature.)

So, rather than having to load the document in an external program (more and more people talk about enjoying "living" in their browsers, esp. FF), you can immediately begin semantic-rich queries of such documents, such as to extract all of the unique salutations in a list of letters, sort a group of letters by date, find all uses of foreign words in a book, etc. (Or perform XSL: https://addons.mozilla.org/en-US/firefox/addon/5023 ). Queries might similarly be performed against web data stores; applications may be unwilling to grant public SQL access to let others take advantage of their live data, but XML output may be more feasible (at least if the data files are not extremely large).
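As a sketch of the kind of query I mean (the element names are hypothetical, and I assume the letters are gathered in one document):

for $l in doc("letters.xml")//letter
order by xs:date($l/@when)
return $l/salutation

or, for just the unique salutations:

distinct-values(doc("letters.xml")//letter/salutation)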

These kinds of XML applications cannot be used meaningfully and conveniently if they are only shared as downloads or are converted server-side into so-called "web languages".

I also have plans to make the above extensions work Greasemonkey-style, allowing one to apply XQueries or XSLT to XML files as they are loaded, thus giving users a chance to take advantage of numerous semantic hooks for styling purposes (hooks which can be harder to come by in non-semantically-rich "web languages", at least those whose document creators wish to avoid the extra bandwidth and trouble involved in providing numerous and consistent classes).

I offer the above to point out that, while historically people on the web have seen little use for semantic markup (beyond simple stylistic hooks, the somewhat controversial and undeniably clunky overloading of class attributes in microformats, and primitive metadata), especially given the paucity of semantic tags in XHTML (how common is it that people want to search for or extract <em> contents only, for example?), I see no good reason why such a divide must continue to exist between web and non-web, especially if this bug could be resolved.
The silence here is deafening.  Could we get a status update, please?  This is the single most infuriating/frustrating bug in Firefox for me as a user -- it's the _only_ thing that ever causes me to voluntarily launch IE.  _Please_ could we get Firefox in line with the XML spec on this one!
I should point out that I have experienced this bug in other browsers as well. I think I had it in Safari, Opera, and Google Chrome (although it may have only been in two of those browsers).
(In reply to comment #111)
> The silence here is deafening.  Could we get a status update, please?

There's not much to tell really, but if noise makes people happy: it's currently not a priority for me so I'll reassign to nobody.
There's a working patch. Probably needs updating, more documentation (see comment 91) and reviews.
Assignee: peterv → nobody
I'm wondering if there might be enough interest to take up a collection for this one, if we could pay for Peter's patch to be landed on trunk in short order. If this is kosher with Mozilla policies, and if we could get a cost estimate of what might be expected from him (or whoever else could legitimately deliver), I'd be happy to coordinate communication among any other potential donors watching this feature request about what we could chip in to make it happen (e.g., if people would only contribute once a certain number of others were giving a certain amount)... (I'm not a deep-pockets guy myself, but I figure we might be able to collect enough together...)
Also, I'd like to know what it would take (as I presume the patch doesn't solve this) to be able to handle XML with external DTD's within extension code (e.g., document.load, document.normalizeDocument/document.createEntityReference)...
I see Peter said, "Errors in the DTD or recursively loading the same entity do get reported and stop parsing", but I'm not clear on whether that addresses the "billion laughs" attack issue: e.g., http://www.ruby-lang.org/en/news/2008/08/23/dos-vulnerability-in-rexml/.
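For anyone unfamiliar with that attack, it is a handful of nested (but non-recursive) entity definitions whose expansion grows exponentially; a truncated sketch:

<?xml version="1.0"?>
<!DOCTYPE lolz [
  <!ENTITY lol "lol">
  <!ENTITY lol2 "&lol;&lol;&lol;&lol;&lol;&lol;&lol;&lol;&lol;&lol;">
  <!ENTITY lol3 "&lol2;&lol2;&lol2;&lol2;&lol2;&lol2;&lol2;&lol2;&lol2;&lol2;">
  <!-- ...and so on up to lol9... -->
]>
<lolz>&lol9;</lolz>

Ten levels of tenfold expansion yields around 10^9 copies of "lol" from a tiny input, so a recursion check alone wouldn't catch it: nothing here is recursive, it's pure expansion.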
Poking at this again... The same-origin restriction is still going to be too limiting for some cases. I've been writing some IETF documents in XML and it would be nice to be able to proofread them (as local files) in the browser (using rfc2629.xslt). But, since I'm using the entity references on xml.resource.org all of my xref targets are broken since Mozilla doesn't load them. I suppose I could try downloading all of those references to my local machine, but then I'd have the burden of checking for updated revisions myself. That seems pretty silly when Mozilla already has code to handle locally cached copies of remote documents, along with automatically updating (using HTTP If-Modified-Since header, etc.) as needed.
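For the record, such a draft source looks roughly like this (a sketch from memory; the bibxml path is the scheme xml.resource.org uses for its reference files):

<?xml version="1.0"?>
<!DOCTYPE rfc SYSTEM "rfc2629.dtd" [
  <!ENTITY rfc2119 SYSTEM
    "http://xml.resource.org/public/rfc/bibxml/reference.RFC.2119.xml">
]>
<rfc>
  <references title="Normative References">
    &rfc2119;
  </references>
</rfc>

Each such reference is an external entity, so with external loading disabled every one of them comes up empty.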

The whole web vs non-web discussion seems out of place; the implication is that if I save a local copy of a document I originally viewed on the web then it will break when I try to view the local copy. I've been writing HTML documents for years, and all of them work whether I view them on my local hard drive or when uploaded to my web servers. It would certainly break expectations if our XML documents weren't equally transportable.

As for the blocking / rendering issue, this doesn't seem much different than rendering a page with multiple frames, or loading a page with lots of embedded images and other objects. In that case, you still have to wait until everything is retrieved before you can complete the rendering.

Why is external entity retrieval more of a privacy risk than image retrieval?
Status: ASSIGNED → NEW
QA Contact: ashshbhatt → xml
Cross-posting on relevant bug pages:

For this bug, and a number of other associated bugs (Bug 204102, Bug 267350, Bug 22942, and to a lesser extent Bug 196355), I've started a pledge drive at http://pledgie.com/campaigns/7732 to try to hire a developer(s) who can work with the Mozilla devs (if they are ineligible themselves) to get these long-standing and niche but important-to-XML-users bugs fixed. Feel free to make a pledge to donate toward these fixes or, if you are a developer, make a bid in the comments there to offer to fix, in conjunction with Mozilla devs, this or any of the other aforementioned XML-related bugs/feature requests. 

(If we can get enough momentum, Bug 234485, Bug 98413, Bug 275196, and Bug 94270 might be also nice candidates to get addressed too, but I've started with the (single-point-of-failure-causing) DTD issues.)
Unsure if this is where I should post this...

Using Firefox 3.6.2.

When an internal subset is included in the DOCTYPE of an HTML page, I'm seeing a flaw. You can see the "]>" output to the screen because the internal subset isn't being loaded; the parser appears to stop at the first ">" it finds. Here is an example of the HTML:

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
"http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"
[
  <!ATTLIST img hsrc CDATA #IMPLIED >
]>

I added "hsrc" attribute to "img" here, but the parser is stopping at the ">" after the tag.   According to w3c (http://www.w3.org/TR/REC-xml/#dtd *NOTE: "'[' intSubset  ']'" is optional, but declared) what I have written should work, but doesn't appear to.

Not sure if this is the code responsible, but nsParser (https://hg.mozilla.org/mozilla-central/file/e9312d05488f/parser/htmlparser/src/nsParser.cpp), in the function "ParseDocTypeDecl", lines 1072 through 1091, appears to be the state engine that searches for the doctype. It ignores "[" entirely after reaching the point where it should be looking for one. The issue is probably not a small fix, though, because the content found would have to be parsed and used when the DTD is loaded at the end. Maybe I'm way off... Oh, here's the state engine in nsParser:

  1072   do {
  1073     theIndex = aBuffer.FindChar('<', theIndex);
  1074     if (theIndex == kNotFound) break;
  1075     PRUnichar nextChar = aBuffer.CharAt(theIndex+1);
  1076     if (nextChar == PRUnichar('!')) {
  1077       PRInt32 tmpIndex = theIndex + 2;
  1078       if (kNotFound !=
  1079           (theIndex=aBuffer.Find("DOCTYPE", PR_TRUE, tmpIndex, 0))) {
  1080         haveDoctype = PR_TRUE;
  1081         theIndex += 7; // skip "DOCTYPE"
  1082         break;
  1083       }
  1084       theIndex = ParsePS(aBuffer, tmpIndex);
  1085       theIndex = aBuffer.FindChar('>', theIndex);
  1086     } else if (nextChar == PRUnichar('?')) {
  1087       theIndex = aBuffer.FindChar('>', theIndex);
  1088     } else {
  1089       break;
  1090     }
  1091   } while (theIndex != kNotFound);
(In reply to comment #119)

Oh, I forgot the xml declaration at the top

<?xml version="1.1" encoding="utf-8"?>
(In reply to comment #119)

Oh, I forgot the xml declaration at the top

<?xml version="1.0" encoding="utf-8"?>
(In reply to comment #121)

Ok, not that anyone will do it, but it seems the relevant code is in the tokenizers (the big state machine like the one in https://hg.mozilla.org/mozilla-central/file/e9312d05488f/parser/html/javasrc/Tokenizer.java) for the pages. After the AFTER_DOCTYPE_NAME state there should be a search for "[", and then another state added, a "doctype internal subset" state or something, until the final "]" is found (though that might be a problem in itself, since the data in between would have to be parsed or else other problems might arise), and then a transition into BOGUS_DOCTYPE (because SYSTEM or PUBLIC should have already been handled) unless the ">" is found. Obviously whitespace is allowed after the "]". I suppose all that data could just go to an empty variable or be dumped.

As far as I can tell, the only browsers that handle this are Opera and Lynx (I think they catch and dump the subset), so it is obviously not a widespread problem. Plus, the declarations can be put into a local DTD.
(In reply to comment #119)
> Unsure if this is where I should post this...

It's most certainly not. Please file a separate bug.
Someone should change the title of the bug to "Load external DTDs *in XML*..." or something similar to avoid confusion. Also, I'm not sure that "if a pref is set" is really required, assuming a same-origin restriction is used in any patch for this bug.

@Joseph and Peter Van der Beken: This isn't a bug so I wouldn't waste time filing it. Referencing the XML specification when you are serving an XHTML document with the HTML MIME-type doesn't make any sense; when you serve it with an XML MIME-type, Firefox should exhibit the expected behavior.

Under an HTML MIME-type, the rules of the current HTML 4.01 or upcoming HTML5 specs would apply:

The construct you described is forbidden by HTML 4.01 Section 7.2, which only allows three very specific |DOCTYPE| declarations (none of which includes an internal DTD subset); the spec doesn't define error handling, so outputting the characters is a valid interpretation. (I'm a bit curious why the spec documents use |DOCTYPE| declarations that violate the rules of the spec, though.) This might have been considered an enhancement request against HTML 4.01 anyway, except that:

HTML5 (2010-03-24) specifically requires this behavior per Sections 10.2.4.67, 10.2.4.68, and 10.2.4.1: when |<| of |<!ATTLIST| is encountered, the document enters an error state in which |>| ends the |DOCTYPE| declaration; so, the |>| at the end of the |ATTLIST| declaration ends the *entire* |DOCTYPE| declaration. Then, |]>| still remains and is output as character data.
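To spell out what the tokenizer does with the markup from comment #119 in text/html: the "[" (or at latest the "<" of "<!ATTLIST") puts it into an error state, so the ">" at the end of

<!ATTLIST img hsrc CDATA #IMPLIED >

closes the entire DOCTYPE token, and the leftover "]>" is then tokenized as ordinary character data -- which is exactly the stray "]>" seen on screen.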
(In reply to comment #123)
> (In reply to comment #119)
> > Unsure if this is where I should post this...
> 
> It's most certainly not. Please file a separate bug.

No need to file a bug about the ]> in text/html. It's per spec. Authors must not use what looks like an internal subset in text/html.

As for this bug itself, I'm not the module owner but I think this should be WONTFIX even if a patch was paid for. Loading external DTDs in XML in Web content would tie Gecko to a complex legacy feature for little gain. In addition to having to implement the feature in the first place, having it would make it harder to maneuver in the area of XML support in the future (e.g. moving to off-the-main-thread XML parsing, moving to XML5 parsing if XML5 happens, or moving to another XML parser for performance reasons).
Note that since removal of remote XUL support is planned (bug 546857), adding support for external DTDs is less relevant.
> for little gain.

Sorry, but the gain is far from little. This bug is *the* main reason why remote XUL is unusable. And remote XUL was once proclaimed one of the cool features of Mozilla, and it would still be very useful for company-internal applications, XULRunner apps loading remote content, etc.

This has been one of the reasons why TomTom originally chose XULRunner, and the fact that this is not practically possible has been a major bummer for TomTom, who distributes a XULRunner app to about 10 million people.

In other words, this bug hurts the "XUL dark matter" tremendously.
And yes, I'm aware that you can't and shouldn't run privileged JS from remote XUL. The point of remote XUL would have been to tie our web offerings in naturally with the native application, both from the styling perspective (user POV: native looks, identical to the native app, so seamless integration) and the coding perspective (the same dev team can develop the remote and local parts with identical technology). As it is, you practically can't do that; you can't make a website look like a native app or XULRunner app, and there's a huge, visible gap between the native app and the web offering.
Also, right now the Text Encoding Initiative, which encodes classical and other important texts with very rich semantic markup and is used in universities around the world, is in the midst of applying for its own content type in the hope of seeing TEI documents directly available on the web in their native format. Many of these are large documents (a microformat version would add a lot of markup), and they can be prepared by teams using DTDs for convenient entity creation, since these documents require a lot of attention and benefit from DTDs' modularity (e.g., for conveniently adding obscure symbols; see the sketch below). Support for DTDs over the web could invite more universities to put these documents online, and applications could more readily be created for them.
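A sketch of what such a team's file might look like (the DTD location and the particular entity are hypothetical):

<?xml version="1.0"?>
<!DOCTYPE TEI SYSTEM "tei_all.dtd" [
  <!ENTITY stigma "&#x03DA;">
]>
<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <text><body><p>the numeral &stigma; appears in the manuscript</p></body></text>
</TEI>

except with the shared symbol declarations living in the external DTD rather than being repeated in every file's internal subset.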
The original target of this bug has been lost sight of, and overreaction has set in. The original bug, which is still with us, is that application/xml documents for which Mozilla today _does_ process both internal and external subsets do not get that processing when they are fetched from the local disk. For application/xml documents which reference stylesheets, the processing of the subsets is often important and regularly used. See for example http://www.w3.org/TR/2009/REC-sml-20090512/sml.xml, which works just fine today.

So, please be very careful to separate discussion of text/html processing from application/xml or application/xhtml+xml processing. The patch above for the latter would be a welcome change. Getting _rid_ of subset processing for the XML cases would be a disastrous regression.
Lest I be misunderstood, by 'patch above', I mean the patch submitted by Peter Van der Beken [:peterv] on 2007-06-26 07:31:53 PDT
Oops, my bad: the external subset doesn't _ever_ work for entities (or, I guess, IDness). We are still hung up waiting for Boris to explain his override on Peter's patch.
(In reply to comment #132)
> Oops, my bad, external subset doesn't _ever_ work for entities (or, I guess,
> IDness).  We are still hung waiting for Boris to explain his override on
> Peter's patch.

We're not waiting on Boris; he already reviewed the patch and rejected it due to lack of documentation. (Comment #91 and Comment #92) Peter Van der Beken, who wrote the patch, says that this is no longer a priority for him (Comment #113); so any waiting is actually on someone to write a new patch with documentation (or for a WONTFIX as Henri wants).
Flags: wanted-fennec1.0?
WONTFIXing per discussion at the All Hands in December 2010: bug 651049 is going to be on my plate; with off-the-main-thread parsing, our IO APIs and expat's external entity APIs don't go together nicely; and the XML spec left this feature optional precisely in order to allow Web browsers not to be burdened with external DTDs. (See Tim Bray's annotated XML spec.)
Status: NEW → RESOLVED
Closed: 13 years ago
Resolution: --- → WONTFIX
This is a serious interop problem, please reconsider. Both Opera and Chrome (and therefore, I presume, Safari) process the external subset, so that e.g. access to the MathML entity declarations is straightforward for them. This decision is also a serious roadblock for Polyglot HTML5, since all those entities _are_ available for the HTML serialisation, but, w/o external subset processing, will _not_ be available for the XHTML serialisation.

Maintaining an entity stack is already part of expat, and you need it for processing the _internal_ subset.  Why isn't Van der Beken's patch usable?
Further to Comment #136 here's an example of existing content on the web which works in Opera and Chrome but not in Firefox:

 http://www.w3.org/2001/tag/doc/metaDataInURI-31-20070102.xml

Don't Break the Web :-)
(In reply to Henry S. Thompson from comment #136)
> Both Opera and Chrome (and, therefore, I presume, Safari) process the external 
> subset

Can you point me to verifiable evidence, please?

http://hsivonen.iki.fi/test/moz/external-subset.xml shows that neither Opera (with default settings) nor Chrome load the external subset even from the same origin. (Opera has a pref for loading the external subset, but the pref is off by default.)
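The shape of such a test is simple (a minimal sketch, not necessarily the exact file contents):

external-subset.xml:

<?xml version="1.0"?>
<!DOCTYPE root SYSTEM "external-subset.dtd">
<root>&fromExternalSubset;</root>

external-subset.dtd:

<!ENTITY fromExternalSubset "external subset was loaded">

A browser that loads the external subset renders the entity's replacement text; one that doesn't shows an undefined-entity error or nothing.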

> This decision is also a serious roadblock for Polyglot HTML5, since all
> those entities _are_ available for the HTML serialisation, but, w/o external
> subset processing, will _not_ be available for the XHTML serialisation.

They are available to the XHTML serialization if you use one of the special public ids in the doctype. E.g.
<!DOCTYPE html PUBLIC
    "-//W3C//DTD XHTML 1.1 plus MathML 2.0 plus SVG 1.1//EN"
    "http://www.w3.org/2002/04/xhtml-math-svg/xhtml-math-svg.dtd">

Also, the various hacks for making the well-known set of entities available are a different issue from actually performing network IO to fetch external subsets.

> Maintaining an entity stack is already part of expat, and you need it for
> processing the _internal_ subset.  Why isn't Van der Beken's patch usable?

As I said in comment 135, the plan is to move XML parsing off the main thread and expat's external entity API and our network IO APIs don't work nicely together in that case.

(In reply to Henry S. Thompson from comment #137)
> Further to Comment #136 here's an example of existing content on the web
> which works in Opera and Chrome but not in Firefox:
> 
>  http://www.w3.org/2001/tag/doc/metaDataInURI-31-20070102.xml

This is a different issue: what happens once an external entity isn't fetched from the network. Also, it doesn't "work" in Opera. Look for the word "December" in Opera: with default settings, you see the literal text "&nbsp;" (ampersand, the letters nbsp, and a semicolon) on both sides of the word December instead of a non-breaking space.