Closed Bug 70828 Opened 25 years ago Closed 21 years ago

Tags that are not closed by the end of the document do strange things.

Tracking

Status:

RESOLVED FIXED

Milestone:

Future

People

(Reporter: ericmao, Assigned: mrbkap)

References

Details

(Keywords: testcase)

Attachments

(5 files, 1 obsolete file)

Testcase (icky HTML, I know), 25 years ago, Boris Zbarsky [:bzbarsky], 315 bytes, text/html
Single quotes on event handler morphed to double-quotes, 23 years ago, Roland Roberts, 640 bytes, text/html
Explanation of this bug., 21 years ago, Blake Kaplan (:mrbkap) (inactive), 2.86 KB, text/html
work in progress, 21 years ago, Blake Kaplan (:mrbkap) (inactive), 5.62 KB, patch
patch v2, 21 years ago, Blake Kaplan (:mrbkap) (inactive), 7.20 KB, patch, rbs: review+
patch v2, 21 years ago, Blake Kaplan (:mrbkap) (inactive), 8.44 KB, patch, rbs: review+, roc: superreview+

Eric Mao

Reporter

Description

•

25 years ago

I created a simple HTML file:

<html> <body bgcolor="white" "bogus extra stuff""""> </body> </html>

I opened the file up in Mozilla 0.8 under Win2K, and went to View Page Source. I expected to see the same HTML code that I typed in, despite the fact that the HTML code contained errors. Instead, some of the double-quotes got cleaned up:

<html> <body bgcolor="white" bogus extra stuff> </body> </html>

mar_garina

Comment 1

•

25 years ago

Same here, under Linux. The question is whether it's a bug: an HTML file should look like this: <TAG OPT1="value1"> and not <TAG "value1"> ...

Fabian Guisset

Comment 2

•

25 years ago

sending to parser for triage

Assignee: ben → harishd

Component: XP Apps: GUI Features → Parser

QA Contact: sairuh → janc

Oliver Klee

Comment 3

•

25 years ago

Confirming as per user comment, setting OS=All.

Status: UNCONFIRMED → NEW

Ever confirmed: true

OS: Windows 2000 → All

Boris Zbarsky [:bzbarsky]

Comment 4

•

25 years ago

View source should be showing the source as it really is -- otherwise its usefulness for debugging web pages is not very great (and why does anyone ever look at view source otherwise?).

timeless

Comment 5

•

25 years ago

because we want to see what content mozilla decided was worth parsing ;-b otherwise, we'd use telnet to get the real content :) *sigh* this is at least normal severity.

Severity: minor → normal

Keywords: dataloss

Jan Carpenter

Updated

•

25 years ago

QA Contact: janc → bsharma

clayton

Comment 6

•

25 years ago

I appreciate that this is truly annoying, but can it actually alter the semantics of content so that developers are misled rather than just inconvenienced? Futuring for now, will reconsider given a compelling test case.

Target Milestone: --- → Future

Boris Zbarsky [:bzbarsky]

Comment 7

•

25 years ago

Attached file: Testcase (icky HTML, I know)

Boris Zbarsky [:bzbarsky]

Comment 8

•

25 years ago

The attached testcase sucks as HTML goes. Nevertheless... Moving the closing quote to before </html> will make things work OK. But the fact remains that view source just silently lost half the source of that page and, more importantly, lost all indications that an error of any kind has occurred. I'm not completely sure that it's the same bug, but it seems related...
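To make the failure mode concrete, here is a minimal Python sketch of a check for the condition in this bug's summary: a tag that is still open when the document ends, for example because an attribute quote was never closed. The function name and the sample input are hypothetical (the attached testcase is not reproduced in this thread), and the scanner is deliberately naive: it ignores comments, CDATA, and script content.

```python
def find_unterminated_tag(source):
    """Return the offset of a '<' whose tag is still open at EOF, else None."""
    tag_start = None  # offset of the '<' of the tag we are currently inside
    quote = None      # quote character of the attribute value we are inside
    for i, ch in enumerate(source):
        if tag_start is None:
            if ch == "<":
                tag_start = i
        elif quote is not None:
            # Inside a quoted value: only the matching quote ends it.
            if ch == quote:
                quote = None
        elif ch in "\"'":
            quote = ch
        elif ch == ">":
            tag_start = None
    return tag_start


# Hypothetical input: the quote after bgcolor= is never closed.
broken = '<html><body bgcolor="white></body></html>'
print(find_unterminated_tag(broken))
print(find_unterminated_tag('<html><body></body></html>'))
```

A viewer that wants to show the source "as it really is" could use a check like this to flag the offending tag instead of silently dropping or repairing everything after it.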

Boris Zbarsky [:bzbarsky]

Updated

•

25 years ago

Blocks: 57724

Andreas M. "Clarence" Schneider

Updated

•

24 years ago

No longer blocks: 57724

Andreas M. "Clarence" Schneider

Comment 9

•

24 years ago

*** This bug has been marked as a duplicate of 57724 ***

Status: NEW → RESOLVED

Closed: 24 years ago

Resolution: --- → DUPLICATE

Moied

Updated

•

24 years ago

QA Contact: bsharma → moied

Christopher Hoess (gone)

Comment 10

•

24 years ago

Reopening 57724 dependencies for independent resolution.

Status: RESOLVED → REOPENED

Depends on: 57724

Keywords: testcase

Resolution: DUPLICATE → ---

Roland Roberts

Comment 11

•

23 years ago

Attached file: Single quotes on event handler morphed to double-quotes

Outer single quotes are converted to double quotes both in "View->Source" and via the "Save Page As..." dialogue. In the latter case, the saved page will not work because this change breaks the nested quotes.

Boris Zbarsky [:bzbarsky]

Comment 12

•

23 years ago

Roland, I cannot reproduce the error you describe when viewing the source of that attachment (linux trunk build 2002-05-01-21). I see single quotes... The "save as" issue is a separate issue (with the "web page, complete" mode only) and should be filed as a separate bug.

Brant Gurganus

Comment 13

•

23 years ago

By the definitions on <http://bugzilla.mozilla.org/bug_status.html#severity> and <http://bugzilla.mozilla.org/enter_bug.cgi?format=guided>, crashing and dataloss bugs are of critical or possibly higher severity. Only changing open bugs to minimize unnecessary spam. Keywords to trigger this would be crash, topcrash, topcrash+, zt4newcrash, dataloss.

Severity: normal → critical

Christopher Hoess (gone)

Updated

•

23 years ago

Severity: critical → minor

Keywords: dataloss

Boris Zbarsky [:bzbarsky]

Comment 14

•

23 years ago

Please actually read bugs before changing the severity, ok?

Richard Neill

Comment 15

•

22 years ago

Comment 16

•

22 years ago

*** Bug 233609 has been marked as a duplicate of this bug. ***

Richard Neill

Comment 17

•

21 years ago

This "cleaning up" of the source before interpretation can also cause security issues. E.g., my website allows users to add certain simple HTML tags to their posts, but not others. If, however, a user enters this:

<B ONCLICK="window.open('http://www.badsite.com')"

then Mozilla will automatically append the closing > and render the next part of the website's own content with this onclick link. Admittedly, it's my fault in this case for not having a good enough regexp for filtering out bad tags (which I've now fixed), but I do wonder whether Mozilla should be displaying "what an attacker means" rather than "what the designer said".

The following was the vulnerable HTML cleanup code I had used. I've simplified $allowed for clarity.

$allowed='\s*\/?\s*(b|i|u|s|pre|tt|ul|ol|li|p|)\s*';
$memo=preg_replace("/<((?!($allowed>))[^<>]*)>/is", "<\\1>", $memo);
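The gap in that filter can be reproduced with a rough Python port of the regexp. This is a sketch under assumptions, not the reporter's actual code: the replacement is assumed to escape the angle brackets as `&lt;` and `&gt;` (as quoted, the replacement string would leave the input unchanged), and `stricter_filter` is a hypothetical alternative, not the fix the reporter says they applied.

```python
import html
import re

# Python port of the $allowed fragment quoted in the comment.
ALLOWED = r"\s*/?\s*(b|i|u|s|pre|tt|ul|ol|li|p|)\s*"

# Matches a complete <...> tag that is NOT one of the allowed tags.
DISALLOWED_TAG = re.compile(r"<((?!(" + ALLOWED + r">))[^<>]*)>", re.I | re.S)

# Hypothetical stricter approach: whitelist exact tags after escaping everything.
ALLOWED_ESCAPED = re.compile(r"&lt;(/?(?:b|i|u|s|pre|tt|ul|ol|li|p))&gt;", re.I)


def original_filter(memo):
    # ASSUMPTION: the original replacement escaped the brackets of bad tags.
    return DISALLOWED_TAG.sub(r"&lt;\1&gt;", memo)


def stricter_filter(memo):
    # Escape all markup first, then re-enable only bare allowed tags.
    return ALLOWED_ESCAPED.sub(r"<\1>", html.escape(memo))


attack = "<B ONCLICK=\"window.open('http://www.badsite.com')\""

print(original_filter("<script>x</script>"))  # complete bad tags are escaped
print(original_filter(attack) == attack)      # True: the unclosed tag slips through
print(stricter_filter(attack))                # no raw '<' survives
```

The original pattern only ever matches text that ends in `>`, so a tag left open at the end of the post is never examined. Escaping first and then whitelisting avoids relying on the attacker to supply well-formed tags.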