565938 - [HTML5] LinkedIn recruiter loads, then disappears, then starts looking for google.com

Reporter

Description

•

16 years ago

https://www.linkedin.com/cap/ Sadly, you need an account, but if you email me I can provide you with the credentials to use. When you hit the site, it loads, then disappears, and then we seem to go off looking for google.com.

Boris Zbarsky [:bzbarsky]

Comment 1

•

16 years ago

> but if you email me I can provide you with the credentials to use. Please.

Bret Reckard

Reporter

Comment 2

•

16 years ago

Incoming!

Boris Zbarsky [:bzbarsky]

Comment 3

•

16 years ago

OK, definitely due to the way HTML5 parser treats document.write. What happens here is that the script https://www.google.com/jsapi/?key=notsupplied is running after parsing is done, and this script happens to do a document.write of something like <script src="https://www.google.com/uds/?file=feeds&v=1"> in this case. This script is triggered from a JS call in https://www.linkedin.com/cap/js/cap-blog.js on line 1, which is: google.load("feeds", "1"); cap-blog.js is loaded like so: getJS("/cap/js/cap-blog.js", true); where getJS is defined in https://www.linkedin.com/cap/js/loader.js like so, trimming out the part irrelevant to the call for cap-blog.js: /** * Prevent JS blocking ( || downloads) * @param {String} path Absolute path to the JS file * @param {Boolean} defer Should in be BODY or HEAD? inserts "document.getElementsByTagName( node )[0]" * @param {String} method Choose between "scriptDOM" or "XHRInject" */ function getJS( path, defer, method ){ var isIE = /*@cc_on!@*/false; var isSafari = document.childNodes && !document.all && !navigator.taintEnabled; if ( !isIE && !isSafari ) { var node; (defer !== true) ? node = 'head' : node = 'body'; method = method || 'scriptDOM'; node = document.getElementsByTagName(node.toUpperCase())[0]; if (node) { var s = document.createElement('SCRIPT'); s.src = path; document.getElementsByTagName(node.appendChild(s)); } } else if ( isSafari || isIE ) { document.write('<scr'+'ipt src="' + path + '"><\/scr'+'ipt>'); } // END isIE/isSafari } Now what confuses me is that the getJS call above is happening directly off the parser: Breakpoint 2, nsHTMLDocument::GetElementsByTagName (this=0x7fffda524800, aTagname=..., aReturn=0x7fffffffbbb0) at ../../../../../mozilla/content/html/document/src/nsHTMLDocument.cpp:1393 1393 nsAutoString tmp(aTagname); (gdb) jsstack 0 getJS(method = "scriptDOM", defer = true, path = "/cap/js/cap-blog.js") ["https://www.linkedin.com/cap/js/loader.js":71] s = undefined xhr = undefined node = "body" isSafari = false isIE = false this = [object Window @ 0x7fffe3c94d30 (native @ 0x7fffe3c86c60)] 1 <TOP LEVEL> ["https://www.linkedin.com/cap/dashboard/home":885] this = [object Window @ 0x7fffe3c94d30 (native @ 0x7fffe3c86c60)] (gdb) bt #0 nsHTMLDocument::GetElementsByTagName (this=0x7fffda524800, aTagname=..., aReturn=0x7fffffffbbb0) at ../../../../../mozilla/content/html/document/src/nsHTMLDocument.cpp:1393 #1 0x00007ffff6932476 in nsIDOMDocument_GetElementsByTagName (cx=0x7fffe3c5c000, argc=1, vp=0x7fffdbd97138) at dom_quickstubs.cpp:3692 #2 0x00007ffff5190e95 in js_Interpret (cx=0x7fffe3c5c000) at ../../../mozilla/js/src/jsops.cpp:2199 #3 0x00007ffff51a59fa in js_Execute (cx=0x7fffe3c5c000, chain=0x7fffe98e2680, script=0x7fffdc437400, down=0x0, flags=0, result=0x0) at ../../../mozilla/js/src/jsinterp.cpp:1073 #4 0x00007ffff5116b22 in JS_EvaluateUCScriptForPrincipals (cx=0x7fffe3c5c000, obj=0x7fffe98e2680, principals=0x7fffdbd5b9d8, chars=0x7fffffffc870, length=49, filename= 0x7fffdbb180a8 "https://www.linkedin.com/cap/dashboard/home", lineno=881, rval=0x0) at ../../../mozilla/js/src/jsapi.cpp:4880 #5 0x00007ffff6443a48 in nsJSContext::EvaluateString (this=0x7fffe3c94550, aScript=..., aScopeObject=0x7fffe98e2680, aPrincipal=0x7fffdbd5b9d0, aURL= 0x7fffdbb180a8 "https://www.linkedin.com/cap/dashboard/home", aLineNo=881, aVersion=0, aRetValue=0x0, aIsUndefined=0x7fffffffc78c) at ../../../mozilla/dom/base/nsJSEnvironment.cpp:1763 #6 0x00007ffff61fe361 in nsScriptLoader::EvaluateScript (this=0x7fffdbd2e9a0, aRequest=0x7fffdbd11d00, aScript=...) at ../../../../mozilla/content/base/src/nsScriptLoader.cpp:760 #7 0x00007ffff61fdd64 in nsScriptLoader::ProcessRequest (this=0x7fffdbd2e9a0, aRequest=0x7fffdbd11d00) at ../../../../mozilla/content/base/src/nsScriptLoader.cpp:673 #8 0x00007ffff61fd9fa in nsScriptLoader::ProcessScriptElement (this=0x7fffdbd2e9a0, aElement=0x7fffdc492ba0) at ../../../../mozilla/content/base/src/nsScriptLoader.cpp:624 #9 0x00007ffff61fa799 in nsScriptElement::MaybeProcessScript (this=0x7fffdc492ba0) at ../../../../mozilla/content/base/src/nsScriptElement.cpp:195 #10 0x00007ffff62ea7f6 in nsHTMLScriptElement::MaybeProcessScript (this=0x7fffdc492b30) at ../../../../../mozilla/content/html/content/src/nsHTMLScriptElement.cpp:552 #11 0x00007ffff62ea4b4 in nsHTMLScriptElement::DoneAddingChildren (this=0x7fffdc492b30, aHaveNotified=1) at ../../../../../mozilla/content/html/content/src/nsHTMLScriptElement.cpp:480 #12 0x00007ffff65c452e in nsHtml5TreeOpExecutor::RunScript (this=0x7fffda5426f0, aScriptElement=0x7fffdc492b30) at ../../../mozilla/parser/html/nsHtml5TreeOpExecutor.cpp:725 #13 0x00007ffff65c3b3c in nsHtml5TreeOpExecutor::RunFlushLoop (this=0x7fffda5426f0) at ../../../mozilla/parser/html/nsHtml5TreeOpExecutor.cpp:521 #14 0x00007ffff65c513e in nsHtml5ExecutorReflusher::Run (this=0x7fffdbd6cea0) at ../../../mozilla/parser/html/nsHtml5TreeOpExecutor.cpp:90 Why is that script added by getJS not blocking the parser? I would think it would. Henri?

Blocks: html5-parsing

blocking2.0: --- → ?

Keywords: regression

Boris Zbarsky [:bzbarsky]

Comment 4

•

16 years ago

Hmm. I guess it did in the old parser but might not in the new one... if it were document.written, it would, of course. Which makes me want to say this should be tech evang.

Martijn Wargers (dead)

Comment 5

•

16 years ago

I have a similar problem like this in bug 560256, which was marked invalid.

Boris Zbarsky [:bzbarsky]

Comment 6

•

16 years ago

Right. The question here is only why the document load finishes before the cap-log.js script loads.

Henri Sivonen (:hsivonen)

Comment 7

•

16 years ago

(In reply to comment #3) > /** > * Prevent JS blocking ( || downloads) > * @param {String} path Absolute path to the JS file > * @param {Boolean} defer Should in be BODY or HEAD? inserts > "document.getElementsByTagName( node )[0]" > * @param {String} method Choose between "scriptDOM" or "XHRInject" > */ > function getJS( path, defer, method ){ > var isIE = /*@cc_on!@*/false; > var isSafari = document.childNodes && !document.all && > !navigator.taintEnabled; > if ( !isIE && !isSafari ) { > var node; > (defer !== true) ? node = 'head' : node = 'body'; > method = method || 'scriptDOM'; > node = document.getElementsByTagName(node.toUpperCase())[0]; > > if (node) { > var s = document.createElement('SCRIPT'); > s.src = path; > document.getElementsByTagName(node.appendChild(s)); > } > } else if ( isSafari || isIE ) { > document.write('<scr'+'ipt src="' + path + '"><\/scr'+'ipt>'); > } // END isIE/isSafari > } Aargh. I very much dislike this pattern. We converge on a common behavior by implementing WebKit and IE traits but sites assume old Gecko traits and put us on the unsafe code path. But having different code paths is pointless in the first place! > Why is that script added by getJS not blocking the parser? I would think it > would. Henri? The spec says it shouldn't block the parser: See the second to last case under step 8 under http://www.whatwg.org/specs/web-apps/current-work/#running-a-script The historical blocking behavior here is different between browsers. The problem is that the site assumes the blocking behavior will stay constant in each browsers engine and they will never converge on a common behavior. Worse, the site clearly tries to get unblocking behavior where possible ("Prevent JS blocking") but actually relies on getting blocked everywhere... I suggest contacting the site asking them to rewrite the method as: /** * Load a script during the HTML parse. Only to be called from * script elements that appear in the document's source or that have * themselves been created using document.write(). Do not call * from event handlers or timeouts! * @param {String} path Absolute path to the JS file * @param {Boolean} defer Ignored * @param {String} method Ignored */ function getJS( path, defer, method ){ document.write('<scr'+'ipt src="' + path + '"><\/scr'+'ipt>'); } And then introducing a new method: /** * Load a script avoiding blocking. The script designated by * path MUST NOT call document.write()! * @param {String} path Absolute path to the JS file */ function getJSAvoidBlocking( path ){ var s = document.createElement("script"); s.src = path; document.getElementsByTagName("head")[0].appendChild(s); } And then using the new method only for scripts that have been reviewed not to call document.write().

Assignee: nobody → english-us

Component: HTML: Parser → English US

OS: Mac OS X → All

Product: Core → Tech Evangelism

QA Contact: parser → english-us

Hardware: x86 → All

Boris Zbarsky [:bzbarsky]

Comment 8

•

15 years ago

Bret, do you want to drop the linkedin folks an e-mail?

Martijn Wargers (dead)

Comment 9

•

15 years ago

I'm also seeing this kind of problem on http://www.weer.nl . No problem in Firefox3.6.

Bret Reckard

Reporter

Comment 10

•

15 years ago

BZ, Aakash knows someone on the LinkedIn Frontend team. We'll get him CC'd on the bug. Thank you for looking into this.

Henri Sivonen (:hsivonen)

Comment 11

•

15 years ago

http://www.weer.nl WFM.

Martijn Wargers (dead)

Comment 12

•

Comment 29

•

15 years ago

•

15 years ago

Assignee: nobody → english-us

blocking2.0: ? → ---

Component: HTML: Parser → English US

Product: Core → Tech Evangelism

QA Contact: parser → english-us

Nobody; OK to take it and work on it

Updated

•

11 years ago

Product: Tech Evangelism → Tech Evangelism Graveyard