1843454 - [META] Untenable to diagnose a situation on a webpage due to debugger performance issues

Reporter

Description

•

1 year ago

•

I can easily lock up Firefox by debugging the scripts at https://ageor.dipot.com/2020/03/Covid-19-in-Greece.html . For instance, this is what I wanted to do, to investigate why the page throws a SecurityError on a stock nightly build, but not on a local (non-debug) artifact build:

visit the site on a stock nightly build and open the debugger
full-text search for "dataStudioOfflineCache"
open the result
pretty print (optional)
put a breakpoint on this.H=(this.D=d.indexedDB)&&
reload the page
wait for the first breakpoint to trip
see what the value of d is (it is probably a restricted window)
repeat the same steps on the other build

However, the moment I start trying to set the breakpoint, Firefox starts freezing. Sometimes it recovers after 10 seconds (on my 2018 macbook), but before I can read the value of d it has almost certainly locked up completely, my CPU fans spin up, and I have to crash the app. I managed to get the nightly build, but an artifact build consistently locks up by the time I'm waiting for it to show me the scope variables.

I wonder if it would be feasible for it to not lock the process up entirely while it's doing whatever it's doing, or at least provide some sort of progress feedback so I can judge whether it has locked up entirely? By contrast the Chrome and Safari devtools manage to get me to the results with only a few seconds of delay here and there.

Update: after about 5 minutes the artifact build managed to get the scopes listed

Alexandre Poirot [:ochameau]

Comment 1

•

1 year ago

I second bomsy conclusions on Slack.
Let me help by providing in-depth analysis.

The opened source https://www.gstatic.com/_/datastudio/_/js/k=datastudio.datastudio.fr.lSd1ASPaFTk.es6.O/d=0/rs=AHAPuoNbbYWNZkijuD11cSSB6oLLrtzh-A/m=sy6,sy3,sya,syb,sy5,sy4,syj,syk,sy9,syn,syp,syg,syx,sy14,syc,sy7,sym,syi,sy12,syf,syo,sy18,sy1e,sy1b,syd,sy15,syq,sy16,sy13,syr,sys,syt,syw,syh,sy11,sy1c,sy1l,sy2,sy8,sye,syl,syu,syv,syy,syz,sy10,sy17,sy19,sy1a,sy1d,sy1f,sy1g,sy1v,sy1k,sy23,sy1o,sy1,sy1m,sy1p,sy1x,sy22,sy24,sy1t,sy1n,sy27,sy20,sy1w,sy1j,sy28,sy1i,sy1r,sy2e,sy2j,sy2k,sy2i,pm_ng2report where dataStudioOfflineCache is defined is 1.6MB large.
This is large, but nothing really impressive for a bundle.

The STR can be simplier and stop at 3. You just need to wait for a few seconds (many tens!) and firefox will freeze.
This STR will reproduce on reload, which makes it quite bad.

Here is a profile record: https://share.firefox.dev/46WuhGM
This highlights very well what is going wrong.
The parser worker spends 13 seconds parsing this 1.6MB JS file.
The worker sends a message via postMessage so large that it takes 3seconds to complete!
This causes a 570MB memory increase in the parent proces!!!
This is quickly followed by a 3s GC pause in the worker... but also a 34s GC pause in the parent process freeing around 300MB.

While we can probably investigate and make the parser worker pass smaller data,
we should probably investigate into avoiding parsing large sources entirely, or make that on-demand.

In a some cleanup I simplified the story around worker parser by having a very unique callsite where we request the worker to parse the source.
It is now only done when selecting a source, over here:
https://searchfox.org/mozilla-central/rev/c615dc4db129ece5cce6c96eb8cab8c5a3e26ac3/devtools/client/debugger/src/actions/sources/select.js#203
We should probably review all the places where we use the parser worker data. i.e. all usages of getSymbols:
https://searchfox.org/mozilla-central/search?q=getSymbols%28&path=devtools%2Fclient%2Fdebugger%2Fsrc&case=true&regexp=false

It is notably used for:

displaying currently opened source functions in quick open search (Ctrl+P)
For this the parser could probably pass a much simplier data to the main thread.
We could also delay until we get an explicit request to lookup for functions...
inline and popup preview
May be the "expression from coords" codepath works well and is enough???
https://searchfox.org/mozilla-central/rev/c615dc4db129ece5cce6c96eb8cab8c5a3e26ac3/devtools/client/debugger/src/actions/preview.js#32-34
In such case we could stop depending on symbols...
We could also delay the parsing for this (but it would be super slow and require some loading indicator).
We could also review what the worker parser outputs.
This code only uses memberExpressions, identifiers, literals attributes:
https://searchfox.org/mozilla-central/rev/c615dc4db129ece5cce6c96eb8cab8c5a3e26ac3/devtools/client/debugger/src/utils/ast.js#11
We could also easily keep the whole data in the worker and have a dedicated request to get the token from a line/column!
The editor uses hasJsx and hasTypes attributes
https://searchfox.org/mozilla-central/rev/c615dc4db129ece5cce6c96eb8cab8c5a3e26ac3/devtools/client/debugger/src/utils/editor/source-documents.js#152-159
This doesn't require passing much data to the main thread!
And may be this can be done without having a real JS parser?
Outlines may be the component using the most data from the parser:
https://searchfox.org/mozilla-central/source/devtools/client/debugger/src/components/PrimaryPanes/Outline.js#314
But this could be done only once Outline is opened, have a load indicator, have a prompt for big files,...
SourceTreeIcon uses the framework attribute
https://searchfox.org/mozilla-central/rev/c615dc4db129ece5cce6c96eb8cab8c5a3e26ac3/devtools/client/debugger/src/components/shared/SourceIcon.js#51-61
This is way too costly for such no so important feature!
mapping of display names:
https://searchfox.org/mozilla-central/rev/c615dc4db129ece5cce6c96eb8cab8c5a3e26ac3/devtools/client/debugger/src/actions/pause/mapDisplayNames.js#14-20
This only uses functions. This is important to display name of function in original files in the call stack.

Now, it would be great to review all the data emitted by the worker.
I think it is defined over there:
https://searchfox.org/mozilla-central/rev/c615dc4db129ece5cce6c96eb8cab8c5a3e26ac3/devtools/client/debugger/src/workers/parser/getSymbols.js#98-111
There might be attributes, which are store very large data set, which aren't used in the main thread.
Dropping all of those would be a easy and significant improvement.
For example comments doesn't seem to be used outside of the worker itself:
https://searchfox.org/mozilla-central/search?q=symbol%3A%23comments&path=debugger%2Fsrc&case=false&regexp=false

Here is some statistics about each attribute of symbols:
functions = 5510810 (5M)
callExpressions = 18516555 (18M)
memberExpressions = 42169971 (42M)
objectProperties = 8261303 (8M)
comments = 662
identifiers = 107075003 (107M !!!)
classes = 2
imports = 2
literals = 639923 (600k)
hasJsx = 5
hasTypes = 5
framework = 9

This is computed by counting the JSON size of each attribute, like this:

diff --git a/devtools/client/debugger/src/reducers/ast.js b/devtools/client/debugger/src/reducers/ast.js
index 9796815727e5c..e7257ab1a6c8d 100644
--- a/devtools/client/debugger/src/reducers/ast.js
+++ b/devtools/client/debugger/src/reducers/ast.js
@@ -40,6 +40,10 @@ function update(state = initialASTState(), action) {
         value: action.value,
         threadActorId: location.sourceActor?.thread,
       };
+      dump(" >> SYMBOLS:\n");
+      for(let i in action.value) {
+        dump(" " + i + " = "+JSON.stringify(action.value[i]).length);
+      }
       if (location.source.isOriginal) {
         state.mutableOriginalSourcesSymbols[location.source.id] = entry;
       } else {

The largest is, by far, identifiers, which is only used by inline/popup preview...
Followed by memberExpressions, same usage.
And callExpressions, which isn't used in the main thread.

Then, once such cleanups are done within the main thread, we can start investigating simplifying the worker itself.
We may have dropped unused data and be able to simplify it and make it faster to respond.