Closed Bug 1161072 Opened 9 years ago Closed 9 years ago

Toolbox/connection shutdown code is broken in many different ways

Tracking

(firefox43 fixed)

Status:

RESOLVED FIXED

Milestone:

Firefox 43

Tracking Flags:

Tracking

Status

firefox43

---

fixed

People

(Reporter: ochameau, Assigned: ochameau)

References

(Depends on 1 open bug)

Details

Attachments

(9 files, 23 obsolete files)

cleanup reconfigure - v1 9 years ago Alexandre Poirot [:ochameau] 11.52 KB, patch		Details \| Diff \| Splinter Review
cleanup reconfigure - v2 9 years ago Alexandre Poirot [:ochameau] 14.92 KB, patch		Details \| Diff \| Splinter Review
trait 9 years ago Alexandre Poirot [:ochameau] 1.12 KB, patch		Details \| Diff \| Splinter Review
Destroy inspector actor on disconnect 9 years ago Alexandre Poirot [:ochameau] 1.09 KB, patch		Details \| Diff \| Splinter Review
Destroy highlighter actor on disconnect 9 years ago Alexandre Poirot [:ochameau] 1.02 KB, patch	ochameau : review+ ochameau : checkin+	Details \| Diff \| Splinter Review
Call hideBoxModel from actor 9 years ago Alexandre Poirot [:ochameau] 1.93 KB, patch		Details \| Diff \| Splinter Review
Destroy the walker actor on disconnect 9 years ago Alexandre Poirot [:ochameau] 2.67 KB, patch		Details \| Diff \| Splinter Review
cleanup reconfigure - v3 9 years ago Alexandre Poirot [:ochameau] 14.92 KB, patch		Details \| Diff \| Splinter Review
cleanup reconfigure - v4 9 years ago Alexandre Poirot [:ochameau] 14.83 KB, patch		Details \| Diff \| Splinter Review
Destroy inspector actor on disconnect 9 years ago Alexandre Poirot [:ochameau] 1.09 KB, patch	pbro : review+	Details \| Diff \| Splinter Review
Prevent "no such actor" exception from style inspector during toolbox shutdown 9 years ago Alexandre Poirot [:ochameau] 1.04 KB, patch	pbro : review+	Details \| Diff \| Splinter Review
cleanup reconfigure - v5 9 years ago Alexandre Poirot [:ochameau] 15.86 KB, patch	jryans : review+	Details \| Diff \| Splinter Review
cleanup reconfigure - v6 9 years ago Alexandre Poirot [:ochameau] 16.23 KB, patch		Details \| Diff \| Splinter Review
cleanup reconfigure - v7 9 years ago Alexandre Poirot [:ochameau] 16.33 KB, patch	ochameau : review+ ochameau : checkin+	Details \| Diff \| Splinter Review
Destroy inspector actor on disconnect 9 years ago Alexandre Poirot [:ochameau] 1.09 KB, patch	ochameau : review+ ochameau : checkin+	Details \| Diff \| Splinter Review
Prevent "no such actor" exception from style inspector during toolbox shutdown 9 years ago Alexandre Poirot [:ochameau] 1.04 KB, patch	ochameau : review+ ochameau : checkin+	Details \| Diff \| Splinter Review
Prevent racing hideBoxModel during connection shutdown. r=pbrosset 9 years ago Alexandre Poirot [:ochameau] 3.71 KB, patch		Details \| Diff \| Splinter Review
Destroy the walker actor on disconnect 9 years ago Alexandre Poirot [:ochameau] 4.14 KB, patch		Details \| Diff \| Splinter Review
cleanup webconsole tests 9 years ago Alexandre Poirot [:ochameau] 2.54 KB, patch		Details \| Diff \| Splinter Review
cleanup webconsole tests v2 9 years ago Alexandre Poirot [:ochameau] 3.73 KB, patch	pbro : review+ ochameau : checkin+	Details \| Diff \| Splinter Review
Fix pending requests in markup view 9 years ago Alexandre Poirot [:ochameau] 3.83 KB, patch		Details \| Diff \| Splinter Review
Record stacks for pending requests 9 years ago J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow) 1.65 KB, patch		Details \| Diff \| Splinter Review
Fix pending requests in markup view (v2) 9 years ago J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow) 4.38 KB, patch	bgrins : review+ jryans : checkin+	Details \| Diff \| Splinter Review
Fix pending requests in rule view 9 years ago J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow) 29.60 KB, patch		Details \| Diff \| Splinter Review
Record stacks for pending requests (v2) 9 years ago J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow) 2.90 KB, patch		Details \| Diff \| Splinter Review
Fix pending showBoxModel request when running browser_markupview_keybindings_04.js 9 years ago Alexandre Poirot [:ochameau] 2.89 KB, patch		Details \| Diff \| Splinter Review
Prevent unnecessary hideBoxModel request 9 years ago Alexandre Poirot [:ochameau] 1.71 KB, patch		Details \| Diff \| Splinter Review
Destroy the walker actor on disconnect - v2 9 years ago Alexandre Poirot [:ochameau] 4.14 KB, patch	bgrins : review+	Details \| Diff \| Splinter Review
Destroy the walker actor on disconnect - v3 9 years ago Alexandre Poirot [:ochameau] 6.92 KB, patch		Details \| Diff \| Splinter Review
Destroy the walker actor on disconnect - v4 9 years ago Alexandre Poirot [:ochameau] 7.17 KB, patch	bgrins : review+ ochameau : checkin+	Details \| Diff \| Splinter Review
Fix browser_animation_target_highlight_select.js - v1 9 years ago Alexandre Poirot [:ochameau] 5.49 KB, patch	pbro : review+	Details \| Diff \| Splinter Review
Fix browser_tilt_picking_inspector.js - v1 9 years ago Alexandre Poirot [:ochameau] 3.00 KB, patch	bgrins : review+	Details \| Diff \| Splinter Review

Alexandre Poirot [:ochameau]

Assignee

Description

•

9 years ago

Anyone contributing seriously to our codebase hit issues in tests or while using tools intensively where the toolbox gets into a broken state or you end up with various random exception in browsre console.
Many of these exception are due to races in the way we are handling connection shutdown on actor side and/or target/toolbox/panels cleanups on client side.

One rule we should try to respect to avoid these exception and broken state is:
Do not attempt to use actors once the toolbox starts closing or the client/connection closes.

Today, we end up calling actors method during toolbox cleanup to reset the state of the actors/tab, but instead, the actor itself should clean things up when its `disconnect` method is called.

Alexandre Poirot [:ochameau]

Assignee

Comment 1

•

9 years ago

Attached patch cleanup reconfigure - v1 (obsolete) — Details — Splinter Review

First one, first mess, we call reconfigure in various place, with various patterns,
sometime we wait for the request, or we don't...
Sometime the request reach the actor but the docshell is already gone.

Just move all the reconfigure reset from client code to the actor.
I also tried to reorder/cleanup code in webbrowser.js.

This patch also improve the overall behavior of this actor,
as it will reenable cache even if the network monitor (or any other code) fails to/doesn't restore it.

https://treeherder.mozilla.org/#/jobs?repo=try&revision=a27155808007
https://treeherder.mozilla.org/#/jobs?repo=try&revision=49d75647d26e

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8601001 - Flags: review?(jryans)

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 2

•

9 years ago

Comment on attachment 8601001 [details] [diff] [review]
cleanup reconfigure - v1

Review of attachment 8601001 [details] [diff] [review]:
-----------------------------------------------------------------

This seems like a good improvement, in general.

However, I don't think it works for servers before the change.  They will still wait to be reconfigured.  So, it seems like a trait is needed.  My guess is this could be true of many of the changes you make here.

Attachment #8601001 - Flags: review?(jryans)

Alexandre Poirot [:ochameau]

Assignee

Comment 3

•

9 years ago

(In reply to J. Ryan Stinnett [:jryans] (use ni?) from comment #2)
> Comment on attachment 8601001 [details] [diff] [review]
>
> However, I don't think it works for servers before the change.  They will
> still wait to be reconfigured.  So, it seems like a trait is needed.  My
> guess is this could be true of many of the changes you make here.

Yes it doesn't clean state for old servers, I did that on purpose to clean that code.
It isn't as important to reset state as in firefox as you reuse the tab easily after a debugging session.
But I reintroduced these calls in my new patch. I would happily remove them if you think that a good breaking compromise ;)
In my next patch you will see a new trait that I'm using for that and all other similar usecases.
I'll land the trait once I cleaned eveything in order to prevent having to introduce one trait for each single cleanup.

Alexandre Poirot [:ochameau]

Assignee

Comment 4

•

9 years ago

Attached patch cleanup reconfigure - v2 (obsolete) — Details — Splinter Review

Alexandre Poirot [:ochameau]

Assignee

Comment 5

•

9 years ago

Attached patch trait (obsolete) — Details — Splinter Review

The trait, to tell that actors do cleanup on disconnect.

Alexandre Poirot [:ochameau]

Assignee

Comment 6

•

9 years ago

Attached patch Destroy inspector actor on disconnect (obsolete) — Details — Splinter Review

https://treeherder.mozilla.org/#/jobs?repo=try&revision=bd403ab20d79

Alexandre Poirot [:ochameau]

Assignee

Comment 7

•

9 years ago

Attached patch Destroy highlighter actor on disconnect — Details — Splinter Review

https://treeherder.mozilla.org/#/jobs?repo=try&revision=e4bbb5570f33

Alexandre Poirot [:ochameau]

Assignee

Comment 8

•

9 years ago

Attached patch Call hideBoxModel from actor (obsolete) — Details — Splinter Review

not from client during shutdown!

https://treeherder.mozilla.org/#/jobs?repo=try&revision=aad4a873edcc

Alexandre Poirot [:ochameau]

Assignee

Comment 9

•

9 years ago

Attached patch Destroy the walker actor on disconnect (obsolete) — Details — Splinter Review

https://treeherder.mozilla.org/#/jobs?repo=try&revision=7b7f6d7ece38

Alexandre Poirot [:ochameau]

Assignee

Comment 10

•

9 years ago

Comment on attachment 8601564 [details] [diff] [review]
Call hideBoxModel from actor

Here is the right try for this patch:
  https://treeherder.mozilla.org/#/jobs?repo=try&revision=2477ea077fa2

Alexandre Poirot [:ochameau]

Assignee

Comment 11

•

9 years ago

Arf, here is the "No such actor for ID: server1.conn2.pagestyle27" exception again, starting from the very first patch related to inspector, attachment 8601560 [details] [diff] [review] "destroy inspector actor on disconnect". I split my cleanup in many patches as I thought it would have only start to fail in one step, but not the very first one :x
In this first step, I'm just adding a disconnect method on InspectorActor with an empty destroy method that only call Actor's one.

That's the same exception that prevents me from landing bug 1145049. I opened this bug as I thought toolbox shutdown was related to it and I would be able to fix/prevent it if I made the toolbox/target cleanup codepath simplier by moving code to the actors.

Alexandre Poirot [:ochameau]

Assignee

Comment 12

•

9 years ago

Something I've seen in many tests is that we close the tab we opened for the test and call `finish()` right after.
That ends up calling toolbox.destroy and its async mess without waiting.
That is a source of overlap between tests, but at the end it shouldn't break the next tests as the next tests are going to open a new tab, new toolbox, new client. It ends up breaking because of warning messages being thrown in the console.
The real issue is that some test ends up throwing and dispatching these warning in the console.

There is a common pattern over multiple files, to pass a promiseWarn method to all promises to log errors.
But I'm wondering if we can get rid of all those as promises now automatically do that?
  http://mxr.mozilla.org/mozilla-central/source/toolkit/devtools/server/protocol.js#18
  http://mxr.mozilla.org/mozilla-central/source/browser/devtools/styleinspector/rule-view.js#48
As well as the various
  .then(null, console.error | Cu.reportError)
?

The only difference is that the error is going to be displayed immediately and always no matter if there is a following promise handler.
Also in the second case (console.error/reportError), if there is a next promise, it will return a resolving (i.e. not rejecting) promise and always log the message.

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 13

•

9 years ago

(In reply to Alexandre Poirot [:ochameau] from comment #3)
> (In reply to J. Ryan Stinnett [:jryans] (use ni?) from comment #2)
> > Comment on attachment 8601001 [details] [diff] [review]
> >
> > However, I don't think it works for servers before the change.  They will
> > still wait to be reconfigured.  So, it seems like a trait is needed.  My
> > guess is this could be true of many of the changes you make here.
> 
> Yes it doesn't clean state for old servers, I did that on purpose to clean
> that code.
> It isn't as important to reset state as in firefox as you reuse the tab
> easily after a debugging session.
> But I reintroduced these calls in my new patch. I would happily remove them
> if you think that a good breaking compromise ;)
> In my next patch you will see a new trait that I'm using for that and all
> other similar usecases.
> I'll land the trait once I cleaned eveything in order to prevent having to
> introduce one trait for each single cleanup.

I think it's actually quite important to ensure the tab is correctly reset:  If the toolbox disabled caching and then you close the toolbox but we fail to reset it, it could lead to a confusing performance experience that is hard to explain, since there's no user-visible indication that caching is off.

So, at least for this particular change, I think a proper trait is important.

Alexandre Poirot [:ochameau]

Assignee

Comment 14

•

9 years ago

(In reply to J. Ryan Stinnett [:jryans] (use ni?) from comment #13)
> I think it's actually quite important to ensure the tab is correctly reset:

Note that for tab, we don't need trait as client and server are in sync and support actor cleanup, only remote usecase would go wrong.

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 15

•

9 years ago

(In reply to Alexandre Poirot [:ochameau] from comment #14)
> (In reply to J. Ryan Stinnett [:jryans] (use ni?) from comment #13)
> > I think it's actually quite important to ensure the tab is correctly reset:
> 
> Note that for tab, we don't need trait as client and server are in sync and
> support actor cleanup, only remote usecase would go wrong.

Hmm, that's true.  Still, it does feel like a bad idea to me, even for the remote case.

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 16

•

9 years ago

I created a wiki page to offer guidance on the right way to design actors:

https://wiki.mozilla.org/DevTools/Actor_Best_Practices

I've only added the main flaw you've been running into for now, but please edit and add to it!  An example might be good to have.

Jordan Santell [:jsantell] [@jsantell]

Comment 17

•

9 years ago

Huh, I didn't know that. How can actors register to the connection being destroyed? Can we have a bug that ensures this is the case everywhere? (Cc me if so!)

José Antonio Olivera Ortega [:jaoo]

Updated

•

9 years ago

Blocks: 1153407

Alexandre Poirot [:ochameau]

Assignee

Comment 18

•

9 years ago

Attached patch cleanup reconfigure - v3 (obsolete) — Details — Splinter Review

It looks like the docshell leaks reported last week are gone,
at least locally, it vanished when rebasing against last tip.

Try without the trait:
  https://treeherder.mozilla.org/#/jobs?repo=try&revision=cb32becabfd1
  https://treeherder.mozilla.org/#/jobs?repo=try&revision=57e200b90813
With it:
  https://treeherder.mozilla.org/#/jobs?repo=try&revision=a830b7bed58d

Attachment #8601001 - Attachment is obsolete: true

Attachment #8601555 - Attachment is obsolete: true

Alexandre Poirot [:ochameau]

Assignee

Comment 19

•

9 years ago

Attached patch cleanup reconfigure - v4 (obsolete) — Details — Splinter Review

https://treeherder.mozilla.org/#/jobs?repo=try&revision=bdd2881a1247
A cache test was still failing on linux32,
a race that should be happening also without this patch...
Can't explain why it is highlighted by this small modification.
We shouldn't do a reload of the document when we reset javascript state from toolbox-options:destroy.

I plan to land this patch before the trait addition, and do the same for most patches.
So that we can use just one trait for all these cleanups fixes.
I'm ensuring that tests are green with/without the trait before landing.

Attachment #8604544 - Attachment is obsolete: true

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8604775 - Flags: review?(jryans)

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 20

•

9 years ago

Comment on attachment 8604775 [details] [diff] [review]
cleanup reconfigure - v4

Review of attachment 8604775 [details] [diff] [review]:
-----------------------------------------------------------------

Patch seems fine, but I really think you need a specific trait.  Let's discuss further if you disagree.

::: browser/devtools/framework/toolbox.js
@@ +1748,5 @@
>  
>      // Now that we are closing the toolbox we can re-enable the cache settings
>      // and disable the service workers testing settings for the current tab.
> +    // FF40+ automatically cleans up state in actor on disconnect.
> +    if (this.target.activeTab && !this.target.activeTab.traits.actorsCleanup) {

I think it feels strange to have a trait that says "things are better".  It's too vague.  Next time someone cleans up something, what happens?  |actorsCleanup2|?

Look at the list of existing traits[1].  Each trait's name summarizes its purpose.

The traits are used beyond Gecko as well, like in Valence.  With a name like "actorsCleanup", it's not at all clear if Valence should say true or false.

Also, by grouping a bunch of unrelated things in one trait, you force all servers (like Valence) to either do *everything* you did or *nothing*, since there is no in-between.

So, for this specific fix, use something like:

noTabReconfigureOnClose

[1]: https://dxr.mozilla.org/mozilla-central/source/toolkit/devtools/server/actors/root.js#109

Attachment #8604775 - Flags: review?(jryans)

Alexandre Poirot [:ochameau]

Assignee

Comment 21

•

9 years ago

(In reply to J. Ryan Stinnett [:jryans] (use ni?) from comment #20)
> Comment on attachment 8604775 [details] [diff] [review]
>
> I think it feels strange to have a trait that says "things are better". 
> It's too vague.  Next time someone cleans up something, what happens? 
> |actorsCleanup2|?

I know but I was scared to introduce tons of traits in this cleanup quest.
But that would be ok, if I can, at least shared a cleanup trait for the inspector which will need various tweaks.
(hideBoxModel, highlighter, walker)

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 22

•

9 years ago

(In reply to Alexandre Poirot [:ochameau] from comment #21)
> (In reply to J. Ryan Stinnett [:jryans] (use ni?) from comment #20)
> > Comment on attachment 8604775 [details] [diff] [review]
> >
> > I think it feels strange to have a trait that says "things are better". 
> > It's too vague.  Next time someone cleans up something, what happens? 
> > |actorsCleanup2|?
> 
> I know but I was scared to introduce tons of traits in this cleanup quest.
> But that would be ok, if I can, at least shared a cleanup trait for the
> inspector which will need various tweaks.
> (hideBoxModel, highlighter, walker)

Sharing just for the inspector sounds potentially reasonable assuming the set of changes are related as group.  It's kind of hard to say for sure without looking at the changes, though.

You could potentially make use of traits on the actor instead of the root, if that seems helpful, as in webbrowser[1].  But, you added those, so I guess you already know. :)

[1]: https://hg.mozilla.org/mozilla-central/annotate/617dbce26726/toolkit/devtools/server/actors/webbrowser.js#l2074

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Assignee: nobody → poirot.alex

Alexandre Poirot [:ochameau]

Assignee

Comment 23

•

9 years ago

Attached patch Destroy inspector actor on disconnect (obsolete) — Details — Splinter Review

Just rebased.

It looks like I should be able to land this patch.
Try is now green (If I ignore the always failing dt1/dt2 on linux32).
But I also need another fix (next patch).

https://treeherder.mozilla.org/#/jobs?repo=try&revision=b781bbad521f

This patch doesn't introduce incompatibilities with old/new client/server,
it just improves the actor.

Alexandre Poirot [:ochameau]

Assignee

Comment 24

•

9 years ago

Attached patch Prevent "no such actor" exception from style inspector during toolbox shutdown (obsolete) — Details — Splinter Review

As I sligthly change the destruction order, the rule-view code starts throwing
in some really rare race where the rule-view dispatches a getApplied request
whereas the targeted actor is already destroyed.
`polulate` here, is called by a setTimeout from inspector-panel:scheduleLayoutChange().
The race happens when the timeout fires *just before* we call toolbox.destroy, really just before.
At the time we call front.getApplied, the front and the actor are still alive,
but by the time the request reach the server side code, the actor is gone.

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8605183 - Flags: review?(pbrosset)

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8605185 - Flags: review?(pbrosset)

Alexandre Poirot [:ochameau]

Assignee

Comment 25

•

9 years ago

Attached patch cleanup reconfigure - v5 (obsolete) — Details — Splinter Review

With specific trait.
https://treeherder.mozilla.org/#/jobs?repo=try&revision=4031bb91ee40
As the trait is now specific I'm landing it combined.

Attachment #8601558 - Attachment is obsolete: true

Attachment #8604775 - Attachment is obsolete: true

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8605207 - Flags: review?(jryans)

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 26

•

9 years ago

Comment on attachment 8605207 [details] [diff] [review]
cleanup reconfigure - v5

Review of attachment 8605207 [details] [diff] [review]:
-----------------------------------------------------------------

Thanks for using a specific trait, it feels much better to me now.

::: browser/devtools/framework/toolbox-options.js
@@ +375,5 @@
>      this._removeListeners();
>  
>      if (this.target.activeTab) {
> +      this.disableJSNode.removeEventListener("click", this._disableJSClicked);
> +      // FF40+ automatically cleans up state in actor on disconnect

Nit: It's 41 now... (Unless you plan to uplift)

::: browser/devtools/framework/toolbox.js
@@ +1747,5 @@
>      }
>  
>      // Now that we are closing the toolbox we can re-enable the cache settings
>      // and disable the service workers testing settings for the current tab.
> +    // FF40+ automatically cleans up state in actor on disconnect.

Nit: It's 41 now... (Unless you plan to uplift)

::: toolkit/devtools/server/actors/webbrowser.js
@@ +1374,5 @@
>  
>    /**
>     * Handle logic to enable/disable JS/cache/Service Worker testing.
>     */
>    _toggleDevtoolsSettings: function(options) {

Super Nit: If you could change Devtools to DevTools, that would be great!

Attachment #8605207 - Flags: review?(jryans) → review+

Patrick Brosset <:pbro>

Updated

•

9 years ago

Attachment #8605183 - Flags: review?(pbrosset) → review+

Patrick Brosset <:pbro>

Updated

•

9 years ago

Attachment #8605185 - Flags: review?(pbrosset) → review+

Alexandre Poirot [:ochameau]

Assignee

Comment 27

•

9 years ago

Attached patch cleanup reconfigure - v6 (obsolete) — Details — Splinter Review

Attachment #8605207 - Attachment is obsolete: true

Alexandre Poirot [:ochameau]

Assignee

Comment 28

•

9 years ago

Attached patch cleanup reconfigure - v7 — Details — Splinter Review

Rebased against recent tip, with new try:
  https://treeherder.mozilla.org/#/jobs?repo=try&revision=dbd815bd6a64

Attachment #8605315 - Attachment is obsolete: true

Alexandre Poirot [:ochameau]

Assignee

Comment 29

•

9 years ago

Attached patch Destroy inspector actor on disconnect — Details — Splinter Review

Attachment #8605183 - Attachment is obsolete: true

Alexandre Poirot [:ochameau]

Assignee

Comment 30

•

9 years ago

Attached patch Prevent "no such actor" exception from style inspector during toolbox shutdown — Details — Splinter Review

Attachment #8605185 - Attachment is obsolete: true

Pulsebot

Comment 31

•

9 years ago

https://hg.mozilla.org/integration/fx-team/rev/613ea0ac6707
https://hg.mozilla.org/integration/fx-team/rev/bd930602d606
https://hg.mozilla.org/integration/fx-team/rev/c9d7db3555f7

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8607132 - Flags: review+

Attachment #8607132 - Flags: checkin+

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8607134 - Flags: review+

Attachment #8607134 - Flags: checkin+

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8607136 - Flags: review+

Attachment #8607136 - Flags: checkin+

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Keywords: leave-open

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8601560 - Attachment is obsolete: true

Wes Kocher (:KWierso) (Not reading bugmail; email directly if needed)

Comment 32

•

9 years ago

https://hg.mozilla.org/mozilla-central/rev/613ea0ac6707
https://hg.mozilla.org/mozilla-central/rev/bd930602d606
https://hg.mozilla.org/mozilla-central/rev/c9d7db3555f7

Alexandre Poirot [:ochameau]

Assignee

Comment 33

•

9 years ago

Hum... it looks like the other patches now applies without making try look like a chrystmas tree?!
  https://treeherder.mozilla.org/#/jobs?repo=try&revision=a7c769b811ef

Let's see if that's also somewhat green on linux32:
  https://treeherder.mozilla.org/#/jobs?repo=try&revision=ce56177cdf8b

Alexandre Poirot [:ochameau]

Assignee

Comment 34

•

9 years ago

Attached patch Prevent racing hideBoxModel during connection shutdown. r=pbrosset (obsolete) — Details — Splinter Review

Just realized these two patches were disabled as I renamed the trait name...
New patches with traits on related actors and not on the root actor.

https://treeherder.mozilla.org/#/jobs?repo=try&revision=2689d0e035de

Attachment #8601564 - Attachment is obsolete: true

Alexandre Poirot [:ochameau]

Assignee

Comment 35

•

9 years ago

Attached patch Destroy the walker actor on disconnect (obsolete) — Details — Splinter Review

Attachment #8601565 - Attachment is obsolete: true

Pulsebot

Comment 36

•

9 years ago

https://hg.mozilla.org/integration/fx-team/rev/caaf9d16a00d

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8601562 - Flags: review+

Attachment #8601562 - Flags: checkin+

Alexandre Poirot [:ochameau]

Assignee

Comment 37

•

9 years ago

(In reply to Alexandre Poirot [:ochameau] from comment #34)
> Created attachment 8607602 [details] [diff] [review]
> Prevent racing hideBoxModel during connection shutdown. r=pbrosset
> 
> Just realized these two patches were disabled as I renamed the trait name...
> New patches with traits on related actors and not on the root actor.
> 
> https://treeherder.mozilla.org/#/jobs?repo=try&revision=2689d0e035de

Ok... Better, that actually fails as expected!

Carsten Book [:Tomcat]

Comment 38

•

9 years ago

https://hg.mozilla.org/mozilla-central/rev/caaf9d16a00d

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Depends on: 1166774

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Depends on: 1167174

Alexandre Poirot [:ochameau]

Assignee

Comment 39

•

9 years ago

Attached patch cleanup webconsole tests (obsolete) — Details — Splinter Review

This patch ensures that webconsole tests do not have pending requests on test end.
They used to have various highlighter requests pending like showBoxModel.

The issue in console-output.js is that we ended up registering more than one mousover listener
and dispatched multiple showBoxModel requests.

https://treeherder.mozilla.org/#/jobs?repo=try&revision=7c411dba02be

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Depends on: 1167181

Alexandre Poirot [:ochameau]

Assignee

Comment 40

•

9 years ago

All the patches and deps on try:
  https://treeherder.mozilla.org/#/jobs?repo=try&revision=0ad3b868ee6e

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8608764 - Flags: review?(past)

Panos Astithas (he/him) [:past] (please ni?)

Comment 41

•

9 years ago

Comment on attachment 8608764 [details] [diff] [review]
cleanup webconsole tests

Review of attachment 8608764 [details] [diff] [review]:
-----------------------------------------------------------------

LGTM, but Patrick knows this code better.

Attachment #8608764 - Flags: review?(past) → review?(pbrosset)

Panos Astithas (he/him) [:past] (please ni?)

Updated

•

9 years ago

Status: NEW → ASSIGNED

Patrick Brosset <:pbro>

Comment 42

•

9 years ago

Comment on attachment 8608764 [details] [diff] [review]
cleanup webconsole tests

Review of attachment 8608764 [details] [diff] [review]:
-----------------------------------------------------------------

::: browser/devtools/webconsole/test/browser_webconsole_output_dom_elements_03.js
@@ +58,5 @@
>  function* hoverOverWidget(widget, toolbox) {
>    info("Hovering over the output to highlight the node");
>  
>    let onHighlight = toolbox.once("node-highlight");
> +  let onHighlighterShown = toolbox.once("highlighter-ready");

I don't understand why you have to wait for this event too. It's sent as a result of "ready" being sent at the end of BoxModelHighlighter._show, which is called by the HighlighterActor's showBoxModel protocol method, which is called in toolbox-highlighter-utils's highlightNodeFront function.
tl;dr; "node-highlight" is emitted after "highlighter-ready", so it doesn't seem required to wait for it.

Attachment #8608764 - Flags: review?(pbrosset)

Alexandre Poirot [:ochameau]

Assignee

Comment 43

•

9 years ago

(In reply to Patrick Brosset [:pbrosset] [:patrick] [:pbro] from comment #42)
> Comment on attachment 8608764 [details] [diff] [review]
> cleanup webconsole tests
> 
> Review of attachment 8608764 [details] [diff] [review]:
> -----------------------------------------------------------------
> 
> :::
> browser/devtools/webconsole/test/browser_webconsole_output_dom_elements_03.js
> @@ +58,5 @@
> >  function* hoverOverWidget(widget, toolbox) {
> >    info("Hovering over the output to highlight the node");
> >  
> >    let onHighlight = toolbox.once("node-highlight");
> > +  let onHighlighterShown = toolbox.once("highlighter-ready");
> 
> I don't understand why you have to wait for this event too. It's sent as a
> result of "ready" being sent at the end of BoxModelHighlighter._show, which
> is called by the HighlighterActor's showBoxModel protocol method, which is
> called in toolbox-highlighter-utils's highlightNodeFront function.
> tl;dr; "node-highlight" is emitted after "highlighter-ready", so it doesn't
> seem required to wait for it.

I do not remember why I added this with all the tweaks I made to so many tests :/
You are right, I must have been confused with bug 1167174, or some other patch in my queue did a subtle change to this codepath. But the fact is that I no longer need to wait for highlighter-ready here.
The real issue shifted to browser_webconsole_output_dom_elements_02.js, which makes browser_webconsole_output_dom_elements_04.js to fail (with the two other patches to land in this patch).
I'll post an updated patch shortly.

Alexandre Poirot [:ochameau]

Assignee

Comment 44

•

9 years ago

Attached patch cleanup webconsole tests v2 — Details — Splinter Review

https://treeherder.mozilla.org/#/jobs?repo=try&revision=8facf3a4766e

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8611187 - Flags: review?(pbrosset)

Patrick Brosset <:pbro>

Comment 45

•

9 years ago

Comment on attachment 8611187 [details] [diff] [review]
cleanup webconsole tests v2

Review of attachment 8611187 [details] [diff] [review]:
-----------------------------------------------------------------

Looks good to me now.

Attachment #8611187 - Flags: review?(pbrosset) → review+

Alexandre Poirot [:ochameau]

Assignee

Comment 46

•

9 years ago

Attached patch Fix pending requests in markup view (obsolete) — Details — Splinter Review

The issue here is that markup view dispatch some WalkerActor.insertBefore request
on some mouse up event, but we do not track progress of this request
and the request may still be pending on destroy.
The mouse listener is here:
  http://mxr.mozilla.org/mozilla-central/source/browser/devtools/markupview/markup-view.js#1931

Here is a quite hacky way to address this...
Handling all requests nicely on shutdown in hard.
I'have been chasing them down for quite a while now and just on the inspector.
Such practice (flushPendingRequests) may solve the overall issue.

Ryan, I still think the exception/error thrown when there is still a request in queue
cost us a lot and shouldn't exists. Things should just work.
But I have to agree it also help discovering issues. I've recently highlighted
an anormal number of requests in rule-view.
And also, we shouldn't ever have pending requests/code overlapping between tests.
This patch may address that, but I'm also scared to put such setTimeout within destruction
codepath. It may slowdown toolbox shutdown or worse, prevent it from shutting down!

...Feedback welcomed on this particular markup view scenario and on the overall issue...

Attachment #8613515 - Flags: feedback?(jryans)

Pulsebot

Comment 47

•

9 years ago

https://hg.mozilla.org/integration/fx-team/rev/c32e6f44cc4d

Wes Kocher (:KWierso) (Not reading bugmail; email directly if needed)

Comment 48

•

9 years ago

https://hg.mozilla.org/mozilla-central/rev/c32e6f44cc4d

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 49

•

9 years ago

Comment on attachment 8613515 [details] [diff] [review]
Fix pending requests in markup view

Review of attachment 8613515 [details] [diff] [review]:
-----------------------------------------------------------------

I still feel like the pending request during shutdown error is helpful.  As you admit, it's pointing out problems.  To me it's evidence of race conditions that need to be fixed at some point, even if things may "usually" work.

Let me take a look at this part and see what I can think of.  Do the tests fail every time without this patch, or do I need a special setup to cause the errors?

Attachment #8613515 - Flags: feedback?(jryans)

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8608764 - Attachment is obsolete: true

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8611187 - Flags: checkin+

Alexandre Poirot [:ochameau]

Assignee

Comment 50

•

9 years ago

(In reply to J. Ryan Stinnett [:jryans] (use ni?) from comment #49)
> Let me take a look at this part and see what I can think of.  Do the tests
> fail every time without this patch, or do I need a special setup to cause
> the errors?

You need to apply attachment 8607602 [details] [diff] [review] and attachment 8607603 [details] [diff] [review].
You can see the related test (for ex: markupview_dragdrop_autoscroll.js) fail in comment 34's try.
Note that you may have to run all the tests to get it to fail...

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Updated

•

9 years ago

Flags: needinfo?(jryans)

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 51

•

9 years ago

Attached patch Record stacks for pending requests (obsolete) — Details — Splinter Review

It can be tricky to debug these pending requests as-is.

This adds the stack from the time the request was made, which is more useful to find where it came from.

Attachment #8615078 - Flags: review?(poirot.alex)

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 52

•

9 years ago

Attached patch Fix pending requests in markup view (v2) — Details — Splinter Review

I spent some time on markup view tests, and I still feel like the pending request error should remain.

Many times they are demonstrating clear race conditions in the product or test, and could be related to other intermittents.  So, I think fixing them is a better idea than turning off the error.

When you have something like these cases of waiting on the result of an event handler, a good solution is for the tool to emit some event once the RDP request completes, which the test can then wait for.  This is what I've seen done very heavily in the net monitor and web audio tools, and it really helps make the tests more correct.

I've fixed up the markup view tests with this approach.

Attachment #8613515 - Attachment is obsolete: true

Flags: needinfo?(jryans)

Attachment #8615080 - Flags: feedback?(poirot.alex)

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 53

•

9 years ago

Attached patch Fix pending requests in rule view (obsolete) — Details — Splinter Review

I kept going and did the rule view tests too, for more evidence that the approach works.

That's all for now. :)

Attachment #8615082 - Flags: feedback?(poirot.alex)

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 54

•

9 years ago

Try: https://treeherder.mozilla.org/#/jobs?repo=try&revision=e258c9c9ab3f

Alexandre Poirot [:ochameau]

Assignee

Comment 55

•

9 years ago

Comment on attachment 8615078 [details] [diff] [review]
Record stacks for pending requests

Review of attachment 8615078 [details] [diff] [review]:
-----------------------------------------------------------------

::: toolkit/devtools/server/protocol.js
@@ +1168,5 @@
>      this._requests.push({
>        deferred,
>        to: to || this.actorID,
> +      type,
> +      stack: new Error().stack

Could you do that conditionally, as I imagine it will use additional memory whereas we care about that only while debugging.

Attachment #8615078 - Flags: review?(poirot.alex)

Alexandre Poirot [:ochameau]

Assignee

Comment 56

•

9 years ago

Comment on attachment 8615082 [details] [diff] [review]
Fix pending requests in rule view

Review of attachment 8615082 [details] [diff] [review]:
-----------------------------------------------------------------

I'm already fixing the ruleview-changed in bug 1166774, attachment 8612357 [details] [diff] [review].

Attachment #8615082 - Flags: feedback?(poirot.alex)

Alexandre Poirot [:ochameau]

Assignee

Comment 57

•

9 years ago

Comment on attachment 8615080 [details] [diff] [review]
Fix pending requests in markup view (v2)

Review of attachment 8615080 [details] [diff] [review]:
-----------------------------------------------------------------

(In reply to J. Ryan Stinnett [:jryans] (use ni?) from comment #52)
> Created attachment 8615080 [details] [diff] [review]
> Fix pending requests in markup view (v2)
> Many times they are demonstrating clear race conditions in the product or
> test, and could be related to other intermittents.  So, I think fixing them
> is a better idea than turning off the error.
> 
> When you have something like these cases of waiting on the result of an
> event handler, a good solution is for the tool to emit some event once the
> RDP request completes, which the test can then wait for.  This is what I've
> seen done very heavily in the net monitor and web audio tools, and it really
> helps make the tests more correct.

That's what I've been doing so far, but my concern was that we only fix those possible races in tests.
We are waiting for these events only from the test script.

ruleview-changed is something used in tests, but also in production code
that made sense to me to listen and use this existing event in tests.
But I was reluctant to introduce such event, like "drop-completed", just for tests,
with no usage in production code.
This is why I did fix the ruleview-changed and took some time to ask your feedback on that one.

I think it is more up to inspector maintainer to make the call for this precise case/patch.

::: browser/devtools/markupview/markup-view.js
@@ +1950,5 @@
>  
> +    yield this.markup.walker.insertBefore(this.node, dropTargetNodes.parent,
> +                                          dropTargetNodes.nextSibling);
> +    this.markup.emit("drop-completed");
> +  }),

May be we could avoid Task by just returning insertBefore promise, as in the two tests you are fixing are calling this method directly.

Attachment #8615080 - Flags: feedback?(poirot.alex) → review?(pbrosset)

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 58

•

9 years ago

(In reply to Alexandre Poirot [:ochameau] from comment #55)
> Comment on attachment 8615078 [details] [diff] [review]
> Record stacks for pending requests
> 
> Review of attachment 8615078 [details] [diff] [review]:
> -----------------------------------------------------------------
> 
> ::: toolkit/devtools/server/protocol.js
> @@ +1168,5 @@
> >      this._requests.push({
> >        deferred,
> >        to: to || this.actorID,
> > +      type,
> > +      stack: new Error().stack
> 
> Could you do that conditionally, as I imagine it will use additional memory
> whereas we care about that only while debugging.

Okay, like as a some "debug" boolean in the file?  Mostly I just want to land something even if it's commented out, as I've written this same debugging aid about 5 times now...

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 59

•

9 years ago

(In reply to Alexandre Poirot [:ochameau] from comment #56)
> Comment on attachment 8615082 [details] [diff] [review]
> Fix pending requests in rule view
> 
> Review of attachment 8615082 [details] [diff] [review]:
> -----------------------------------------------------------------
> 
> I'm already fixing the ruleview-changed in bug 1166774, attachment 8612357 [details] [diff] [review]
> [details] [diff] [review].

Haha, I should have known... :)

For me, this set of tests convinced me more strongly that the error is helpful.  It illuminated several places where the tests were waiting incorrectly, or were out of sync with what actually triggers changes to be sent to the server.

Alexandre Poirot [:ochameau]

Assignee

Comment 60

•

9 years ago

(In reply to J. Ryan Stinnett [:jryans] (use ni?) from comment #58)
> Okay, like as a some "debug" boolean in the file?  Mostly I just want to
> land something even if it's commented out, as I've written this same
> debugging aid about 5 times now...

Yes, or gDevtools.testing as you prefer. I used the exact same trick ;)

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 61

•

9 years ago

(In reply to Alexandre Poirot [:ochameau] from comment #57)
> Comment on attachment 8615080 [details] [diff] [review]
> Fix pending requests in markup view (v2)
> 
> Review of attachment 8615080 [details] [diff] [review]:
> -----------------------------------------------------------------
> 
> (In reply to J. Ryan Stinnett [:jryans] (use ni?) from comment #52)
> > Created attachment 8615080 [details] [diff] [review]
> > Fix pending requests in markup view (v2)
> > Many times they are demonstrating clear race conditions in the product or
> > test, and could be related to other intermittents.  So, I think fixing them
> > is a better idea than turning off the error.
> > 
> > When you have something like these cases of waiting on the result of an
> > event handler, a good solution is for the tool to emit some event once the
> > RDP request completes, which the test can then wait for.  This is what I've
> > seen done very heavily in the net monitor and web audio tools, and it really
> > helps make the tests more correct.
> 
> That's what I've been doing so far, but my concern was that we only fix
> those possible races in tests.
> We are waiting for these events only from the test script.
> 
> ruleview-changed is something used in tests, but also in production code
> that made sense to me to listen and use this existing event in tests.
> But I was reluctant to introduce such event, like "drop-completed", just for
> tests,
> with no usage in production code.
> This is why I did fix the ruleview-changed and took some time to ask your
> feedback on that one.

In other tools like net monitor, debugger, and web audio, there are many events used in this way (only from the test script) so it does not worry me very much.

Overall, I think we should be able to find a better solution for these cases than the timeout idea you proposed.  Waiting on events that result from the requests completing handles many cases nicely and improves correctness of the test scripts.

> I think it is more up to inspector maintainer to make the call for this
> precise case/patch.

Of course, I agree, it's up to each tool maintainer, as it should be.  However, I would be interested to hear if people think this event approach is bad, so I can discuss it further.

Patrick Brosset <:pbro>

Comment 62

•

9 years ago

(In reply to Alexandre Poirot [:ochameau] from comment #46)
> Ryan, I still think the exception/error thrown when there is still a request
> in queue
> cost us a lot and shouldn't exists. Things should just work.
> But I have to agree it also help discovering issues. I've recently
> highlighted
> an anormal number of requests in rule-view.
> And also, we shouldn't ever have pending requests/code overlapping between
> tests.

(In reply to J. Ryan Stinnett [:jryans] (use ni?) from comment #49)
> I still feel like the pending request during shutdown error is helpful.  As
> you admit, it's pointing out problems.  To me it's evidence of race
> conditions that need to be fixed at some point, even if things may "usually"
> work.

My opinion on the request-pending-at-shutdown error is that it's useful to tell you something is wrong and helps discover problems that need to be fixed. We've already fixed quite a bunch of inspector related problems in the past that happened when the toolbox was closed.
It's also hard to understand sometimes, especially for newcomers to the project, so attaching the stack-trace in debug is a great idea.
The way I've usually gone so far about fixing these in tests was what Alex has used in the rule-view and what Ryan proposed to add in the markup-view: listening for the right type of event, even if that means adding an event that's not used in the product code.
At least we make sure tests run to completion and are as independent of each other as possible.
Isn't there something we can do at the debugger server level, protocol level, test harness level to make sure there's no test overlapping? I've always been so surprised that one test could influence the next one, this feels wrong to me. I'd say let's make sure all tests are properly sandboxed.

Patrick Brosset <:pbro>

Comment 63

•

9 years ago

(In reply to Alexandre Poirot [:ochameau] from comment #57)
> ruleview-changed is something used in tests, but also in production code
> that made sense to me to listen and use this existing event in tests.
> But I was reluctant to introduce such event, like "drop-completed", just for
> tests,
> with no usage in production code.
> This is why I did fix the ruleview-changed and took some time to ask your
> feedback on that one.
> 
> I think it is more up to inspector maintainer to make the call for this
> precise case/patch.
I like the idea of making the tests self-contained and if that means waiting for the proper event, even if that event is only here for tests, I think we should do it.

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 64

•

9 years ago

Attached patch Record stacks for pending requests (v2) (obsolete) — Details — Splinter Review

Try: https://treeherder.mozilla.org/#/jobs?repo=try&revision=a69c7fcb8998

Attachment #8615078 - Attachment is obsolete: true

Attachment #8615311 - Flags: review?(poirot.alex)

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Updated

•

9 years ago

Depends on: 1171654

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 65

•

9 years ago

Comment on attachment 8615311 [details] [diff] [review]
Record stacks for pending requests (v2)

Moving this to bug 1171654.

Attachment #8615311 - Attachment is obsolete: true

Attachment #8615311 - Flags: review?(poirot.alex)

Alexandre Poirot [:ochameau]

Assignee

Comment 66

•

9 years ago

Patrick, Could you proceed with attachment 8615080 [details] [diff] [review], better fix the tests than doing nothing!

Ryan, could you submit a try with just this patch to see if we can land it?

Flags: needinfo?(pbrosset)

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 67

•

9 years ago

What do you want me to do?  Maybe you should submit it since I am not sure... :P

Alexandre Poirot [:ochameau]

Assignee

Comment 68

•

9 years ago

I had to pull this patch in my queue to see where we are with destruction these days...
Here is what I meant, just a try push to try to push your patch.
  https://treeherder.mozilla.org/#/jobs?repo=try&revision=57a6291d8e18

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 69

•

9 years ago

(In reply to Alexandre Poirot [:ochameau] from comment #68)
> I had to pull this patch in my queue to see where we are with destruction
> these days...
> Here is what I meant, just a try push to try to push your patch.
>   https://treeherder.mozilla.org/#/jobs?repo=try&revision=57a6291d8e18

Oh wow, I'm sorry, I forgot I had even written a patch in this set, so then I was really confused about what you meant I should do here. :/

Thanks for pushing the updated try!

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 70

•

9 years ago

Comment on attachment 8615080 [details] [diff] [review]
Fix pending requests in markup view (v2)

Since Patrick is out for a bit, perhaps Brian can review.

Attachment #8615080 - Flags: review?(pbrosset) → review?(bgrinstead)

Brian Grinstead [:bgrins]

Comment 71

•

9 years ago

Comment on attachment 8615080 [details] [diff] [review]
Fix pending requests in markup view (v2)

Review of attachment 8615080 [details] [diff] [review]:
-----------------------------------------------------------------

::: browser/devtools/markupview/markup-view.js
@@ +1950,5 @@
>  
> +    yield this.markup.walker.insertBefore(this.node, dropTargetNodes.parent,
> +                                          dropTargetNodes.nextSibling);
> +    this.markup.emit("drop-completed");
> +  }),

We could, but I'm happy with being more explicit by adding the event.  It's easier to follow what's going on from inside the test.

Also, we have lots of cases where events are only used in tests (like reselectedonremoved / canceledreselectonremoved / text-expand in this file).  It's a pretty common pattern for this tool.

Attachment #8615080 - Flags: review?(bgrinstead) → review+

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Updated

•

9 years ago

Flags: needinfo?(pbrosset)

Pulsebot

Comment 72

•

9 years ago

https://hg.mozilla.org/integration/fx-team/rev/d2203c78c12d

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Updated

•

9 years ago

Attachment #8615080 - Flags: checkin+

Alexandre Poirot [:ochameau]

Assignee

Comment 73

•

9 years ago

New try with various additional fixes:
  https://treeherder.mozilla.org/#/jobs?repo=try&revision=e1fb8b2943df

Alexandre Poirot [:ochameau]

Assignee

Comment 74

•

9 years ago

Comment on attachment 8615082 [details] [diff] [review]
Fix pending requests in rule view

Already done in bug 1166774.

Attachment #8615082 - Attachment is obsolete: true

Alexandre Poirot [:ochameau]

Assignee

Comment 75

•

9 years ago

Attached patch Fix pending showBoxModel request when running browser_markupview_keybindings_04.js (obsolete) — Details — Splinter Review

Alexandre Poirot [:ochameau]

Assignee

Comment 76

•

9 years ago

Attached patch Prevent unnecessary hideBoxModel request (obsolete) — Details — Splinter Review

https://treeherder.mozilla.org/#/jobs?repo=try&revision=e5f66d1bf589

Carsten Book [:Tomcat]

Comment 77

•

9 years ago

https://hg.mozilla.org/mozilla-central/rev/d2203c78c12d

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Updated

•

9 years ago

Blocks: 1136931

Alexandre Poirot [:ochameau]

Assignee

Comment 78

•

9 years ago

New try, with another set of patches.
  https://treeherder.mozilla.org/#/jobs?repo=try&revision=f5fa56763526
Still cleaning inspector/markup-view tests that overlap...

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Depends on: 1184192

Alexandre Poirot [:ochameau]

Assignee

Comment 79

•

9 years ago

Except one intermittent in an animation inspector test, the previous try was finally looking good!!
Here is a new, rebased try:
  https://treeherder.mozilla.org/#/jobs?repo=try&revision=71fe77bea237

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Blocks: 1186937

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8631564 - Attachment is obsolete: true

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8631565 - Attachment is obsolete: true

Alexandre Poirot [:ochameau]

Assignee

Comment 80

•

9 years ago

Comment on attachment 8607602 [details] [diff] [review]
Prevent racing hideBoxModel during connection shutdown. r=pbrosset

I'll move this one to bug 1136931.

Attachment #8607602 - Attachment is obsolete: true

Alexandre Poirot [:ochameau]

Assignee

Comment 81

•

9 years ago

Attached patch Destroy the walker actor on disconnect - v2 (obsolete) — Details — Splinter Review

Rebased.
https://treeherder.mozilla.org/#/jobs?repo=try&revision=c2f8fab83fb8

Attachment #8607603 - Attachment is obsolete: true

Alexandre Poirot [:ochameau]

Assignee

Comment 82

•

9 years ago

The overall goal is also to make it possible to land the inspector tests refactoring...
Let's see if that's still somewhat green (I may need to rebase/tweak the refactoring for new/updated tests):
  https://treeherder.mozilla.org/#/jobs?repo=try&revision=e22078f41ad6

Alexandre Poirot [:ochameau]

Assignee

Comment 83

•

9 years ago

Comment on attachment 8639749 [details] [diff] [review]
Destroy the walker actor on disconnect - v2

Review of attachment 8639749 [details] [diff] [review]:
-----------------------------------------------------------------

Brian, Do you mind keeping reviewing all my inspector patches until Patrick get back?
This patch looks simple, but is pretty significant, especially regarding tests!
I had to fix them all (see all deps and other patches).
With this patch, we finally start releasing the walker actor on server side.
Before all these patches we kept reusing the same actors when closing/reopening the toolbox.
This is just a start as we are still leaking many things (bug 1186937 is going to help and is the next thing to review ;)),
and also we are still not freeing NodeActor's.

Attachment #8639749 - Flags: review?(bgrinstead)

Alexandre Poirot [:ochameau]

Assignee

Comment 84

•

9 years ago

(In reply to Alexandre Poirot [:ochameau] from comment #82)
> The overall goal is also to make it possible to land the inspector tests
> refactoring...
> Let's see if that's still somewhat green (I may need to rebase/tweak the
> refactoring for new/updated tests):
>   https://treeherder.mozilla.org/#/jobs?repo=try&revision=e22078f41ad6

I do need to rebase... and there is a suspicious leak, but it is still worth landing attachment 8639749 [details] [diff] [review] as it keep try green without all the refactoring patches.

Brian Grinstead [:bgrins]

Comment 85

•

9 years ago

(In reply to Alexandre Poirot [:ochameau] from comment #83)
> Comment on attachment 8639749 [details] [diff] [review]
> Destroy the walker actor on disconnect - v2
> 
> Review of attachment 8639749 [details] [diff] [review]:
> -----------------------------------------------------------------
> 
> Brian, Do you mind keeping reviewing all my inspector patches until Patrick
> get back?
> This patch looks simple, but is pretty significant, especially regarding
> tests!
> I had to fix them all (see all deps and other patches).
> With this patch, we finally start releasing the walker actor on server side.
> Before all these patches we kept reusing the same actors when
> closing/reopening the toolbox.
> This is just a start as we are still leaking many things (bug 1186937 is
> going to help and is the next thing to review ;)),
> and also we are still not freeing NodeActor's.

I'm not sure I understand what you mean by 'Before all these patches we kept reusing the same actors when closing/reopening the toolbox'.  Wasn't the call to .release() in fact destroying the WalkerActor (assuming that the toolbox close didn't have some error that prevented that function from being called).

Flags: needinfo?(poirot.alex)

Brian Grinstead [:bgrins]

Comment 86

•

9 years ago

Comment on attachment 8639749 [details] [diff] [review]
Destroy the walker actor on disconnect - v2

Review of attachment 8639749 [details] [diff] [review]:
-----------------------------------------------------------------

I don't see any reason why the WalkerActor shouldn't be managed by the InspectorActor.   Even though it doesn't strictly need the inspector, it's only ever constructed from the inspector (even in tests).  But clearing the review until I understand the answer to Comment 85.

Attachment #8639749 - Flags: review?(bgrinstead)

Alexandre Poirot [:ochameau]

Assignee

Comment 87

•

9 years ago

Comment on attachment 8639749 [details] [diff] [review]
Destroy the walker actor on disconnect - v2

Sorry, yes I'm fuzzy. I'm kind of lost in this pile of patches against the inspector...
I had so hard time getting green try that I had to split this somewhat simple and obvious fix.

You are right, we are already explicitely releasing the walker actor in current tip.
But we aren't releasing it in case of disconnect (if you pull the phone cable for ex).
Having to explicitely call release when the toolbox is closing on client disconnection is just wrong. The actors should clean themself, we shouldn't have the client to say to cleanup stuff.
This is the big picture over most my inspector patches.

Only tab actors can handle disconnection correctly. Only them can get their `disconnect` method being called when the connection closes. For the inspector panel, only the inspector actor can handle this. So in the already landed attachment 8607134 [details] [diff] [review], I fixed the InspectorActor behavior by adding a `disconnect` method on it. It finally starts to clean something on disconnect or toolbox close (as disconnect is called on toolbox close as we shut down the related client). Before that we didn't even tried to release it manually in toolbox.destroyInspector as this._inspector.destroy() only call front's destroy.

So the overall codepath for cleanup is now Inspector.disconnect gets called in most useful cases of cleanup, then it call its destroy method which call protocol.js one which handle all the inspector actor hierarchy. And the followup patches are to improve all these actor destroy method to ensure not leaking various stuff like MutationObserver, or NodeActors.

To summarize:
 - the overall goal of all these patches in to cleanup inspector actors (inspector, walker, highlighter, node, stylesheet, ...) correctly both when we close the toolbox or shutdown the connection.
 - this particular patch only fix the precise case where we shutdown the connection and the walker isn't destroyed

Note that I wasn't able to land this particular patch due to failure on try, so it highlighted cases where we weren't correctly cleaning up the walker actor during tests. We may have some tests where we aren't closing the toolbox and instead just shutdown the server/client/connection.
Also note that most tests failures I'm hitting are cases where we are trying to use an already released actor, this means that we weren't correctly destroying some actors. It also highlighted many tests that weren't waiting correctly for actor responses and were still running after the test clean itself up and/or call SimpleTest.finish and runs another tests.

Flags: needinfo?(poirot.alex)

Attachment #8639749 - Flags: review?(bgrinstead)

Brian Grinstead [:bgrins]

Comment 88

•

9 years ago

Comment on attachment 8639749 [details] [diff] [review]
Destroy the walker actor on disconnect - v2

Review of attachment 8639749 [details] [diff] [review]:
-----------------------------------------------------------------

As I said in Comment 86, this makes sense

::: browser/devtools/framework/toolbox.js
@@ +1784,5 @@
>  
>      // Releasing the walker (if it has been created)
>      // This can fail, but in any case, we want to continue destroying the
>      // inspector/highlighter/selection
> +    // FF41+: Inspector actor starts managing Walker actor and auto destroy it.

Looks like this should already be FF42+ in the comment for now, and then the number will need to be bumped if this doesn't land before the merge date

@@ +1785,5 @@
>      // Releasing the walker (if it has been created)
>      // This can fail, but in any case, we want to continue destroying the
>      // inspector/highlighter/selection
> +    // FF41+: Inspector actor starts managing Walker actor and auto destroy it.
> +    let walker = this._walker && !this.walker.traits.autoReleased ?

Seems like this should assign to 'this._destroyingInspector' instead of 'let walker'.  Since the value is always a promise, the old variable name never really made sense.  Then we could just return this._destroyingInspector in the next statement.

::: toolkit/devtools/server/actors/inspector.js
@@ +1305,5 @@
>      return {
>        actor: this.actorID,
> +      root: this.rootNode.form(),
> +      traits: {
> +        // FF41+ Inspector starts managing the Walker, while the inspector also

FF42+

@@ +3140,5 @@
>    form: function(json) {
>      this.actorID = json.actor;
>      this.rootNode = types.getType("domnode").read(json.root, this);
>      this._rootNodeDeferred.resolve(this.rootNode);
> +    // FF41+ the actor starts exposing traits

FF42+

Attachment #8639749 - Flags: review?(bgrinstead) → review+

Alexandre Poirot [:ochameau]

Assignee

Comment 89

•

9 years ago

(In reply to Brian Grinstead [:bgrins] from comment #88)
> @@ +1785,5 @@
> >      // Releasing the walker (if it has been created)
> >      // This can fail, but in any case, we want to continue destroying the
> >      // inspector/highlighter/selection
> > +    // FF41+: Inspector actor starts managing Walker actor and auto destroy it.
> > +    let walker = this._walker && !this.walker.traits.autoReleased ?
> 
> Seems like this should assign to 'this._destroyingInspector' instead of 'let
> walker'.  Since the value is always a promise, the old variable name never
> really made sense.  Then we could just return this._destroyingInspector in
> the next statement.

That would prevent waiting for `outstanding` resolution.
But I can refactor this code within outstanding and use Task to make that clearer...

Brian Grinstead [:bgrins]

Comment 90

•

9 years ago

(In reply to Alexandre Poirot [:ochameau] from comment #89)
> (In reply to Brian Grinstead [:bgrins] from comment #88)
> > @@ +1785,5 @@
> > >      // Releasing the walker (if it has been created)
> > >      // This can fail, but in any case, we want to continue destroying the
> > >      // inspector/highlighter/selection
> > > +    // FF41+: Inspector actor starts managing Walker actor and auto destroy it.
> > > +    let walker = this._walker && !this.walker.traits.autoReleased ?
> > 
> > Seems like this should assign to 'this._destroyingInspector' instead of 'let
> > walker'.  Since the value is always a promise, the old variable name never
> > really made sense.  Then we could just return this._destroyingInspector in
> > the next statement.
> 
> That would prevent waiting for `outstanding` resolution.
> But I can refactor this code within outstanding and use Task to make that
> clearer...

Or maybe just rename the 'walker' variable to something like 'walkerReleased'.  I wouldn't argue against refactoring it though :)

Alexandre Poirot [:ochameau]

Assignee

Comment 91

•

9 years ago

Attached patch Destroy the walker actor on disconnect - v3 (obsolete) — Details — Splinter Review

https://treeherder.mozilla.org/#/jobs?repo=try&revision=8386d6a7fc0c
Here is the refactoring. It is much clearer!

Attachment #8639749 - Attachment is obsolete: true

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8642529 - Flags: review?(bgrinstead)

Brian Grinstead [:bgrins]

Comment 92

•

9 years ago

Comment on attachment 8642529 [details] [diff] [review]
Destroy the walker actor on disconnect - v3

Review of attachment 8642529 [details] [diff] [review]:
-----------------------------------------------------------------

Definitely clearer!  Think there are a couple of issues still that we need to work out to make sure it's doing the same thing as it used to with regards to multiple calls to this function and failures in walker.release()

::: browser/devtools/framework/toolbox.js
@@ +1742,5 @@
>     * Destroy the inspector/walker/selection fronts
>     * Returns a promise that resolves when the fronts are destroyed
>     */
> +  destroyInspector: Task.async(function*() {
> +    if (this._destroyingInspector) {

this._destroyingInspector is no longer set anywhere so this will just run twice when called twice

@@ +1755,5 @@
>      // This can fail, but in any case, we want to continue destroying the
>      // inspector/highlighter/selection
> +    // FF42+: Inspector actor starts managing Walker actor and auto destroy it.
> +    if (this._walker && !this.walker.traits.autoReleased) {
> +      yield this._walker.release();

Don't we need to catch a failure here to be consistent with what used to happen?

Attachment #8642529 - Flags: review?(bgrinstead)

Alexandre Poirot [:ochameau]

Assignee

Comment 93

•

9 years ago

Attached patch Destroy the walker actor on disconnect - v4 — Details — Splinter Review

https://treeherder.mozilla.org/#/jobs?repo=try&revision=23aa51742b4d

Attachment #8642529 - Attachment is obsolete: true

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8642633 - Flags: review?(bgrinstead)

Brian Grinstead [:bgrins]

Comment 94

•

9 years ago

Comment on attachment 8642633 [details] [diff] [review]
Destroy the walker actor on disconnect - v4

Review of attachment 8642633 [details] [diff] [review]:
-----------------------------------------------------------------

::: browser/devtools/framework/toolbox.js
@@ +1863,5 @@
>      }
>  
>      // Now that we are closing the toolbox we can re-enable the cache settings
>      // and disable the service workers testing settings for the current tab.
> +    // FF42+ automatically cleans up state in actor on disconnect.

Was this comment update intentional?

Attachment #8642633 - Flags: review?(bgrinstead) → review+

Pulsebot

Comment 95

•

9 years ago

https://hg.mozilla.org/integration/fx-team/rev/2672589e571e

Alexandre Poirot [:ochameau]

Assignee

Comment 96

•

9 years ago

(In reply to Brian Grinstead [:bgrins] from comment #94)
> Comment on attachment 8642633 [details] [diff] [review]
> Destroy the walker actor on disconnect - v4
> 
> Review of attachment 8642633 [details] [diff] [review]:
> -----------------------------------------------------------------
> 
> ::: browser/devtools/framework/toolbox.js
> @@ +1863,5 @@
> >      }
> >  
> >      // Now that we are closing the toolbox we can re-enable the cache settings
> >      // and disable the service workers testing settings for the current tab.
> > +    // FF42+ automatically cleans up state in actor on disconnect.
> 
> Was this comment update intentional?

No! I think I'm doing too many dumb refactoring these days... Fixed and landed.

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Keywords: leave-open

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8642633 - Flags: checkin+

Carsten Book [:Tomcat]

Comment 97

•

9 years ago

sorry had to back this out for test failures like :

https://treeherder.mozilla.org/logviewer.html#?job_id=4078307&repo=fx-team

Flags: needinfo?(poirot.alex)

Pulsebot

Comment 98

•

9 years ago

Backout:
https://hg.mozilla.org/integration/fx-team/rev/0afaff7ed957

Alexandre Poirot [:ochameau]

Assignee

Comment 99

•

9 years ago

Attached patch Fix browser_animation_target_highlight_select.js - v1 — Details — Splinter Review

Sorry brian to spam you with reviews,
I really wish Patrick was around...
Please do not hesitate to nominate someone else to review any of my requests!

As other tests fixes, I had to add some explicit wait for various events,
that, to prevent sending or replying to requests after the test is finished,
and the toolbox is destroyed (now that actors are correctly destroyed,
we get new "Connection closed, pending request" exceptions).

First, there is something unexpected with `mouseover`.
If we don't do the `mouseout` we get random new mouseover
when calling selectNode(). (And that dispatches unexpected new RDP requests)

Then, we weren't waiting for *all* AnimationTargetNode.render() calls
to finish their getNodeFromActor request.
Nor were we correctly waiting for animation panel to fully proceed
the newly selected node and create the related AnimationTargetNode instances.

Alexandre Poirot [:ochameau]

Assignee

Comment 100

•

9 years ago

Attached patch Fix browser_tilt_picking_inspector.js - v1 — Details — Splinter Review

This one is a bit simplier. We weren't correctly waiting for the inspector
and its various panels to finish updating (and processing related RDP requests like showBoxModel).

Alexandre Poirot [:ochameau]

Assignee

Comment 101

•

9 years ago

Comment on attachment 8643987 [details] [diff] [review]
Fix browser_animation_target_highlight_select.js - v1

https://treeherder.mozilla.org/#/jobs?repo=try&revision=7a7e8c7c8e32

Flags: needinfo?(poirot.alex)

Attachment #8643987 - Flags: review?(bgrinstead)

Alexandre Poirot [:ochameau]

Assignee

Updated

•

9 years ago

Attachment #8643992 - Flags: review?(bgrinstead)

Patrick Brosset <:pbro>

Comment 102

•

9 years ago

Comment on attachment 8643987 [details] [diff] [review]
Fix browser_animation_target_highlight_select.js - v1

Review of attachment 8643987 [details] [diff] [review]:
-----------------------------------------------------------------

Fly-by review. This looks good to me.

::: browser/devtools/animationinspector/test/browser_animation_target_highlight_select.js
@@ +13,5 @@
>    let ui = yield openAnimationInspector();
>    yield testTargetNode(ui);
>  
>    ui = yield closeAnimationInspectorAndRestartWithNewUI();
> +

nit: no need for a new line here.

Attachment #8643987 - Flags: review?(bgrinstead) → review+

Brian Grinstead [:bgrins]

Comment 103

•

9 years ago

Comment on attachment 8643992 [details] [diff] [review]
Fix browser_tilt_picking_inspector.js - v1

Review of attachment 8643992 [details] [diff] [review]:
-----------------------------------------------------------------

seems fine

Attachment #8643992 - Flags: review?(bgrinstead) → review+

Pulsebot

Comment 104

•

9 years ago

https://hg.mozilla.org/integration/fx-team/rev/2c22db41143d
https://hg.mozilla.org/integration/fx-team/rev/660a4b33140c
https://hg.mozilla.org/integration/fx-team/rev/a0610620d6d1

Ryan VanderMeulen [:RyanVM]

Comment 105

•

9 years ago

https://hg.mozilla.org/mozilla-central/rev/2c22db41143d
https://hg.mozilla.org/mozilla-central/rev/660a4b33140c
https://hg.mozilla.org/mozilla-central/rev/a0610620d6d1

Status: ASSIGNED → RESOLVED

Closed: 9 years ago

status-firefox43: --- → fixed

Flags: in-testsuite+

Resolution: --- → FIXED

Target Milestone: --- → Firefox 43

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

Comment 106

•

9 years ago

\o/ This was a pretty crazy effort, I am glad to see it finally say FIXED!

arni2033

Updated

•

9 years ago

Depends on: 1197789

arni2033

Updated

•

7 years ago

Depends on: 1328014

BMO Automation

Updated

•

6 years ago

Product: Firefox → DevTools

You need to log in before you can comment on or make changes to this bug.