Closed Bug 660774 Opened 13 years ago Closed 13 years ago

e10s necko: refactor channelEventQueue to allow async resume/flush

Tracking

()

Status:

RESOLVED FIXED

Milestone:

mozilla7

People

(Reporter: jduell.mcbugs, Assigned: jduell.mcbugs)

References

Details

Attachments

(3 files)

Simple fix. 13 years ago Jason Duell 9.11 KB, patch	jdm : review+	Details \| Diff \| Splinter Review
More extensive refactoring (applies on top of simple fix) 13 years ago Jason Duell 39.83 KB, patch	jdm : review+	Details \| Diff \| Splinter Review
fixes issue with test_rentrancy_wrap.js 13 years ago Jason Duell 1.29 KB, patch		Details \| Diff \| Splinter Review

Jason Duell

Assignee

Description

•

13 years ago

Attached patch Simple fix. — Details — Splinter Review

In bug 637339 we've moved mCallonResume and FlushEventQueue() to an asyc callback (instead of right within Resume()) so that we don't flush the queue (and call client callbacks) before the client's Resume call completes. Alas, this breaks the existing, rather hack-y ChannelEventQueue logic. In particular, if the suspend happens within OnStartReq, etc., we wind up hitting FlushEventQueue (when OnStartReq's AutoEventEnqueuer goes out of scope), and that barfs on the mQueuePhase==PHASE_FINISHED_QUEUEING we set in Resume(). If we remove the error check and allow PHASE_FINISHED_QUEUEING the queue will get flushed before we get a chance to run mCallOnResume, which would be out of order. I suppose we might be able to fix that by adding another state to mQueuePhase, or possibly even (now that I think about it) by setting mQueuePhase=PHASE_QUEUEING in HttpChannelChild::Resume(), plus removing the NS_ABORT_IF_FALSE(mQueuePhase != PHASE_UNQUEUED) in FlushEventQueue(). But in general I'm finding the ChannelEventQueue logic to be too fragile and unintuitive: for example, when suspended we wind up queueing messages while state=PHASE_UNQUEUED. For this bug I offer two approaches--one quick fix, and one more extensive refactoring. I think the second is better. Note: both of these patches are built on top of the patches in bug 637339, but you don't really need to understand them at all, other than than HttpChannelChild::Resume now launches an async "CompleteResume()" method, which is where we want to actually resume the EventQueue. Short term fix: I've built in formal Suspend/Resume methods into ChannelEventQueue. This allows us to delay resuming the ChannelEventQueue until we get to CompleteResume(). It also cleans up the caller code a bit (I could also remove all the template logic from ChannelEventQueue, but I haven't bothered to here).

Attachment #536259 - Flags: review?(josh)

Jason Duell

Assignee

Comment 1

•

13 years ago

Attached patch More extensive refactoring (applies on top of simple fix) — Details — Splinter Review

2nd, more ambituous patch. Follow on the first patch, but change ChannelEventQueue from using a single state variable and instead check for the various conditions (suspended, in a 'critical section', flushing) that we've been fudging with an enum. I think this makes the logic a lot easier to follow. Also convert ChannelEventQueue to be a non-template class, and make it a member of channels, rather than a base class.

Attachment #536260 - Flags: review?(josh)

Jason Duell

Assignee

Updated

•

13 years ago

Blocks: 637339

Josh Matthews [:jdm]

Comment 2

•

13 years ago

Comment on attachment 536260 [details] [diff] [review] More extensive refactoring (applies on top of simple fix) I really like the way the logic is presented here; it's significantly easier to follow. My comments are all superficial things. >+#include <nsIChannel.h> I think this can be forward-declared. >+ bool mForced; > bool mSuspended; >+ bool mFlushing; These could be bitfields. I don't feel strongly one way or the other, though. >+ NS_ASSERTION(!( (answer == false) && !mEventQueue.IsEmpty()), >+ "Should always enqueue if ChannelEventQueue not empty"); This hurts every time I think about it. Can we apply DeMorgan's laws and get |answer == true || mEventQueue.IsEmpty()| instead? >+ NS_ASSERTION(!mSuspended, NS_ABORT_IF_FALSE >+ NS_ASSERTION(mSuspended, NS_ABORT_IF_FALSE >+ NS_ASSERTION(!mForced, "MaybeFlushQueue called inside critical section"); NS_ABORT_IF_FALSE >+ mEventQ.Resume(); // TODO: make this async: see HttpChannelChild::Resume Get a bug filed on that?

Attachment #536260 - Flags: review?(josh) → review+

Josh Matthews [:jdm]

Updated

•

13 years ago

Attachment #536259 - Flags: review?(josh)

Josh Matthews [:jdm]

Comment 3

Comment 12

•

13 years ago

http://hg.mozilla.org/integration/mozilla-inbound/rev/4a33bc1f772d

Whiteboard: [inbound]

Mounir Lamouri (:mounir)

Comment 13

•

13 years ago

Pushed: http://hg.mozilla.org/mozilla-central/rev/4a33bc1f772d

Status: ASSIGNED → RESOLVED

Closed: 13 years ago

Flags: in-testsuite+

Resolution: --- → FIXED

Whiteboard: [inbound]

Target Milestone: --- → mozilla7

Version: unspecified → Trunk

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Quick Search

e10s necko: refactor channelEventQueue to allow async resume/flush

Categories

(Core :: Networking, defect)

Tracking

()

People

(Reporter: jduell.mcbugs, Assigned: jduell.mcbugs)

References

Details

Crash Data

Security

(public)

User Story

Attachments

(3 files)

Description

Comment 1

Updated

Comment 2

Updated

Comment 3

Comment 4

Updated

Comment 5

Comment 6

Comment 7

Comment 8

Comment 9

Comment 10

Comment 11

Comment 12

Comment 13

Attachment

General

Description

File Name

Content Type