Closed Bug 1207696 Opened 10 years ago Closed 7 years ago

Web replay initial landing

Status: RESOLVED FIXED

Tracking Flags:

firefox44: tracking ---, status affected

People

(Reporter: bhackett1024, Assigned: bhackett1024)

Details

(Whiteboard: leave-open)

Attachments

(81 files, 118 obsolete files)

WIP 10 years ago Brian Hackett [Laid off!] 47.55 KB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 63.55 KB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 84.27 KB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 140.33 KB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 169.46 KB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 175.18 KB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 187.35 KB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 198.03 KB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 197.30 KB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 248.89 KB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 314.47 KB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 397.02 KB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 359.35 KB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 402.97 KB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 459.93 KB, patch		Details \| Diff \| Splinter Review
test.html 10 years ago Brian Hackett [Laid off!] 298 bytes, text/html		Details
WIP 10 years ago Brian Hackett [Laid off!] 472.88 KB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 465.64 KB, patch		Details \| Diff \| Splinter Review
Part 1: Link NSPR in more places 10 years ago Brian Hackett [Laid off!] 2.44 KB, patch		Details \| Diff \| Splinter Review
Part 2: Record/replay infrastructure 10 years ago Brian Hackett [Laid off!] 199.61 KB, patch		Details \| Diff \| Splinter Review
Part 3: Atomics interface changes 10 years ago Brian Hackett [Laid off!] 19.80 KB, patch		Details \| Diff \| Splinter Review
Part 4: NSPR locking/threading changes. 10 years ago Brian Hackett [Laid off!] 24.35 KB, patch		Details \| Diff \| Splinter Review
Part 5: Use NSPR primitives in chromium code 10 years ago Brian Hackett [Laid off!] 18.06 KB, patch		Details \| Diff \| Splinter Review
Part 6: Parent side changes 10 years ago Brian Hackett [Laid off!] 6.04 KB, patch		Details \| Diff \| Splinter Review
Part 7: Unrecorded atomics/locks 10 years ago Brian Hackett [Laid off!] 31.87 KB, patch		Details \| Diff \| Splinter Review
Part 8: Child side behavior changes 10 years ago Brian Hackett [Laid off!] 5.07 KB, patch		Details \| Diff \| Splinter Review
Part 9: Record/replay instrumentation of mach messages 10 years ago Brian Hackett [Laid off!] 54.22 KB, patch		Details \| Diff \| Splinter Review
Part 10: Other child side instrumentation 10 years ago Brian Hackett [Laid off!] 14.93 KB, patch		Details \| Diff \| Splinter Review
Part 11: Replayed process IPC 10 years ago Brian Hackett [Laid off!] 76.51 KB, patch		Details \| Diff \| Splinter Review
Part 12: Graphics changes for layers/compositing IPC 10 years ago Brian Hackett [Laid off!] 14.16 KB, patch		Details \| Diff \| Splinter Review
rewind WIP 10 years ago Brian Hackett [Laid off!] 77.99 KB, patch		Details \| Diff \| Splinter Review
rewind WIP 10 years ago Brian Hackett [Laid off!] 85.85 KB, patch		Details \| Diff \| Splinter Review
rewind WIP 10 years ago Brian Hackett [Laid off!] 111.09 KB, patch		Details \| Diff \| Splinter Review
rewind WIP 10 years ago Brian Hackett [Laid off!] 201.96 KB, patch		Details \| Diff \| Splinter Review
rewind WIP 10 years ago Brian Hackett [Laid off!] 280.43 KB, patch		Details \| Diff \| Splinter Review
debugger WIP 10 years ago Brian Hackett [Laid off!] 290.65 KB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 799.66 KB, patch		Details \| Diff \| Splinter Review
Part 1: Build system changes 10 years ago Brian Hackett [Laid off!] 8.69 KB, patch		Details \| Diff \| Splinter Review
Part 2: Record/replay/rewind infrastructure 10 years ago Brian Hackett [Laid off!] 228.04 KB, patch		Details \| Diff \| Splinter Review
Part 3: Atomics interface changes 10 years ago Brian Hackett [Laid off!] 19.58 KB, patch		Details \| Diff \| Splinter Review
Part 4: NSPR locking/threading changes 10 years ago Brian Hackett [Laid off!] 28.97 KB, patch		Details \| Diff \| Splinter Review
Part 5: Use NSPR primitives in chromium code 10 years ago Brian Hackett [Laid off!] 16.21 KB, patch		Details \| Diff \| Splinter Review
Part 6: Parent side changes 10 years ago Brian Hackett [Laid off!] 2.10 KB, patch		Details \| Diff \| Splinter Review
Part 7: Unrecorded atomics/locks 10 years ago Brian Hackett [Laid off!] 56.36 KB, patch		Details \| Diff \| Splinter Review
Part 8: Child side behavior changes 10 years ago Brian Hackett [Laid off!] 25.78 KB, patch		Details \| Diff \| Splinter Review
Part 9: Record/replay instrumentation of mach messages 10 years ago Brian Hackett [Laid off!] 54.00 KB, patch		Details \| Diff \| Splinter Review
Part 10: Other child side instrumentation 10 years ago Brian Hackett [Laid off!] 15.37 KB, patch		Details \| Diff \| Splinter Review
Part 11: Middleman process IPC / other changes 10 years ago Brian Hackett [Laid off!] 60.67 KB, patch		Details \| Diff \| Splinter Review
Part 12: Replaying process IPC 10 years ago Brian Hackett [Laid off!] 51.46 KB, patch		Details \| Diff \| Splinter Review
Part 13: C++ JS debugger changes 10 years ago Brian Hackett [Laid off!] 119.13 KB, patch		Details \| Diff \| Splinter Review
Part 14: C++ JS replay debugger 10 years ago Brian Hackett [Laid off!] 51.89 KB, patch		Details \| Diff \| Splinter Review
Part 15: Graphics changes for replay IPC 10 years ago Brian Hackett [Laid off!] 28.58 KB, patch		Details \| Diff \| Splinter Review
Part 16: Client (chrome) side devtools changes 10 years ago Brian Hackett [Laid off!] 8.28 KB, patch		Details \| Diff \| Splinter Review
Part 17: Server (content) side devtools changes 10 years ago Brian Hackett [Laid off!] 25.26 KB, patch		Details \| Diff \| Splinter Review
robustness WIP 10 years ago Brian Hackett [Laid off!] 179.80 KB, patch		Details \| Diff \| Splinter Review
graphics WIP 10 years ago Brian Hackett [Laid off!] 88.95 KB, patch		Details \| Diff \| Splinter Review
graphics WIP 10 years ago Brian Hackett [Laid off!] 111.83 KB, patch		Details \| Diff \| Splinter Review
graphics WIP 10 years ago Brian Hackett [Laid off!] 112.37 KB, patch		Details \| Diff \| Splinter Review
record font names 10 years ago Brian Hackett [Laid off!] 10.52 KB, patch		Details \| Diff \| Splinter Review
robustness WIP #2 10 years ago Brian Hackett [Laid off!] 123.47 KB, patch		Details \| Diff \| Splinter Review
Windows WIP 10 years ago Brian Hackett [Laid off!] 370.25 KB, patch		Details \| Diff \| Splinter Review
Windows WIP #2 10 years ago Brian Hackett [Laid off!] 75.84 KB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 1.04 MB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 1.10 MB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 1.26 MB, patch		Details \| Diff \| Splinter Review
WIP 10 years ago Brian Hackett [Laid off!] 1.31 MB, patch		Details \| Diff \| Splinter Review
Part 1: Build system changes 10 years ago Brian Hackett [Laid off!] 11.60 KB, patch		Details \| Diff \| Splinter Review
Part 1b: Move LZ4 to NSPR 10 years ago Brian Hackett [Laid off!] 110.39 KB, patch		Details \| Diff \| Splinter Review
Part 2: Record/replay/rewind infrastructure 10 years ago Brian Hackett [Laid off!] 108.15 KB, patch		Details \| Diff \| Splinter Review
Part 2b: Mac / Windows redirections 10 years ago Brian Hackett [Laid off!] 186.33 KB, patch		Details \| Diff \| Splinter Review
Part 3: Atomics interface changes 10 years ago Brian Hackett [Laid off!] 24.86 KB, patch		Details \| Diff \| Splinter Review
Part 4: NSPR locking/threading changes 10 years ago Brian Hackett [Laid off!] 57.39 KB, patch		Details \| Diff \| Splinter Review
Part 5: Use NSPR primitives in chromium code 10 years ago Brian Hackett [Laid off!] 42.71 KB, patch		Details \| Diff \| Splinter Review
Part 6: Parent side changes 10 years ago Brian Hackett [Laid off!] 3.28 KB, patch		Details \| Diff \| Splinter Review
Part 7: Unrecorded atomics/locks 10 years ago Brian Hackett [Laid off!] 56.10 KB, patch		Details \| Diff \| Splinter Review
Part 8: Child side behavior changes 10 years ago Brian Hackett [Laid off!] 61.80 KB, patch		Details \| Diff \| Splinter Review
Part 9: Record/replay instrumentation of mach messages 10 years ago Brian Hackett [Laid off!] 70.45 KB, patch		Details \| Diff \| Splinter Review
Part 10: Other child side instrumentation 10 years ago Brian Hackett [Laid off!] 57.05 KB, patch		Details \| Diff \| Splinter Review
Part 11: Middleman process IPC / other changes 10 years ago Brian Hackett [Laid off!] 74.08 KB, patch		Details \| Diff \| Splinter Review
Part 12: Replaying process IPC 10 years ago Brian Hackett [Laid off!] 47.80 KB, patch		Details \| Diff \| Splinter Review
Part 13: C++ JS debugger changes 10 years ago Brian Hackett [Laid off!] 166.42 KB, patch		Details \| Diff \| Splinter Review
Part 14: C++ JS replay debugger 10 years ago Brian Hackett [Laid off!] 103.01 KB, patch		Details \| Diff \| Splinter Review
Part 15: Graphics layers/shmem changes for replay IPC 10 years ago Brian Hackett [Laid off!] 31.38 KB, patch		Details \| Diff \| Splinter Review
Part 15b: Use DrawTargetRecording to render graphics in the middleman process 10 years ago Brian Hackett [Laid off!] 56.11 KB, patch		Details \| Diff \| Splinter Review
Part 16: Client (chrome) side devtools changes 10 years ago Brian Hackett [Laid off!] 49.17 KB, patch		Details \| Diff \| Splinter Review
Part 17: Server (content) side devtools changes 10 years ago Brian Hackett [Laid off!] 27.35 KB, patch		Details \| Diff \| Splinter Review
WIP 9 years ago Brian Hackett [Laid off!] 1.33 MB, patch		Details \| Diff \| Splinter Review
WIP #2 9 years ago Brian Hackett [Laid off!] 283.30 KB, patch		Details \| Diff \| Splinter Review
WIP #2 9 years ago Brian Hackett [Laid off!] 405.22 KB, patch		Details \| Diff \| Splinter Review
WIP 9 years ago Brian Hackett [Laid off!] 1.43 MB, patch		Details \| Diff \| Splinter Review
Mac WIP 9 years ago Brian Hackett [Laid off!] 135.32 KB, patch		Details \| Diff \| Splinter Review
Windows WIP 9 years ago Brian Hackett [Laid off!] 232.53 KB, patch		Details \| Diff \| Splinter Review
Mac WIP 9 years ago Brian Hackett [Laid off!] 337.49 KB, patch		Details \| Diff \| Splinter Review
Mac WIP 9 years ago Brian Hackett [Laid off!] 359.35 KB, patch		Details \| Diff \| Splinter Review
Windows WIP 9 years ago Brian Hackett [Laid off!] 312.09 KB, patch		Details \| Diff \| Splinter Review
WIP 9 years ago Brian Hackett [Laid off!] 1.51 MB, patch		Details \| Diff \| Splinter Review
patch 9 years ago Brian Hackett [Laid off!] 1.63 MB, patch		Details \| Diff \| Splinter Review
Part 1a - Record/replay/rewind infrastructure. 9 years ago Brian Hackett [Laid off!] 177.49 KB, patch		Details \| Diff \| Splinter Review
Part 1b - Add libudis86 source. 9 years ago Brian Hackett [Laid off!] 365.40 KB, patch	jandem : review+	Details \| Diff \| Splinter Review
Part 1c - Redirections. 9 years ago Brian Hackett [Laid off!] 277.92 KB, patch	froydnj : feedback+	Details \| Diff \| Splinter Review
Part 1d - Setup/teardown of record/replay state 9 years ago Brian Hackett [Laid off!] 3.38 KB, patch		Details \| Diff \| Splinter Review
Part 1e - Disable crash reporting when recording/replaying 9 years ago Brian Hackett [Laid off!] 1.04 KB, patch	billm : review+	Details \| Diff \| Splinter Review
Part 2a - Atomics interface changes. 9 years ago Brian Hackett [Laid off!] 25.17 KB, patch		Details \| Diff \| Splinter Review
Part 2b - Don't record activity in atomics unit tests. 9 years ago Brian Hackett [Laid off!] 3.50 KB, patch	Waldo : review+	Details \| Diff \| Splinter Review
Part 3 - Use PR_ATOMIC macros instead of Interlocked functions in chromium code. 9 years ago Brian Hackett [Laid off!] 5.21 KB, patch	billm : review+	Details \| Diff \| Splinter Review
Part 4a - Make recording optional in mozilla::RefCounted. 9 years ago Brian Hackett [Laid off!] 2.16 KB, patch	ehsan.akhgari : review+	Details \| Diff \| Splinter Review
Part 4b - Make recording optional in mozilla mutexes and monitors. 9 years ago Brian Hackett [Laid off!] 10.19 KB, patch	froydnj : review+	Details \| Diff \| Splinter Review
Part 4c - Don't record activity on ThreadSafeAutoRefCnt or nsStringBuffer refcounts. 9 years ago Brian Hackett [Laid off!] 1.83 KB, patch	froydnj : review+	Details \| Diff \| Splinter Review
Part 4d - Don't record certain graphics atomics counters. 9 years ago Brian Hackett [Laid off!] 3.57 KB, patch	bas.schouten : review+	Details \| Diff \| Splinter Review
Part 4e - Don't record various JS atomics. 9 years ago Brian Hackett [Laid off!] 22.59 KB, patch	jandem : review+	Details \| Diff \| Splinter Review
Part 4f - Don't record JS mutexes. 9 years ago Brian Hackett [Laid off!] 875 bytes, patch	fitzgen : review+	Details \| Diff \| Splinter Review
Part 4g - Don't record malloc library atomic. 9 years ago Brian Hackett [Laid off!] 1.26 KB, patch	n.nethercote : review+	Details \| Diff \| Splinter Review
Part 4h - Don't record chaos mode counters. 9 years ago Brian Hackett [Laid off!] 1.44 KB, patch	froydnj : review+	Details \| Diff \| Splinter Review
Part 4i - Don't record NSPR thread/lock-management atomics. 9 years ago Brian Hackett [Laid off!] 4.83 KB, patch		Details \| Diff \| Splinter Review
Part 4j - Don't record pseudo-stack refcount. 9 years ago Brian Hackett [Laid off!] 1.18 KB, patch	fitzgen : review+	Details \| Diff \| Splinter Review
Part 4k - Don't record deadlock detector lock. 9 years ago Brian Hackett [Laid off!] 858 bytes, patch	froydnj : review+	Details \| Diff \| Splinter Review
Part 4l - Don't record some debugging/statistics atomics. 9 years ago Brian Hackett [Laid off!] 2.39 KB, patch	froydnj : review+	Details \| Diff \| Splinter Review
Part 4m - Don't record some threading atomics. 9 years ago Brian Hackett [Laid off!] 1.92 KB, patch	froydnj : review+	Details \| Diff \| Splinter Review
Part 4n - Don't record XPT table monitor. 9 years ago Brian Hackett [Laid off!] 1.15 KB, patch	froydnj : review+	Details \| Diff \| Splinter Review
Part 5a - Disable incremental GC when recording or replaying. 9 years ago Brian Hackett [Laid off!] 3.16 KB, patch		Details \| Diff \| Splinter Review
Part 5b - Don't keep track of times or page fault counts in GC and helper thread activity when recording or replaying. 9 years ago Brian Hackett [Laid off!] 11.32 KB, patch	sfink : review+	Details \| Diff \| Splinter Review
Part 5c - Don't dispatch runnables for GC or finalization when under the GC and recording or replaying. 9 years ago Brian Hackett [Laid off!] 2.74 KB, patch	mccr8 : review+	Details \| Diff \| Splinter Review
Part 5d - Disable compacting GC when replaying. 9 years ago Brian Hackett [Laid off!] 1.18 KB, patch	jonco : review+	Details \| Diff \| Splinter Review
Part 5e - Don't assume that CFBundleGetFunctionPointerForName succeeds. 9 years ago Brian Hackett [Laid off!] 949 bytes, patch	BenWa : review+	Details \| Diff \| Splinter Review
Part 5f - Don't sort arrays of requests in image loader when recording or replaying. 9 years ago Brian Hackett [Laid off!] 2.92 KB, patch	mattwoodrow : review+	Details \| Diff \| Splinter Review
Part 5g - Disable finalization witnesses when recording or replaying. 9 years ago Brian Hackett [Laid off!] 1.63 KB, patch	froydnj : review+	Details \| Diff \| Splinter Review
Part 5h - Disable Cocoa native drawing when recording or replaying. 9 years ago Brian Hackett [Laid off!] 1.15 KB, patch	bas.schouten : review+	Details \| Diff \| Splinter Review
Part 5i - Disable lazy and off thread JS parsing when recording or replaying. 9 years ago Brian Hackett [Laid off!] 2.17 KB, patch	jandem : review+	Details \| Diff \| Splinter Review
Part 5j - Don't add GC events to timelines when recording or replaying. 9 years ago Brian Hackett [Laid off!] 1.91 KB, patch	mccr8 : review+	Details \| Diff \| Splinter Review
Part 5k - Don't generate debugger runnables on GC events. 9 years ago Brian Hackett [Laid off!] 1.01 KB, patch	fitzgen : review+	Details \| Diff \| Splinter Review
Part 5l - Don't trace refcounts while recording or replaying. 9 years ago Brian Hackett [Laid off!] 991 bytes, patch	froydnj : review+	Details \| Diff \| Splinter Review
Part 5m - Disable hang monitor while recording or replaying. 9 years ago Brian Hackett [Laid off!] 955 bytes, patch	froydnj : review+	Details \| Diff \| Splinter Review
Part 5n - Don't perform telemetry while recording or replaying. 9 years ago Brian Hackett [Laid off!] 3.85 KB, patch		Details \| Diff \| Splinter Review
Part 6a - Disable media elements when recording or replaying. 9 years ago Brian Hackett [Laid off!] 4.72 KB, patch	jesup : review+	Details \| Diff \| Splinter Review
Part 6b - Disable DOM workers when recording or replaying. 9 years ago Brian Hackett [Laid off!] 1.17 KB, patch	smaug : review+	Details \| Diff \| Splinter Review
Part 6c - Disable accelerated canvases when recording or replaying. 9 years ago Brian Hackett [Laid off!] 1.24 KB, patch	dvander : review+	Details \| Diff \| Splinter Review
Part 6d - Disable wasm signal handlers when recording or replaying. 9 years ago Brian Hackett [Laid off!] 1.25 KB, patch	luke : review+	Details \| Diff \| Splinter Review
Part 6e - Disable the slow script dialog when recording or replaying. 9 years ago Brian Hackett [Laid off!] 2.03 KB, patch	mrbkap : review+	Details \| Diff \| Splinter Review
Part 7 - Ensure deterministic interaction of GC with CC and object references. 9 years ago Brian Hackett [Laid off!] 25.72 KB, patch	smaug : review-	Details \| Diff \| Splinter Review
Part 8a - Manually ensure hash table iteration ordering consistency in mMutants std::set. 9 years ago Brian Hackett [Laid off!] 2.08 KB, patch	jrmuizel : review-	Details \| Diff \| Splinter Review
Part 8b - Manually record/replay mach_msg IPC calls. 9 years ago Brian Hackett [Laid off!] 5.48 KB, patch	billm : review+	Details \| Diff \| Splinter Review
Part 8c - Mark places in the JS engine where recording events are disallowed and where the recording should be invalidated. 9 years ago Brian Hackett [Laid off!] 5.87 KB, patch	jandem : review+	Details \| Diff \| Splinter Review
Part 8d - Manually ensure hash table iteration ordering consistency in structured clone hash table. 9 years ago Brian Hackett [Laid off!] 4.82 KB, patch	jorendorff : review+	Details \| Diff \| Splinter Review
Part 8e - Don't incorporate environment into random number seed when recording or replaying. 9 years ago Brian Hackett [Laid off!] 1.81 KB, patch	franziskus : review-	Details \| Diff \| Splinter Review
Part 8f - Ensure that PL and PLD hashtables have consistent iteration order when recording/replaying. 9 years ago Brian Hackett [Laid off!] 17.02 KB, patch	froydnj : review+	Details \| Diff \| Splinter Review
Part 8g - Instrument accesses on shared memory read locks. 9 years ago Brian Hackett [Laid off!] 10.13 KB, patch	nical : review+	Details \| Diff \| Splinter Review
Part 9a - PReplay protocol and parent side implementation. 9 years ago Brian Hackett [Laid off!] 66.10 KB, patch		Details \| Diff \| Splinter Review
Part 9b - Handle separate PCompositorChild used in middleman processes. 9 years ago Brian Hackett [Laid off!] 9.41 KB, patch	nical : review+	Details \| Diff \| Splinter Review
Part 9c - Shutdown replaying process when middleman process shuts down. 9 years ago Brian Hackett [Laid off!] 1.01 KB, patch	billm : review+	Details \| Diff \| Splinter Review
Part 9d - Allow sync messages to be sent off the main thread by middleman or replaying processes. 9 years ago Brian Hackett [Laid off!] 1.60 KB, patch	billm : review+	Details \| Diff \| Splinter Review
Part 10a - Child side implementation of PReplay protocol. 9 years ago Brian Hackett [Laid off!] 45.57 KB, patch		Details \| Diff \| Splinter Review
Part 10b - Allow the SharedMemory reporter to be constructed independently. 9 years ago Brian Hackett [Laid off!] 1.93 KB, patch	n.nethercote : review+	Details \| Diff \| Splinter Review
Part 10c - Allow shared memory subsystem to communicate with the actual parent pid while replaying. 9 years ago Brian Hackett [Laid off!] 7.69 KB, patch		Details \| Diff \| Splinter Review
Part 10d - Coordinate with snapshot mechanism in replay specific IPC threads. 9 years ago Brian Hackett [Laid off!] 4.39 KB, patch		Details \| Diff \| Splinter Review
Part 10e - Don't allow snapshots to be taken when in the middle of sending replay specific IPC messages. 9 years ago Brian Hackett [Laid off!] 2.94 KB, patch	billm : review+	Details \| Diff \| Splinter Review
Part 10f - Coordinate with snapshot mechanism in JS helper threads. 9 years ago Brian Hackett [Laid off!] 2.85 KB, patch	fitzgen : review+	Details \| Diff \| Splinter Review
Part 10g - Initialize replay-specific and middleman state. 9 years ago Brian Hackett [Laid off!] 3.69 KB, patch	billm : review+	Details \| Diff \| Splinter Review
Part 10h - Don't register replay specific threads with the profiler. 9 years ago Brian Hackett [Laid off!] 1.12 KB, patch	BenWa : review+	Details \| Diff \| Splinter Review
Part 11 - C++ JS debugger changes. 9 years ago Brian Hackett [Laid off!] 219.10 KB, patch		Details \| Diff \| Splinter Review
Part 12 - C++ JS replay debugger. 9 years ago Brian Hackett [Laid off!] 137.62 KB, patch		Details \| Diff \| Splinter Review
Part 13 - Graphics layers/shmem changes for replay IPC. 9 years ago Brian Hackett [Laid off!] 14.80 KB, patch	nical : review+	Details \| Diff \| Splinter Review
Part 14a - DrawTargetRecording refactoring to allow for common code with DrawTargetRecordReplay. 9 years ago Brian Hackett [Laid off!] 17.85 KB, patch	bas.schouten : review-	Details \| Diff \| Splinter Review
Part 14b - Fix backend getter for DrawTargetRecording. 9 years ago Brian Hackett [Laid off!] 1.08 KB, patch	bas.schouten : review-	Details \| Diff \| Splinter Review
Part 14c - Allow recording translator to manage creation of similar draw targets. 9 years ago Brian Hackett [Laid off!] 1.13 KB, patch	bas.schouten : review-	Details \| Diff \| Splinter Review
Part 14d - Support creating native font resources for Mac draw targets. 9 years ago Brian Hackett [Laid off!] 6.57 KB, patch	jrmuizel : review+	Details \| Diff \| Splinter Review
Part 14e - Add RECORDREPLAY graphics backend. 9 years ago Brian Hackett [Laid off!] 3.59 KB, patch	bas.schouten : review+	Details \| Diff \| Splinter Review
Part 14f - Add DrawTargetRecordReplay. 9 years ago Brian Hackett [Laid off!] 18.58 KB, patch		Details \| Diff \| Splinter Review
Part 14g - Create DrawTargetRecordReplay in gfx Factory when recording or replaying. 9 years ago Brian Hackett [Laid off!] 5.51 KB, patch	bas.schouten : review-	Details \| Diff \| Splinter Review
Part 14h - Optimize recording size for font data events. 9 years ago Brian Hackett [Laid off!] 3.04 KB, patch	bas.schouten : review-	Details \| Diff \| Splinter Review
Part 15a - Client side devtools changes. 9 years ago Brian Hackett [Laid off!] 33.59 KB, patch	jlong : review-	Details \| Diff \| Splinter Review
Part 15b - Spawning record/replay/middleman processes. 9 years ago Brian Hackett [Laid off!] 21.61 KB, patch		Details \| Diff \| Splinter Review
Part 16 - Server side devtools changes. 9 years ago Brian Hackett [Laid off!] 15.70 KB, patch	jimb : review+	Details \| Diff \| Splinter Review
Part 17 - Tests. 9 years ago Brian Hackett [Laid off!] 3.08 KB, patch		Details \| Diff \| Splinter Review
Part 5a - Disable incremental GC when recording or replaying. 9 years ago Brian Hackett [Laid off!] 1.84 KB, patch	mccr8 : review+	Details \| Diff \| Splinter Review
patch 9 years ago Brian Hackett [Laid off!] 1.61 MB, patch		Details \| Diff \| Splinter Review
Part 1a - Record/replay/rewind infrastructure. 9 years ago Brian Hackett [Laid off!] 188.05 KB, patch		Details \| Diff \| Splinter Review
Part 1d - Teardown of record/replay state 9 years ago Brian Hackett [Laid off!] 2.81 KB, patch	billm : review+	Details \| Diff \| Splinter Review
Part 2a - Atomics interface changes. 9 years ago Brian Hackett [Laid off!] 19.86 KB, patch	Waldo : review+	Details \| Diff \| Splinter Review
Part 8f - Ensure that PLD hashtables have consistent iteration order when recording/replaying. 9 years ago Brian Hackett [Laid off!] 5.34 KB, patch	froydnj : review+	Details \| Diff \| Splinter Review
Part 5n - Don't perform telemetry while recording or replaying. 9 years ago Brian Hackett [Laid off!] 7.05 KB, patch	gfritzsche : review+	Details \| Diff \| Splinter Review
Part 7 - Ensure deterministic interaction of GC with CC and object references. 9 years ago Brian Hackett [Laid off!] 31.08 KB, patch	smaug : review-	Details \| Diff \| Splinter Review
Part 7 - Ensure deterministic interaction of GC with CC and object references. 9 years ago Brian Hackett [Laid off!] 26.65 KB, patch		Details \| Diff \| Splinter Review
patch 9 years ago Brian Hackett [Laid off!] 1.82 MB, patch		Details \| Diff \| Splinter Review
Part 1a - Public record/replay API. 9 years ago Brian Hackett [Laid off!] 32.37 KB, patch	billm : review+ froydnj : review+	Details \| Diff \| Splinter Review
Part 1c - Record/replay utilities. 9 years ago Brian Hackett [Laid off!] 21.30 KB, patch		Details \| Diff \| Splinter Review
Part 1f - Execution recording/replaying. 9 years ago Brian Hackett [Laid off!] 125.45 KB, patch	billm : review+	Details \| Diff \| Splinter Review
Part 1g - Execution rewinding. 9 years ago Brian Hackett [Laid off!] 84.62 KB, patch	billm : review+	Details \| Diff \| Splinter Review
Part 1h - Redirections infrastructure. 9 years ago Brian Hackett [Laid off!] 73.83 KB, patch	froydnj : review+	Details \| Diff \| Splinter Review
Part 1i - Platform specific redirections. 9 years ago Brian Hackett [Laid off!] 254.65 KB, patch	froydnj : review+	Details \| Diff \| Splinter Review
Part 7 - Ensure deterministic interaction of GC with CC and object references. 9 years ago Brian Hackett [Laid off!] 27.31 KB, patch	smaug : review+	Details \| Diff \| Splinter Review
Part 9a - PReplay protocol and parent side implementation. 9 years ago Brian Hackett [Laid off!] 39.18 KB, patch	billm : review+	Details \| Diff \| Splinter Review
Part 9e - Parent side of compositor management. 9 years ago Brian Hackett [Laid off!] 41.64 KB, patch	nical : review+	Details \| Diff \| Splinter Review
Part 9f - Parent side of DrawTarget management. 9 years ago Brian Hackett [Laid off!] 16.04 KB, patch	bas.schouten : review+	Details \| Diff \| Splinter Review
Part 10a - Child side implementation of PReplay protocol. 9 years ago Brian Hackett [Laid off!] 22.54 KB, patch	billm : review+	Details \| Diff \| Splinter Review
Part 10c - Allow shared memory subsystem to communicate with the actual parent pid while replaying. 9 years ago Brian Hackett [Laid off!] 7.59 KB, patch	billm : review+	Details \| Diff \| Splinter Review
Part 10d - Coordinate with snapshot mechanism in replay specific IPC threads. 9 years ago Brian Hackett [Laid off!] 3.87 KB, patch	billm : review+	Details \| Diff \| Splinter Review
Part 10i - Child side of compositor management. 9 years ago Brian Hackett [Laid off!] 16.99 KB, patch	nical : review+	Details \| Diff \| Splinter Review
Part 10j - Child side of DrawTarget management. 9 years ago Brian Hackett [Laid off!] 11.28 KB, patch	bas.schouten : review+	Details \| Diff \| Splinter Review
Part 14f - Add DrawTargetRecordReplay. 9 years ago Brian Hackett [Laid off!] 22.17 KB, patch	bas.schouten : review+	Details \| Diff \| Splinter Review

Brian Hackett [Laid off!] (Assignee), Description, 10 years ago

Over the past couple weeks, roc and I have come up with a design that looks promising for web recording and deterministic replay: an rr like tool that can be integrated into Firefox and used by web and browser developers to debug content processes.

Browser non-determinism originates from two sources: intra-thread and inter-thread. Intra-thread non-deterministic behaviors are non-deterministic even in the absence of actions by other threads, and inter-thread non-deterministic behaviors are those affected by interleaving execution with other threads, and which always behave the same given the same interleaving.

Intra-thread non-determinism can be recorded and later replayed for each thread. These behaviors mainly originate from system calls (i/o and such), and can be instrumented at the library call level --- this means we can just use shims at library call sites in Gecko code rather than modifying the libraries themselves, and is helpful on Windows, where the library call API is stable/documented but not the system call API.

Inter-thread non-determinism can be handled by first assuming the program is data race free: shared memory accesses which would otherwise race are either protected by locks or use mozilla::Atomic. If we record and later replay the order in which threads acquire locks (and, by extension, release locks and use condvars) then accesses on lock-protected memory will occur in the same order. Atomic variables can be handled either by treating reads and writes as if they were wrapped by a lock acquire/release during recording, or by treating reads as intra-thread non-determinism (record and replay the values produced). I like the latter more (less overhead), though it wouldn't work well with atomic pointer values so a mix of the two is probably best (needs experimentation).

This design is broadly similar to rr, with some major differences:

- This should work on all platforms and architectures supported by Gecko, without much port work required.
- This will be part of Gecko itself, rather than a separate tool, which means both that developers won't need additional software to use it and that this can't be used to debug other software.
- This can use multiple cores during recording and replay.
- This does not preserve exact behavior. Context switches can occur at different times and data races can lead to different behavior between recording and replay. Data races are bugs in and of themselves, however, so this sort of non-determinism should be fixed regardless.

There's one exception here that I know of: SharedArrayBuffer can be used by web content to introduce data races to the browser. Pages which use SharedArrayBuffer can still be recorded, but using a different technique from above which will probably use a single core for efficiency's sake.

More generally, this design is flexible enough that it could diverge from rr in other ways. For example, it would be really slick if the existing debugger in the browser could be activated during replay without perturbing behaviors which the debugger can observe. This should greatly simplify implementations of a rewind debugger and/or omniscient debugger (build a database of behaviors like DOM/JS object modifications and executed scripts for querying) based on the replayer, but would mean the recording and replay executions could have different pointer values and code paths taken in some places.

For now though, this bug is about building a prototype to test how well these ideas work for a basic record/replay system.
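As a rough illustration of the lock-ordering idea (not code from any patch on this bug), the sketch below records the order in which threads acquire a lock and then forces the same order on a second run. `OrderedLock`, `RunWorkers` and the integer thread ids are hypothetical names chosen for this example, and it assumes a data race free program, as the design does.

```cpp
#include <cassert>
#include <condition_variable>
#include <cstddef>
#include <mutex>
#include <thread>
#include <utility>
#include <vector>

// Hypothetical sketch: a lock that records which thread acquired it, in
// order, and can later force the same acquisition order during replay.
class OrderedLock {
 public:
  enum class Mode { Recording, Replaying };

  // When recording, `order` starts empty. When replaying, it holds the
  // thread ids in the order they acquired the lock during recording.
  explicit OrderedLock(Mode mode, std::vector<int> order = {})
      : mode_(mode), order_(std::move(order)) {}

  void Acquire(int threadId) {
    std::unique_lock<std::mutex> guard(state_);
    if (mode_ == Mode::Recording) {
      cv_.wait(guard, [&] { return !held_; });
      order_.push_back(threadId);  // Remember who got the lock.
    } else {
      // Block until the lock is free and it is this thread's turn.
      cv_.wait(guard, [&] {
        return !held_ && next_ < order_.size() && order_[next_] == threadId;
      });
      ++next_;
    }
    held_ = true;
  }

  void Release() {
    {
      std::lock_guard<std::mutex> guard(state_);
      held_ = false;
    }
    cv_.notify_all();  // Wake waiters so the next thread in line can proceed.
  }

  std::vector<int> Order() const {
    std::lock_guard<std::mutex> guard(state_);
    return order_;
  }

 private:
  Mode mode_;
  std::vector<int> order_;
  std::size_t next_ = 0;
  bool held_ = false;
  mutable std::mutex state_;
  std::condition_variable cv_;
};

// Runs `threads` workers that each append their id to a shared log `rounds`
// times while holding the lock, then returns the log.
std::vector<int> RunWorkers(OrderedLock& lock, int threads, int rounds) {
  std::vector<int> log;
  std::vector<std::thread> workers;
  for (int id = 0; id < threads; ++id) {
    workers.emplace_back([&lock, &log, id, rounds] {
      for (int i = 0; i < rounds; ++i) {
        lock.Acquire(id);
        log.push_back(id);  // Protected by the lock, so race free.
        lock.Release();
      }
    });
  }
  for (auto& worker : workers) worker.join();
  return log;
}
```

Running `RunWorkers` once with a recording lock and again with a replaying lock built from `Order()` should produce the same log, even though the operating system may schedule the threads differently on each run.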

Brian Hackett [Laid off!] (Assignee), Comment 1, 10 years ago

Attached patch WIP (obsolete)

Initial WIP, attaching for posterity. This records NSPR thread/lock creation order and lock acquisition order into files, along with some system call information for each thread. The record/replay architecture is all inside NSPR, and it seems best for now to keep it there and use NSPR for threads/locks everywhere in the browser (this patch already fixes up thread creation in the chromium message passing code to use NSPR), though I don't know how long that strategy will last.

This patch intercepts system calls by adding new functions like PR_RecordReplay_kevent and replacing direct system calls with calls to the wrapper. I'm going to rework this strategy; on OS X, system calls all seem to go through a userspace library (libsystem_kernel.dylib) whose code can be modified to redirect calls to wrapper functions. This should let system calls within complex libraries be intercepted without modifying the libraries, and moves the design closer to rr. I don't know if this will work on all platforms (or even if it will actually work on OS X) but it should simplify the prototype.
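The wrapper approach described for this first patch can be sketched roughly as follows. This is not the patch's code: `EventStream`, `ReadLikeCall` and `RecordReplayRead` are hypothetical names, the stream lives in memory rather than in a file, and a `std::function` stands in for the real system call so the example stays portable.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <functional>
#include <vector>

// Hypothetical per-thread event stream. A real implementation would write
// to a file; here the bytes are kept in memory.
struct EventStream {
  bool replaying = false;
  std::vector<std::uint8_t> bytes;
  std::size_t cursor = 0;

  // When recording, appends `size` bytes from `data` to the stream.
  // When replaying, overwrites `data` with the next `size` recorded bytes.
  void RecordOrReplay(void* data, std::size_t size) {
    if (size == 0) return;
    auto* p = static_cast<std::uint8_t*>(data);
    if (replaying) {
      assert(cursor + size <= bytes.size());
      std::memcpy(p, bytes.data() + cursor, size);
      cursor += size;
    } else {
      bytes.insert(bytes.end(), p, p + size);
    }
  }
};

// Stand-in for a system call such as read(): fills `buf` and returns the
// number of bytes produced. The signature is an assumption for illustration.
using ReadLikeCall = std::function<long(char* buf, std::size_t len)>;

// Wrapper in the spirit of the PR_RecordReplay_* functions mentioned above:
// callers use this instead of making the call directly.
long RecordReplayRead(EventStream& stream, const ReadLikeCall& realCall,
                      char* buf, std::size_t len) {
  long result = 0;
  if (!stream.replaying) {
    result = realCall(buf, len);  // Only touch the outside world when recording.
  }
  stream.RecordOrReplay(&result, sizeof(result));
  if (result > 0) {
    // The output buffer is part of the call's observable result.
    stream.RecordOrReplay(buf, static_cast<std::size_t>(result));
  }
  return result;
}
```

During replay the real call is never made, so the thread sees the same return value and buffer contents regardless of what the underlying call would produce now.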

Assignee: nobody → bhackett1024

Brian Hackett [Laid off!]

Assignee

Comment 2

•

10 years ago

Attached patch WIP (obsolete) — Details — Splinter Review

Updated WIP that modifies code in libsystem_kernel.dylib and creates trampolines so that system calls through this library can be redirected to a stub that performs the intended call and records information about the result. Right now recording seems to be working fine for all the system calls I can find running the content process on a basic website (nytimes.com) using dtruss. The code modification is pretty hacky (pattern matching disassembly of the code patterns seen in my computer's libsystem_kernel.dylib) but the general approach looks to be basically the same as other hooking tools (e.g. Detours from MSR, http://research.microsoft.com/pubs/68568/huntusenixnt99.pdf). The next step here is to record atomic variable accesses.
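The stub that a redirected call lands in can be pictured roughly as follows. This sketch leaves out the code patching and trampolines entirely and only shows the record-or-replay decision; `CallResultStream`, `Mode`, and `Invoke` are hypothetical names, and the real stubs also have to capture output buffers, not just the return value.

```cpp
#include <cassert>
#include <cstddef>
#include <deque>
#include <functional>

enum class Mode { Recording, Replaying };

// Hypothetical in-memory stream of results for one thread's redirected calls.
class CallResultStream {
public:
  explicit CallResultStream(Mode mode) : mMode(mode) {}
  void SetMode(Mode mode) { mMode = mode; }

  // When recording, perform the intended call and log its result.
  // When replaying, return the logged result without making the call.
  long Invoke(const std::function<long()>& realCall) {
    if (mMode == Mode::Recording) {
      long rv = realCall();
      mResults.push_back(rv);
      return rv;
    }
    assert(!mResults.empty() && "replay ran past the end of the recording");
    long rv = mResults.front();
    mResults.pop_front();
    return rv;
  }

private:
  Mode mMode;
  std::deque<long> mResults;
};
```

The point of the trampoline is that callers never see this stub: they still call the original entry point, and the patched first instructions jump here.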

Attachment #8666530 - Attachment is obsolete: true

(Away)

Comment 3

•

10 years ago

We have a similar hooking system in xpcom/build/nsWindowsDllInterceptor.h; perhaps there may be an opportunity for code re-use.

Brian Hackett [Laid off!]

Assignee

Comment 4

•

10 years ago

(In reply to David Major [:dmajor] from comment #3)
> We have a similar hooking system in xpcom/build/nsWindowsDllInterceptor.h;
> perhaps there may be an opportunity for code re-use.

Cool. Yes, CreateTrampoline() in that file is almost shockingly similar to Redirect() in this patch. To reuse this we'll need to expand things to work on other operating systems and architectures but that doesn't seem too hard.

Brian Hackett [Laid off!]

Assignee

Comment 5

•

10 years ago

Attached patch WIP (obsolete) — Details — Splinter Review

WIP with handling for atomics. This adds a template argument to mozilla::Atomic to control how much of their behavior is preserved during replay (similar to how the existing MemoryOrdering argument controls the reordering optimizations the compiler is allowed to do wrt the atomic). For now a couple atomics in the JS engine used for interrupts are not recorded at all --- these are accessed all the time, including from jitcode, and recording and replaying executions which are interrupted by the slow script dialog isn't really something that can practically be done (there are a couple other events that will prevent recording I think, like throwing overrecursion errors, but nothing major). All other atomics record enough information to preserve the exact order of their accesses, but there is flexibility to relax that as needed.
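To make the template-argument idea concrete, here is a small sketch. It does not reproduce the real mozilla::Atomic interface: `RecordedAtomic`, the `Recording` enum, and the global access log are all hypothetical, and the log only captures which atomic was touched, in order.

```cpp
#include <atomic>
#include <cassert>
#include <cstdint>
#include <mutex>
#include <vector>

// Hypothetical knob, analogous in spirit to the MemoryOrdering argument.
enum class Recording { PreserveOrder, PreserveNothing };

// Hypothetical global log of atomic ids, in access order.
struct AtomicAccessLog {
  std::mutex lock;
  std::vector<uint32_t> accesses;
};

inline AtomicAccessLog& GetAtomicAccessLog() {
  static AtomicAccessLog log;
  return log;
}

template <typename T, Recording R = Recording::PreserveOrder>
class RecordedAtomic {
public:
  RecordedAtomic(uint32_t id, T initial) : mId(id), mValue(initial) {}

  T Load() {
    if (R == Recording::PreserveOrder) {
      // Log and access under one lock so the log matches the real order.
      AtomicAccessLog& log = GetAtomicAccessLog();
      std::lock_guard<std::mutex> guard(log.lock);
      log.accesses.push_back(mId);
      return mValue.load();
    }
    return mValue.load();  // Unrecorded: plain atomic access.
  }

  void Store(T value) {
    if (R == Recording::PreserveOrder) {
      AtomicAccessLog& log = GetAtomicAccessLog();
      std::lock_guard<std::mutex> guard(log.lock);
      log.accesses.push_back(mId);
      mValue.store(value);
      return;
    }
    mValue.store(value);
  }

private:
  uint32_t mId;
  std::atomic<T> mValue;
};
```

An atomic declared with `Recording::PreserveNothing` behaves like the interrupt flags described above: it is never logged, so it costs nothing extra on hot paths such as jitcode.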

Attachment #8668518 - Attachment is obsolete: true

Brian Hackett [Laid off!]

Assignee

Comment 6

•

10 years ago

Attached patch WIP (obsolete) — Details — Splinter Review

Updated WIP that fixes a lot of bugs in the record/replay infrastructure. Replaying still balks early during startup; the main issue I'm working on now is how to deal with Grand Central Dispatch (GCD), an OS X API for task parallelism that is built using pthreads and some kernel syscalls that are undocumented and probably shift around between OS versions. Since GCD has its own thread pool, even if we can record its behavior it will be very hard to rewind process state like we will eventually want to do, so I think the best thing here is to intercept calls to the public interface of GCD (which is hopefully fairly stable) and emulate its behavior using our own thread pool and synchronization. I don't know if GCD is used heavily anywhere in browser code; everything I've seen has been deep inside calls to functions in other system libraries.

Attachment #8669236 - Attachment is obsolete: true

•

10 years ago

Attached patch WIP (obsolete) — Details — Splinter Review

This patch fixes a fair number of issues with GCD replay. Attaching this now because I'm going to take a step back and try a new approach. Up to now these patches have been trying to instrument things at as low a level as possible, first at the system call level and then at the GCD level when that library posed problems for replay. The main problem I'm running into now is the same as when everything was at the system level. It's very easy for the replay to go off the rails and start behaving differently, either in a way we can detect (different order of system calls) or just by crashing in the middle of some library. Most of these libraries are closed source and debugging these problems is extremely difficult. More problematic though is that some of these libraries shouldn't even be expected to behave the same during replay. If a library's behavior is affected by thread interleavings --- it passes information between threads, lazily initializes things, etc. --- then the calls it makes won't be consistent between executions. And without modifying the source for such libraries this problem isn't really solvable. Hooking functions like pthread_mutex_lock would help, but a lot of synchronization is done using inlined atomic operations which can't be hooked. So I think a better interface to target might be the one between code compiled into the browser and external libraries the browser is linked against. Libraries which are well behaved and operate in a thread local fashion can still be recorded/replayed at the system call level, but more complex multithreaded libraries can be isolated during recording and then have their observable effects replayed later.

Attachment #8681976 - Attachment is obsolete: true

Bill McCloskey [inactive unless it's an emergency] (:billm)

Comment 13

•

10 years ago

Brian, can you describe the goal here a little better? If it's to make it easier for Gecko developers to debug Gecko, then I can understand the design. But I don't see how this could be used to create a JS record-and-replay debugger. It seems too low level. If the user wanted to set a new breakpoint, it seems like it would perturb our state enough that the trace would no longer be valid. Comment 0 addresses that, but I don't quite understand what the conclusion was.

Brian Hackett [Laid off!]

Assignee

•

10 years ago

Attached patch WIP (obsolete) — Details — Splinter Review

This patch intercepts enough system library calls so that on a simple webpage we never enter GCD without passing through a function or logic whose behavior is recorded (and will not execute at all during replay). Generally this is done by hooking the library functions, though Objective-C messages are handled via manual instrumentation (I don't know how or whether these message invocations can be hooked). I really like how this approach is going; everything that is hooked is called directly from browser code, these functions are all documented public APIs (GCD has a tricky issue where other system libraries enter it using undocumented private APIs), and this strategy should carry over well to Windows and other OSes.

Attachment #8682744 - Attachment is obsolete: true

Brian Hackett [Laid off!]

Assignee

Comment 18

•

10 years ago

Attached patch WIP (obsolete) — Details — Splinter Review

This patch fixes a lot of replay issues. Replay gets much further than in the past, though still not far enough to completely replay a simple webpage load. The new approach is still going very well, with replay issues much easier to track down and fix (since all the involved code is compiled with the browser rather than linked in).

Attachment #8683933 - Attachment is obsolete: true

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 19

•

10 years ago

Thanks Brian, this is really interesting.

> Hooking functions like pthread_mutex_lock would help, but a lot of synchronization is done using inlined atomic operations
> which can't be hooked.

OK, that's good to know.

Why do we need to wrap void library functions like, e.g., CTFontManagerSetAutoActivationSetting?

Why do we need CopyInstruction? We might be able to get that code from an existing library. I'm pretty sure we could extract the relevant parts of DynamoRIO for example.

It's not clear to me what state is now being preserved between recording and replay. We'll need a precise definition of that.

Brian Hackett [Laid off!]

•

10 years ago

Attached patch WIP (obsolete) — Details — Splinter Review

Unless I'm totally misunderstanding things, the content process paints data by rendering layers and then sending updates to the compositor thread in the parent process via IPDL. There are at least a couple different ways to relay updates which the replayed process tries to perform to the actual parent process, but I think the most robust is for the replayed process to be able to communicate with the parent process via IPDL in a manner independent from the original recording. This should work well with future IPC we need to do, like having the parent process control the replay via rewinding, etc., and having the parent process communicate with debugger actors. The attached patch updates things so that during replay we create two new threads: one thread has a chromium I/O message loop which communicates with the actual parent process, and the second thread has a second message loop to process messages coming in from the parent process. These threads operate independently from the threads serving this role in the original recording (IOThreadChild and the main thread), which during replay act in lockstep with how they behaved in the recording and don't do any actual IPC. This patch is far enough along that we create an alternative to ContentChild, ReplayContentChild, which looks like it's communicating with the parent process, though as soon as the parent sends ReplayContentChild a message it forces a crash.

Attachment #8688745 - Attachment is obsolete: true

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 24

•

10 years ago

Separate IPDL threads sound good. However, it might be better to use completely different protocols instead of PContent etc since we have quite different needs.

Brian Hackett [Laid off!]

Assignee

Comment 25

•

10 years ago

Attached patch WIP (obsolete) — Details — Splinter Review

This patch models enough of the layout IPDL in the replayed process that during replay we are able to render the graphics for a simple "Hello World!" webpage. I tried using a separate protocol from PContent, but ran into difficulties because it seemed like I would need to modify quite a lot of parent side code to tolerate interactions via multiple protocols (maybe there's a good place to refactor things, I don't know the parent side code at all really). This patch has skeletons for protocols like PContent and PBrowser that ignore most incoming messages, and allocates its own actors for layout protocols like PLayer and PTexture that are 1:1 with actors in the replayed content but interact with the actual parent process. During replay we watch for update messages from the replayed content and reflect those updates into ones for these alternate actors and send them to the parent.

Attachment #8691590 - Attachment is obsolete: true

Brian Hackett [Laid off!]

Assignee

•

10 years ago

Attached patch Part 1: Link NSPR in more places (obsolete) — Details — Splinter Review

The record/replay infrastructure is in NSPR, so libnspr needs to be linked into more places in the build.

Brian Hackett [Laid off!]

Assignee

Comment 30

•

10 years ago

Attached patch Part 2: Record/replay infrastructure (obsolete) — Details — Splinter Review

This has the recording/replaying infrastructure for thread/lock creation, lock ordering, atomic access ordering, thread events, function hooking, and alternate implementations of all the hooked functions. Sometime later this will be split up more.

Brian Hackett [Laid off!]

Assignee

Comment 31

•

10 years ago

Attached patch Part 3: Atomics interface changes (obsolete) — Details — Splinter Review

This modifies mfbt/Atomics.h and the NSPR atomic interfaces to automatically record and replay atomic accesses. In the former case this is done at a user-specifiable precision, via template parameters.

Brian Hackett [Laid off!]

Assignee

Comment 32

•

10 years ago

Attached patch Part 4: NSPR locking/threading changes. (obsolete) — Details — Splinter Review

This patch has the remainder of the changes to NSPR, to keep track of ids on threads and locks and ensure that creation and uses of these are recorded and correctly replayed.

Brian Hackett [Laid off!]

Assignee

Comment 33

•

10 years ago

Attached patch Part 5: Use NSPR primitives in chromium code (obsolete) — Details — Splinter Review

The chromium IPC code currently uses pthreads locking and threading primitives, which are not included in the recording. This patch changes things so that NSPR primitives are used instead.

Brian Hackett [Laid off!]

Assignee

•

10 years ago

Attachment #8696745 - Attachment is patch: true

Brian Hackett [Laid off!]

Assignee

Comment 38

•

10 years ago

Attached patch Part 10: Other child side instrumentation (obsolete) — Details — Splinter Review

This patch has the remaining manual child side instrumentation needed to get replay to work.

Brian Hackett [Laid off!]

Assignee

Comment 39

•

10 years ago

Attached patch Part 11: Replayed process IPC (obsolete) — Details — Splinter Review

This patch has the changes needed to allow the replayed process to communicate with the actual parent process via IPDL messages and shared memory. It also has the implementation for reflecting layers/compositing messages from the child into messages which can be sent to the parent process; this stuff will be split off into another patch some time later.

Brian Hackett [Laid off!]

Assignee

Comment 40

•

10 years ago

Attached patch Part 12: Graphics changes for layers/compositing IPC (obsolete) — Details — Splinter Review

This patch includes the graphics code side of the layers/compositing message reflection. When IPDL actors and shared memory regions are created, or update messages are sent, the replay IPC code is notified so that equivalent structures or messages can be created/sent to the actual parent process.

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 41

•

10 years ago

Thanks Brian! We discovered people already working on a similar project. See bug 1231802.

Brian Hackett [Laid off!]

Assignee

•

10 years ago

I've started looking at the devtools JS debugger stuff and how to attach breakpoints in the replayed process. I think the design here will need a little reworking. I want to be able to use the C++ JS debugger (js/src/vm/Debugger.* and some associated bits) in the replayed process, but devtools runs quite a bit more javascript code in the client process, whose results are collated and sent to the parent process in the form of (I think) JSON objects. I'd like to leave this JS code alone as much as possible, but (a) allowing the replayed process to execute JS code that didn't run in the recording could really mess up the replay, and (b) rewinding the process could really confuse the server without some substantial modifications.

It would be better I think if the devtools JS ran in a separate middleman process from the replayed process. This would allow the devtools JS to do whatever it wants without fear of damaging the replayed process state, and all its structures and state would be preserved when the replayed process rewinds or fast forwards itself. The design would be:

- The parent process communicates with the middleman process via the normal PContent/PBrowser/etc. IPDL protocols.

- The middleman process communicates with the replayed process via special replay protocol(s) that are tailored to the operations the replayed process can perform.

The parent is largely unchanged from its current behavior. In addition to running the devtools JS, the middleman process manages changes in state when the replayed process rewinds/fastforwards so that the parent process and devtools JS have a coherent representation of that state. For example, when playing forward the middleman will keep track of compositor updates so that if the process rewinds later it can reconstruct that earlier state by sending messages to the parent.

WRT the devtools JS, whenever the replayed process takes a snapshot the C++ JS debugger should not be active, but the destruction and reconstruction of these C++ constructs should be transparent to the devtools JS when playing forward through a snapshot or rewinding/fastforwarding. The middleman process should be able to manage this transparency. Both this and the compositor stuff could in principle be done in the replayed process, but it's much harder to do there due to the nature of the process rewinding --- all state that should be preserved across rewinding needs to be stored either on disk or in specially allocated memory regions.

The middleman process doesn't really need to be a separate process from the parent; the parent could instead start a message loop thread and I/O thread which do all the work which the middleman does. I think it will be easier to implement as a separate process, though, and because IPDL can be used for both inter-process and inter-thread communication, changing the middleman to run in the parent down the road (if desired) should be pretty painless.
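The compositor bookkeeping described above could look something like the following sketch. The names (`MiddlemanCompositorLog`, `CompositorUpdate`, the integer execution point) are hypothetical, and real updates are IPDL messages rather than strings; the sketch only shows that state kept outside the replayed process survives a rewind.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical: one compositor update, tagged with the execution point
// at which the replayed process produced it.
struct CompositorUpdate {
  size_t point;
  std::string payload;
};

class MiddlemanCompositorLog {
public:
  // Called while playing forward, as updates arrive from the replayed process.
  void NoteUpdate(size_t point, const std::string& payload) {
    mUpdates.push_back({point, payload});
  }

  // After a rewind to `point`, these are the updates the middleman would
  // resend to the parent to reconstruct what was on screen at that time.
  std::vector<std::string> UpdatesUpTo(size_t point) const {
    std::vector<std::string> result;
    for (const CompositorUpdate& update : mUpdates) {
      if (update.point <= point) result.push_back(update.payload);
    }
    return result;
  }

private:
  std::vector<CompositorUpdate> mUpdates;
};
```

Because this log lives in the middleman, it does not need to be stored on disk or in specially allocated memory the way state inside the rewinding process would.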

Brian Hackett [Laid off!]

Assignee

Comment 47

•

10 years ago

Attached patch rewind WIP (obsolete) — Details — Splinter Review

This patch does the same stuff as the last one, but does so using a middleman process as described in comment 46. Overall I think this is a definite improvement, and will provide a lot more flexibility going forward. There is more boilerplate and overhead to marshal compositor updates from the replayed process into the middleman and then into the parent process, but the middleman is just a lightly modified Gecko content process --- it defers to the replayed process for compositor updates, but can do anything else a normal content process can do without worrying about how the process replay will be affected. I forgot to say earlier, but using the middleman also gets us to the separate protocols idea from comment 24. Instead of the replayed process having a skeleton PContentChild/PBrowserChild/etc. implementation, the middleman process has standard versions of these, and the middleman and replayed processes communicate with a specialized PReplay protocol.

Attachment #8701073 - Attachment is obsolete: true

Nick Fitzgerald [:fitzgen] [⏰PST; UTC-8]

Comment 48

•

10 years ago

(In reply to Brian Hackett (:bhackett) from comment #46)
> I've started looking at the devtools JS debugger stuff and how to attach
> breakpoints in the replayed process. I think the design here will need a
> little reworking. I want to be able to use the C++ JS debugger
> (js/src/vm/Debugger.* and some associated bits) in the replayed process,

Is the goal to use all (most? the ones that make sense? read only?) of the devtools during replay? Because while a lot of functionality is built on the Debugger API, just as much is built on a random assortment of other XPCOM APIs and web APIs themselves (such as the "inspector" that lets one view and edit the DOM tree and the styles applied to each node directly). Point being that just re-implementing the Debugger API is not enough to get most of the devtools. Even just the JS debugger itself may need more.

> but devtools runs quite a bit more javascript code in the client process,
> whose results are collated and sent to the parent process in the form of (I
> think) JSON objects.

Yes, the devtools run a _ton_ of JS. Almost all of the devtools remote debugger protocol server (which lives in the debuggee content process) is JS. Yes, the remote debugger protocol uses JSON for the most part, but there are also the "bulk data" packets used for arbitrary binary data, which are implemented on top of streams directly.

For more info on the protocol itself: https://wiki.mozilla.org/Remote_Debugging_Protocol

Request and response messages are sent over a "transport". We have a `ChildDebuggerTransport` for debugging the child process from the parent process, which seems most relevant here. See https://dxr.mozilla.org/mozilla-central/source/devtools/shared/transport/transport.js#699-712

If there are any devtools or Debugger API related questions I can answer, please ask away :)

Brian Hackett [Laid off!]

Assignee

Comment 49

•

10 years ago

(In reply to Nick Fitzgerald [:fitzgen] [⏰PST; UTC-8] from comment #48)
> (In reply to Brian Hackett (:bhackett) from comment #46)
> > I've started looking at the devtools JS debugger stuff and how to attach
> > breakpoints in the replayed process. I think the design here will need a
> > little reworking. I want to be able to use the C++ JS debugger
> > (js/src/vm/Debugger.* and some associated bits) in the replayed process,
>
> Is the goal to use all (most? the ones that make sense? read only?) of the
> devtools during replay? Because while a lot of functionality is built on the
> Debugger API, just as much is built on a random assortment of other XPCOM APIs
> and web APIs themselves (such as the "inspector" that lets one view and edit the
> dom tree and the styles applied to each node directly). Point being that just
> re-implementing the Debugger API is not enough to get most of the devtools. Even
> just the JS debugger itself may need more.

Right now the goal is to just use the JS debugger during replay. Eventually, it would be nice to use all the devtools that make sense for a replay debugger (Everything except performance/network? In principle even these two could make sense for the replay debugger but the overhead of recording could perturb them a lot). Everything would be read only to the devtools, since changing state in an observable way can cause the replay to diverge from the recording.

Accesses to XPCOM objects in the replayed process can still be supported; instead of having XPConnect objects which call into XPCOM directly, the devtools code will have proxies whose accesses are resolved by sending synchronous IPDL messages to the replayed process. The patch I'm working on now does this for things like the doc shell and DOM window, and the result is fairly transparent to the devtools code (I have been modifying the devtools code to handle foobar.QueryInterface(), which could maybe be done transparently but would be pretty gnarly under the hood).

I don't know yet whether this will be enough to support the DOM inspector. The main question is whether the information the inspector is getting --- the boundary/padding for an element, the element which is found by a right click -> Inspect Element, etc. --- can be determined solely from the replayed process' document tree and layout state, without making calls into the system graphics libraries.

> If there are any devtools or Debugger API related questions I can answer,
> please ask away :)

Sure, thanks!
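The proxy idea can be sketched as below. This is only an illustration of the shape: `ReplayObjectProxy`, `SyncChannel`, and the string request format are hypothetical stand-ins for XPConnect-visible proxies and synchronous IPDL messages.

```cpp
#include <cassert>
#include <functional>
#include <string>
#include <utility>

// Hypothetical synchronous channel: send a request, block for the reply.
using SyncChannel = std::function<std::string(const std::string&)>;

// Stands in for an object living in the replayed process. It exposes reads
// only, since observable writes could make the replay diverge.
class ReplayObjectProxy {
public:
  ReplayObjectProxy(std::string objectId, SyncChannel channel)
      : mObjectId(std::move(objectId)), mChannel(std::move(channel)) {}

  // Every property read becomes one synchronous round trip.
  std::string Get(const std::string& property) const {
    return mChannel("get " + mObjectId + " " + property);
  }

private:
  std::string mObjectId;
  SyncChannel mChannel;
};
```

Keeping the proxy free of setters is a deliberate choice in this sketch; it mirrors the read-only constraint stated above rather than anything the patch is known to enforce this way.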

Brian Hackett [Laid off!]

Assignee

Comment 50

•

10 years ago

Attached patch debugger WIP (obsolete) — Details — Splinter Review

This patch adds some integration with the JS debugger. It does enough that one can run the replayed process, and when it finishes open the debugger UI, see the sources in the replayed process, set a breakpoint on a line, click on a new 'rewind' UI button, have the replayed process rewind to the last point where the breakpoint was hit, and finally have the middleman process notify the chrome process about the pause so it can update the UI appropriately. Not much else works --- when paused we don't yet provide properties on the environment or visible objects to inspect --- but I feel this suffices as a proof of concept, that we can attach the debugger and have it inspect state in the replayed process without perturbing the replay.

The prototype is, for lack of a better word, done. Things I'd like to do next:

- Post a design document about the project which describes its organization, technical features, and the invariants it is maintaining. Is MDN the best place for this?

- Post a cleaned up, rolled up patch and a new patch series breaking it down.

- Start recording/replaying more complicated pages. The robustness of the various parts of this project is, I feel, the biggest unknown at this point.
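The "rewind to the last point where the breakpoint was hit" step amounts to a backwards search over the execution so far. Here is a minimal sketch, assuming (hypothetically) that execution can be summarized as a list of executed line numbers; the actual patch works with snapshots and the C++ JS debugger rather than a flat list.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Returns the index of the last step strictly before `current` that executed
// `breakpointLine`, or -1 if the breakpoint was never hit before that point.
inline long FindRewindTarget(const std::vector<int>& executedLines,
                             size_t current, int breakpointLine) {
  size_t end = current < executedLines.size() ? current : executedLines.size();
  for (size_t i = end; i > 0; --i) {
    if (executedLines[i - 1] == breakpointLine) {
      return static_cast<long>(i - 1);
    }
  }
  return -1;
}
```

Pressing rewind again from the returned position would search from there, so repeated presses walk backwards through earlier hits.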

Boris Zbarsky [:bzbarsky]

Comment 51

•

10 years ago

The best place for a design doc is probably wiki.mozilla.org.

Nicholas Nethercote [inactive]

Comment 52

•

10 years ago

(In reply to Boris Zbarsky [:bz] (Vacation until Jan 4) from comment #51)
> The best place for a design doc is probably wiki.mozilla.org.

My understanding is that MDN is best for code-related stuff, and wiki.m.o is for stuff relating to people, teams and planning. E.g. from https://wiki.mozilla.org/MozillaWiki:About:

> To summarize: MDN documents how to interact with code. The Mozilla wiki documents how to interact
> with teams. SUMO documents how to interact with Mozilla products (as an end-user).

Boris Zbarsky [:bzbarsky]

Comment 53

•

10 years ago

Huh. It used to be that MDN was for documentation for how people outside the Mozilla project should interact with our code (e.g. the web developer documentation) and things like https://wiki.mozilla.org/Gecko:Overview lived... well, where it's living. I wonder when that was supposed to have changed.

Brian Hackett [Laid off!]

Assignee

Comment 54

•

10 years ago

Thanks, I put up the design document at https://developer.mozilla.org/en-US/docs/WebReplay

URL: https://developer.mozilla.org/en-US/d...

Brian Hackett [Laid off!]

Assignee

Comment 55

•

10 years ago

Attached patch WIP (obsolete) — Details — Splinter Review

Rolled up patch, rebased to tip and cleaned up in some places.

Attachment #8695312 - Attachment is obsolete: true

Attachment #8696702 - Attachment is obsolete: true

Attachment #8696730 - Attachment is obsolete: true

Attachment #8696731 - Attachment is obsolete: true

Attachment #8696733 - Attachment is obsolete: true

Attachment #8696735 - Attachment is obsolete: true

Attachment #8696738 - Attachment is obsolete: true

Attachment #8696740 - Attachment is obsolete: true

Attachment #8696742 - Attachment is obsolete: true

Attachment #8696743 - Attachment is obsolete: true

Attachment #8696745 - Attachment is obsolete: true

Attachment #8696746 - Attachment is obsolete: true

•

10 years ago

Attached patch Part 16: Client (chrome) side devtools changes (obsolete) — Details — Splinter Review

This patch adds a rewind button to the debugger client's UI, which when pressed sends a message to the debugger server living in the middleman process.

Brian Hackett [Laid off!]

Assignee

Comment 72

•

10 years ago

Attached patch Part 17: Server (content) side devtools changes (obsolete) — Details — Splinter Review

On the server side of the devtools code, this patch has changes to receive the rewind button event and pass it on to the C++ JS debugger. It also has some other changes needed for replaying, mainly for interacting with the proxies created for the docshell/window/etc. in the replaying process, which don't (yet?) support generic QI calls. This is the last patch in this series.

Robert O'Callahan (:roc) (email my personal email if necessary)

•

10 years ago

Attached patch robustness WIP (obsolete) — Details — Splinter Review

With this patch, the main page for Wikipedia can be recorded and replayed. As noted above, while this is a pretty static page it uses a lot more browser features than the basic test page did. The changes in this patch are mostly internal reorganization/improvements in the record/replay code. Other changes include some new APIs being hooked, messages being intercepted, and several hash tables that need to behave deterministically (the handling for these hash tables is pretty low level currently, I want to fold these into the hashtable classes themselves at some point).
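The hash table problem is that a table keyed on pointer values iterates in an order that depends on where objects happened to be allocated, which can differ between recording and replay. One possible way to get deterministic iteration, shown here purely as an assumption and not as what the patch does, is to iterate in insertion order; `OrderedPointerSet` is a hypothetical name.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_map>
#include <vector>

// Hypothetical pointer set whose iteration order is insertion order,
// independent of the numeric pointer values.
template <typename T>
class OrderedPointerSet {
public:
  // Returns false if the pointer was already present.
  bool Insert(T* ptr) {
    if (mIndex.count(ptr)) return false;
    mIndex[ptr] = mItems.size();
    mItems.push_back(ptr);
    return true;
  }

  bool Contains(T* ptr) const { return mIndex.count(ptr) != 0; }

  // Lookup still hashes the pointer, but iteration never consults the hash.
  const std::vector<T*>& Items() const { return mItems; }

private:
  std::unordered_map<T*, size_t> mIndex;
  std::vector<T*> mItems;
};
```

As long as insertions happen in the same order in both executions, iteration does too, even when the pointer values themselves differ.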

Brian Hackett [Laid off!]

Assignee

Comment 77

•

10 years ago

Attached patch graphics WIP (obsolete) — Details — Splinter Review

This patch is still pretty preliminary but illustrates what I'm hoping to do with graphics data. As I understand it the tile based renderer creates a bunch of CGBitmapContexts which render graphics updates to shared memory buffers managed by IPDL textures. Right now we handle these during record/replay by keeping track of API calls which will change the rendered data, and then at points where the data is needed (e.g. before sending a graphics update to the parent) recording the contents of the graphics data buffers so they can be restored later.

This is simple and it works but it produces an unacceptable amount of data. When loading Wikipedia the size of the recording is 176MB, 88% of which is graphics data. If I load Wikipedia, scroll to the bottom and then back to the top, the size of the recording is 477MB, 95% of it graphics data.

This patch leverages the hooking of the CoreGraphics and other Core APIs to avoid including any of this graphics data in the recording. While the process is replaying, these calls are recorded in a separate binary stream as they are performed, and when a graphics update is sent to the middleman that binary stream is also sent to the middleman, so that it can perform the graphics calls itself and render graphics to the memory buffers it shares with the replaying process. It's possible that the middleman could render the graphics data directly to the shared memory buffers it shares with the chrome process, eliminating the need for sharing graphics buffers with the replaying process, but this would require that the replay work without ever accessing the graphics data and that is not currently the case.

Currently this works with a simple 'Hello World!' page. Handling more complicated pages should only require improving this binary stream to work with other APIs that are already being hooked.
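The idea of shipping drawing calls instead of pixels can be sketched as follows. Everything here is hypothetical: a single `FillRect` command stands in for the hooked CoreGraphics calls, and a vector of 32-bit pixels stands in for the shared memory buffer.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical drawing command; all fields are 32-bit so the struct has
// no padding and can be copied bytewise.
struct FillRect {
  uint32_t x, y, w, h, color;
};

// Replaying process side: serialize each hooked call into a byte stream.
class DrawStream {
public:
  void AppendFillRect(const FillRect& cmd) {
    const uint8_t* p = reinterpret_cast<const uint8_t*>(&cmd);
    mBytes.insert(mBytes.end(), p, p + sizeof(FillRect));
  }
  const std::vector<uint8_t>& Bytes() const { return mBytes; }

private:
  std::vector<uint8_t> mBytes;
};

// Middleman side: decode the stream and perform the calls on its own buffer.
inline void PlayStream(const std::vector<uint8_t>& bytes,
                       std::vector<uint32_t>& pixels,
                       uint32_t width, uint32_t height) {
  for (size_t off = 0; off + sizeof(FillRect) <= bytes.size();
       off += sizeof(FillRect)) {
    FillRect cmd;
    std::memcpy(&cmd, bytes.data() + off, sizeof(FillRect));
    for (uint32_t y = cmd.y; y < cmd.y + cmd.h && y < height; ++y) {
      for (uint32_t x = cmd.x; x < cmd.x + cmd.w && x < width; ++x) {
        pixels[y * width + x] = cmd.color;
      }
    }
  }
}
```

The size of such a stream depends on the number of calls rather than the number of pixels touched, which is why it avoids the buffer-snapshot growth described above.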

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 78

•

10 years ago

> > 2) How much work is it to extend to more browser features? > > I'll post another patch in a bit, but it took a few days to get from > recording/replaying the text only test page to recording/replaying the > wikipedia home page, which does new things like loading images, using CSS, > having a complex page layout, etc. The main feature I'm concerned about is > WebGL, and I hope we can handle it by just hooking a bunch of new API > functions and doing something similar to what we're doing with CoreGraphics. I'm also worried about the media stack, and third-party code like video codecs and the WebRTC stack that use system APIs and custom threading. (In reply to Brian Hackett (:bhackett) from comment #76) > With this patch, the main page for Wikipedia can be recorded and replayed. > As noted above, while this is a pretty static page it uses a lot more > browser features than the basic test page did. The changes in this patch > are mostly internal reorganization/improvements in the record/replay code. > Other changes include some new APIs being hooked, messages being > intercepted, and several hash tables that need to behave deterministically > (the handling for these hash tables is pretty low level currently, I want to > fold these into the hashtable classes themselves at some point). Yeah, iterating nsPtrHashKey tables was obviously going to be an issue. It might be a good idea to just eliminate that, independently of record/replay. Your graphics recording approach might be able to use Moz2D's CreateRecordingDrawTarget, which in principle does what you need.

Brian Hackett [Laid off!]

•

10 years ago

Attached patch record font names (obsolete) — Details — Splinter Review

This patch records full font names when available from the CGFontRef instead of the font's tables, bringing the total size of the recording on Wikipedia down to around 22 MB (which is where it was with the earlier CoreGraphics based patch).

Brian Hackett [Laid off!]

Assignee

Comment 83

•

10 years ago

Attached patch robustness WIP #2 (obsolete) — Details — Splinter Review

This patch goes on top of the previous ones and includes various fixes to improve robustness. With this patch I can record and replay a simple GMail session: from a fresh profile, the login page is loaded, I log in, and the inbox loads and displays. The total recording size is about 40 MB, which seems reasonable and should improve if we start doing in-memory recording compression. Everything about the page looks right, except for animations that stay on their initial frame (e.g. the avatar that shows up on the login screen). I don't know what to do about these yet, especially when they are based on absolute time stamps and when the debugger is active on these pages.

Anyways, the main new issue this patch exposed is what to do about places where non-deterministic parts of the browser interact with deterministic parts. In particular, there are several nsISupports classes (nsGlobalWindow, HTMLImageElement, and so forth) that can be destroyed during GC and trigger recorded events in their destructor, and cross compartment wrappers that are destroyed during GC sweeping can trigger recorded events when they are regenerated later. There are a few ways these could be handled:

A) Move the recording boundary so that recorded events are never triggered by these operations. This is conceptually the cleanest, and is just an extension of what we already do with e.g. atomic refcounts and other quantities that change during GC. It will probably require the most work, though, as some of these destructors do things like posting events to the main thread's event loop and sending IPDL messages, which will need new infrastructure to separate out the deterministic and non-deterministic portions of these other systems.

B) Ensure that destruction of the data in these classes is performed consistently, using an event on the main thread that fires every second or so. During recording, objects which can trigger events have their destruction delayed until this event next fires. During replay, keep the corresponding objects alive until the same event firing, and then either destroy them or clean up their internal data, depending on whether they still have other references. I tried to do this approach, but don't like it that much.

C) Force these objects to behave deterministically, by simply never destroying them. This is of course the simplest approach and is the one this patch takes. It might be viable in the long run, since recordings are expected to be relatively short lived, but it still doesn't seem ideal.

Probably the best strategy is a mix of A) and C). Start by just not destroying such objects in a non-deterministic fashion, and then find some minimally invasive ways to improve Gecko so that the destruction of these objects does not interfere with the recording and can happen as it normally would.
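For illustration only, the recording half of option B) might look roughly like the sketch below. `DeferredDestroyQueue`, `Defer` and `Flush` are hypothetical names, not anything in the patch, and the replay half (keeping objects alive until the same event firing) is not shown.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <utility>
#include <vector>

// Hypothetical helper, not part of the patch: destruction work that could
// trigger recorded events is queued instead of running during GC.
class DeferredDestroyQueue {
 public:
  // Called at the point where the object would otherwise be destroyed.
  void Defer(std::function<void()> aDestroy) {
    mPending.push_back(std::move(aDestroy));
  }

  // Called from the periodic main-thread event. Returns how many
  // destruction callbacks were run in this batch.
  std::size_t Flush() {
    std::vector<std::function<void()>> batch;
    batch.swap(mPending);  // Work deferred while flushing waits for the next firing.
    for (auto& destroy : batch) {
      destroy();
    }
    return batch.size();
  }

  std::size_t PendingCount() const { return mPending.size(); }

 private:
  std::vector<std::function<void()>> mPending;
};
```

The point of the design is that nothing observable happens at GC time; all side effects move to the flush, which happens at a spot in the event loop that the recording can pin down.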

Brian Hackett [Laid off!]

Assignee

Comment 84

•

10 years ago

Attached patch Windows WIP (obsolete) — Details — Splinter Review

This patch makes enough changes so that the build works on Windows. It doesn't do anything else, but since this patch is so large (mostly from code reorganization and renaming variable types to appease cl) it seems better to split off from changes with actual substance.

Brian Hackett [Laid off!]

Assignee

Comment 85

•

10 years ago

Attached patch Windows WIP #2 (obsolete) — Details — Splinter Review

This patch goes on top of the previous one and improves the Windows build so that all thread/locking activity is recorded and so that the hooking infrastructure works. Right now this just hooks a few functions; the next step is to expand this to a more complete set.

Jeff Muizelaar [:jrmuizel]

Comment 86

Attachment #8705217 - Attachment is obsolete: true

Attachment #8705219 - Attachment is obsolete: true

Attachment #8705223 - Attachment is obsolete: true

Attachment #8705225 - Attachment is obsolete: true

Attachment #8705228 - Attachment is obsolete: true

Attachment #8705230 - Attachment is obsolete: true

Attachment #8705231 - Attachment is obsolete: true

Attachment #8705236 - Attachment is obsolete: true

Attachment #8705237 - Attachment is obsolete: true

•

9 years ago

Attached patch WIP (obsolete) — Details — Splinter Review

Rolled up patch updated with the last couple weeks of work. This mainly overhauls how breakpoints are handled when debugging a replaying process, and gets forward and reverse stepping to work. For now, stepping is handled by setting breakpoints at specific spots instead of setting a handler for a specific frame. This should work fine except when a script is invoked recursively. Eventually the replaying process should be able to keep track of frame identity even after rewinding, which would allow setting handlers on the frames themselves and fixing this.

Attachment #8737826 - Attachment is obsolete: true

Brian Hackett [Laid off!]

Assignee

Comment 114

•

9 years ago

Attached patch WIP #2 (obsolete) — Details — Splinter Review

Diff on top of the previous patch with the last couple weeks of work. This has various bug fixes and robustness and efficiency improvements. The most important changes are:

- PLDHashTables (along with the various classes that are based on them) all have a deterministic iteration order when recording or replaying. This allows removal of all other hashtable related instrumentation in the browser, except for one spot that uses std::set (whose code we can't modify).

- Rendered graphics data can be sent both ways between the replaying process and the middleman. This allows the replaying process to access the rendered graphics when needed, and also lets the middleman leverage the snapshot mechanism to restore earlier graphics data when updating compositor state after rewinding.

I've started working through recording and replaying the web-platform tests, one hundred or so at a time. Currently we can record and replay all the 2dcontext tests (about 800 tests, or 10% of the suite). Three of them give different pass/fail results from a normal execution, as we currently use the CG draw target when recording/replaying, instead of Skia (though since the backend in use is hidden behind the DrawTargetRecording and never renders anything in the replaying process, fixing this shouldn't be too hard).

From here I want to go through the rest of the web-platform tests, both to fix bugs and to see what browser features we can easily support while recording/replaying. With enough work it shouldn't be terribly hard to get all features tested by web-platform to record/replay, but I want to shift away from building out the prototype to handle new features (like webgl) and towards building a, uh, minimum viable product, something that people can start using and testing. Part of that is figuring out which features we do or don't want to support, and cleanly failing to record when dealing with an unsupported feature instead of just crashing somewhere.

Brian Hackett [Laid off!]

Assignee

Comment 115

•

9 years ago

Attached patch WIP #2 (obsolete) — Details — Splinter Review

With this patch, all web-platform tests run by mach can be recorded and replayed, with the following exceptions/caveats:

- The custom threading used when playing audio/video is not handled yet, so this patch prevents these from being activated when recording.

- DOM workers are similarly not allowed to be created. While recording/replaying executions with workers shouldn't be hard to do (except when SharedArrayBuffers are in use), right now the replay debugger only supports JS which runs on the main thread, and if we record/replay executions with workers then we should be able to debug the workers.

A couple other technologies aren't tested by web-platform but also need to be disallowed:

- WebGL uses a different graphics pipeline from normal rendering.

- asm.js creates signal handlers which interfere with the handler used while replaying.

Again, per comment 114 it should be possible to get all of these to work, but in the interest of focusing on improving the quality and robustness of record/replay on 'normal' web pages and on porting this stuff to other platforms (i.e. Windows) these restrictions are in place for now.

Attachment #8747462 - Attachment is obsolete: true

Brian Hackett [Laid off!]

Assignee

Comment 116

•

9 years ago

Attached patch WIP (obsolete) — Details — Splinter Review

Rolled up patch with all changes as of a few weeks ago. I'll attach a couple patches with new stuff shortly.

Attachment #8739966 - Attachment is obsolete: true

Attachment #8750383 - Attachment is obsolete: true

Brian Hackett [Laid off!]

Assignee

Comment 117

•

9 years ago

Attached patch Mac WIP (obsolete) — Details — Splinter Review

Some fit-and-finish changes to the Mac port, going on top of the previous patch. This has some bug fixes and internal improvements, but the main change is to allow triggering record/replay in a new tab from the browser's UI (Tools -> Web Developer -> Record/Replay Execution), instead of using environment variables at the command line.

Brian Hackett [Laid off!]

Assignee

•

9 years ago

Attached patch Windows WIP (obsolete) — Details — Splinter Review

Updated Windows WIP. This mostly adds a lot of redirections, including some nice clean handling of COM object interfaces, and is I think getting close to being able to replay simple web pages.

Attachment #8757439 - Attachment is obsolete: true

Jens

Updated

•

9 years ago

Updated

•

9 years ago

Blocks: 1282039

Jens

Updated

•

9 years ago

Comment 122

•

9 years ago

Attached patch WIP (obsolete) — Details — Splinter Review

Rolled up and combined patch (for both mac and windows), incorporating the last couple of weeks of work. Windows replaying continues to slowly progress, and this also has some nice fixes for Mac like removing all the manual instrumentation of objective C messages (the core objc_msgSend routine is hooked instead) and a general cleanup and C++ification of the public API.

•

9 years ago

Attached patch Part 8a - Manually ensure hash table iteration ordering consistency in mMutants std::set. (obsolete) — Details — Splinter Review

The mMutants set in ShadowLayerForwarder is an std::set whose iteration order may differ between recording and replay. This patch manually instruments uses of this member to ensure consistent behavior, though it's pretty ugly. Alternatives would be to just use a PLDHashTable (which always gives consistent behavior, see later patches) or to make a wrapper class like mozilla::set that wraps an std::set and behaves consistently while recording/replaying.
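A minimal sketch of what such a wrapper could look like, assuming the goal is simply an iteration order that depends on the sequence of insertions and removals rather than on pointer values. `StableSet` and its methods are hypothetical names; this is not the wrapper from any patch in this bug.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <map>
#include <vector>

// Hypothetical set wrapper: membership is tracked by value, but iteration
// follows insertion order, so two runs that perform the same operations see
// the same order even if the element addresses differ between runs.
template <typename T>
class StableSet {
 public:
  // Returns true if the value was newly inserted.
  bool Insert(const T& aValue) {
    if (mIndex.count(aValue)) {
      return false;
    }
    std::uint64_t seq = mNextSeq++;
    mIndex.emplace(aValue, seq);
    mOrder.emplace(seq, aValue);
    return true;
  }

  // Returns true if the value was present and has been removed.
  bool Erase(const T& aValue) {
    auto it = mIndex.find(aValue);
    if (it == mIndex.end()) {
      return false;
    }
    mOrder.erase(it->second);
    mIndex.erase(it);
    return true;
  }

  bool Contains(const T& aValue) const { return mIndex.count(aValue) != 0; }
  std::size_t Size() const { return mIndex.size(); }

  // Snapshot of the elements, oldest insertion first.
  std::vector<T> InInsertionOrder() const {
    std::vector<T> out;
    out.reserve(mOrder.size());
    for (const auto& entry : mOrder) {
      out.push_back(entry.second);
    }
    return out;
  }

 private:
  std::map<T, std::uint64_t> mIndex;  // value -> insertion sequence number
  std::map<std::uint64_t, T> mOrder;  // sequence number -> value
  std::uint64_t mNextSeq = 0;
};
```

The trade-off is a second map per set and slightly more work per insert and erase, in exchange for not having to instrument each use site by hand.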

Josh Matthews [:jdm]

Comment 169

•

9 years ago

Comment on attachment 8790771 [details] [diff] [review] Part 5a - Disable incremental GC when recording or replaying. Could the parts of this patch that don't involve record/replay be extracted into another patch in a new bug? This sounds really important for Servo, where we currently have incremental GC disabled.

Brian Hackett [Laid off!]

Assignee

Comment 170

•

9 years ago

Attached patch Part 8b - Manually record/replay mach_msg IPC calls. — Details — Splinter Review

mach_msg is used for both inter-thread and inter-process communication. This patch manually records/replays some calls to this method that are used for IPC, though it might be better to redirect mach_msg itself and intercept all messages that are sent/received between threads as well.

Brian Hackett [Laid off!]

Assignee

•

9 years ago

Depends on: 1302523

Brian Hackett [Laid off!]

Assignee

Comment 204

•

9 years ago

•

9 years ago

Attachment #8790756 - Flags: review?(nfroyd)

Brian Hackett [Laid off!]

Assignee

9 years ago

Attachment #8790823 - Flags: review?(dvander)

Kannan Vijayan [:djvj]

Comment 205

•

9 years ago

Heads up Brian.. Vladan is no longer at mozilla. You may want to find another reviewer for that patch.

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Attachment #8790825 - Flags: review?(luke)

Brian Hackett [Laid off!]

Assignee

Comment 206

•

•

9 years ago

Attachment #8790840 - Flags: review?(nfroyd)

Brian Hackett [Laid off!]

Assignee

•

9 years ago

Comment on attachment 8790843 [details] [diff] [review] Part 9a - PReplay protocol and parent side implementation. This patch has a mix of IPC (communication with the replaying process), compositor updates (take messages from the replaying process and forward them to the chrome process' PCompositorBridgeParent), and draw event replay (read draw events from the replaying process and translate them in the middleman).

Attachment #8790843 - Flags: review?(wmccloskey)

Attachment #8790843 - Flags: review?(nical.bugzilla)

Attachment #8790843 - Flags: review?(bas)

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Attachment #8790847 - Flags: review?(nical.bugzilla)

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Attachment #8790848 - Flags: review?(wmccloskey)

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Attachment #8790851 - Flags: review?(wmccloskey)

Nicholas Nethercote [inactive]

Updated

•

9 years ago

Attachment #8790755 - Flags: review?(n.nethercote) → review+

Brian Hackett [Laid off!]

Assignee

Comment 210

•

9 years ago

Comment on attachment 8790854 [details] [diff] [review] Part 10a - Child side implementation of PReplay protocol. This patch has the IPC/compositor/graphics logic in the replaying process corresponding to part 9a.

Attachment #8790854 - Flags: review?(wmccloskey)

Attachment #8790854 - Flags: review?(nical.bugzilla)

Attachment #8790854 - Flags: review?(bas)

Luke Wagner [:luke]

Comment 211

•

9 years ago

Comment on attachment 8790825 [details] [diff] [review] Part 6d - Disable wasm signal handlers when recording or replaying. Righto, makes sense.

Attachment #8790825 - Flags: review?(luke) → review+

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Attachment #8790856 - Flags: review?(n.nethercote)

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Attachment #8790857 - Flags: review?(wmccloskey)

Brian Hackett [Laid off!]

Assignee

•

9 years ago

Attachment #8790898 - Flags: review?(jimb)

Brian Hackett [Laid off!]

Assignee

•

9 years ago

Attachment #8790734 - Flags: review?(wmccloskey)

Andrew McCreight [:mccr8]

Comment 212

•

9 years ago

Comment on attachment 8790774 [details] [diff] [review] Part 5c - Don't dispatch runnables for GC or finalization when under the GC and recording or replaying.

Review of attachment 8790774 [details] [diff] [review]:
-----------------------------------------------------------------

::: dom/base/nsJSEnvironment.cpp
@@ +1968,5 @@
> // static
> void
> nsJSContext::PokeShrinkingGC()
> {
> +  if (sShrinkingGCTimer || sShuttingDown || PR_IsRecordingOrReplaying()) {

Could you please add a static method named like SkipCollection which is just "return sShuttingDown || PR_IsRecordingOrReplaying();" and use that here and in the rest of the file? That would make it a little clearer what is the intent.
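A rough sketch of the suggested helper, written as a standalone file so it can be compiled on its own. The globals and the body of `PR_IsRecordingOrReplaying()` below are stand-ins (assumptions) for the real ones in nsJSEnvironment.cpp and NSPR, and `PokeShrinkingGC` is reduced to the early-return check.

```cpp
#include <cassert>

// Stand-ins for state that lives elsewhere in the real code (assumptions).
static bool gRecordingOrReplaying = false;
static bool PR_IsRecordingOrReplaying() { return gRecordingOrReplaying; }
static bool sShuttingDown = false;
static void* sShrinkingGCTimer = nullptr;
static int sTimersCreated = 0;

// The helper from the review comment: one place that says
// "collections are not scheduled right now".
static bool SkipCollection() {
  return sShuttingDown || PR_IsRecordingOrReplaying();
}

// Simplified caller: only the guard is meaningful here, the timer
// creation is faked with a counter.
static void PokeShrinkingGC() {
  if (sShrinkingGCTimer || SkipCollection()) {
    return;
  }
  static int dummyTimer;
  sShrinkingGCTimer = &dummyTimer;
  ++sTimersCreated;
}
```

Other call sites in the file that currently test `sShuttingDown` directly could then use the same helper, so the record/replay condition is stated once.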

Attachment #8790774 - Flags: review?(continuation) → review+

Andrew McCreight [:mccr8]

Updated

•

9 years ago

Attachment #8790796 - Flags: review?(continuation) → review+

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 213

•

9 years ago

Comment on attachment 8790822 [details] [diff] [review] Part 6b - Disable DOM workers when recording or replaying. I wouldn't really put PR_IsRecordingOrReplaying() in the constructor, but just hide the interfaces from the global. So, in .webidl have something like Func="SomeMethodName". But if you want to start with this approach which breaks any pages using Workers, fine. Though I guess those pages can't be used with replay anyhow, so shouldn't matter much. PR_IsRecordingOrReplaying() isn't threadsafe, but I guess the idea is that PR_IsRecordingOrReplaying() returns always the same value during the process lifetime.

Attachment #8790822 - Flags: review?(bugs) → review+

Nicolas Silva [:nical]

Comment 214

•

9 years ago

Wow, this is one of the most exciting features ever, but the implications in terms of how easy it is to forget to instrument future code in the content process and break recording are very scary.

Nathan Froyd [:froydnj]

Updated

•

9 years ago

Attachment #8790788 - Flags: review?(nfroyd) → review+

Nathan Froyd [:froydnj]

Updated

•

9 years ago

Attachment #8790804 - Flags: review?(nfroyd) → review+

Nathan Froyd [:froydnj]

•

9 years ago

Attachment #8790767 - Flags: review?(nfroyd) → review+

Nathan Froyd [:froydnj]

Comment 218

•

9 years ago

Comment on attachment 8790748 [details] [diff] [review] Part 4c - Don't record activity on ThreadSafeAutoRefCnt or nsStringBuffer refcounts.

Review of attachment 8790748 [details] [diff] [review]:
-----------------------------------------------------------------

Why does not recording these avoid overhead when we're not recording/replaying? Are we actually calling into the record/replay machinery for every atomic access even when we're not recording/replaying? Or am I mis-parsing your explanation?

I am curious whether subsequent patches address the GC finalization bits.

Attachment #8790748 - Flags: review?(nfroyd) → review+

Nathan Froyd [:froydnj]

Updated

•

9 years ago

Attachment #8790756 - Flags: review?(nfroyd) → review+

Nathan Froyd [:froydnj]

Updated

•

9 years ago

Attachment #8790768 - Flags: review?(nfroyd) → review+

Nathan Froyd [:froydnj]

Comment 219

•

9 years ago

Comment on attachment 8790769 [details] [diff] [review] Part 4m - Don't record some threading atomics.

Review of attachment 8790769 [details] [diff] [review]:
-----------------------------------------------------------------

Major brownie points for separating out this massive patch into small understandable pieces! Thank you!

Attachment #8790769 - Flags: review?(nfroyd) → review+

Eddy Bruel [:ejpbruel]

Comment 220

•

9 years ago

Comment on attachment 8790895 [details] [diff] [review] Part 15a - Client side devtools changes. For frontend changes to the debugger James is probably a better reviewer than me.

Attachment #8790895 - Flags: review?(ejpbruel) → review?(jlong)

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Attachment #8790840 - Flags: review?(wtc)

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Attachment #8790764 - Flags: review?(wmccloskey) → review?(wtc)

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Attachment #8790736 - Flags: review?(wtc)

Brian Hackett [Laid off!]

Assignee

Comment 221

•

9 years ago

Comment on attachment 8790724 [details] [diff] [review] Part 1a - Record/replay/rewind infrastructure. This patch adds a new NSPR public header.

Attachment #8790724 - Flags: review?(wtc)

Jeff Muizelaar [:jrmuizel]

Comment 222

•

9 years ago

Comment on attachment 8790828 [details] [diff] [review] Part 8a - Manually ensure hash table iteration ordering consistency in mMutants std::set.

Review of attachment 8790828 [details] [diff] [review]:
-----------------------------------------------------------------

I would prefer using some kind of stable set here instead of having to manually instrument. Is there a plan for how we're going to avoid accidentally introducing non-determinism like this in?

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 223

•

9 years ago

(In reply to Nathan Froyd [:froydnj] from comment #215)
> Comment on attachment 8790840 [details] [diff] [review]
> Part 8f - Ensure that PL and PLD hashtables have consistent iteration order
> when recording/replaying.
>
> Review of attachment 8790840 [details] [diff] [review]:
> -----------------------------------------------------------------
>
> I think this is OK, though I have concerns about how much overhead this adds
> when not recording/replaying: history has told us that this code is
> perf-sensitive (I can't pull up the specific Android bug I'm thinking about
> right now--we had startup regressions and essentially worked around it by
> eliminating an extra branch), and adding a (non-XUL!) global variable check
> in these paths sounds expensive, at least. (I think that'd be 2 pointer
> chases on ARM, 2 (?) on x86, but only 1 on x86-64.) Have you done any
> performance testing of this?
>

Yeah, hashtable code is super perf sensitive and already too slow (which is why we in some cases have caches on top of it), so yes, please be careful with that.

Brian Hackett [Laid off!]

Assignee

•

9 years ago

(In reply to Jeff Muizelaar [:jrmuizel] from comment #222)
> Comment on attachment 8790828 [details] [diff] [review]
> Part 8a - Manually ensure hash table iteration ordering consistency in
> mMutants std::set.
>
> Review of attachment 8790828 [details] [diff] [review]:
> -----------------------------------------------------------------
>
> I would prefer using some kind of stable set here instead of having to
> manually instrument. Is there a plan for how we're going to avoid
> accidentally introducing non-determinism like this in?

The best way of avoiding this non-determinism is to use hash table classes that always behave consistently when recording/replaying. Right now that includes all tables based on PLDHashTable or PLHashTable, but it does not include std::set. Would it be OK to change mMutants to be a nsDataHashtable, or should I make a wrapper for std::set that behaves consistently?

Andrew McCreight [:mccr8]

Comment 229

•

•

9 years ago

•

9 years ago

@bhackett There was a frontend devtools patch that I r-'ed above. This bug already is huge, and I think it would be best if you landed this feature without any frontend changes. After all of the implementation lands, we can land the UI to expose it in a separate bug. That way this bug is more focused, and it'll be easier to help you land the frontend changes without the discussion about backend details.

I say this also because we're developing the debugger outside of m-c now, so it'll be easier to coordinate those changes in a more focused bug. We also need to work with our designer to implement the right UX.

Jim Blandy :jimb

Comment 243

•

9 years ago

(In reply to James Long (:jlongster) from comment #242) As a procedural question, separating out the front-end patches into their own bug seems like a good idea. Regarding UI, we've already begun discussing possibilities with Helen, but we were waiting to get something in-tree on the server side to understand exactly how the new server behaves before developing the UI in detail. I think it would be extremely valuable for people (not least us!) to have some way to experiment with WebReplay before the fully-designed UI is ready. Could we land rough work behind a pref? What's the best way to make this happen?

Flags: needinfo?(jlong)

Jan de Mooij [:jandem]

•

9 years ago

I might be misunderstanding the big picture, here, but it looks like WebReplay records absolutely everything in the content process. All graphics operations are going in the DrawTarget recording, infra, all atomic reads and writes that graphics do to synchronize with the compositor are instrumented, etc. What surprises me is that graphics (canvas aside) is purely a result of layout. If you can replay layout, you should be able to replay graphics without recording it. Layout itself can probably be replayed from a limited set of inputs. Is it possible to reduce the amount of things that need to be recorded and replayed in order for this to work under this design? I am very worried that this amount of instrumentation may be fragile and may become a large maintenance burden on everything that lives in the content process. The bug title says "prototype". Are these patches intended to land or are they more intended as requests for feedback?

Flags: needinfo?(bhackett1024)

Brian Hackett [Laid off!]

Assignee

Comment 252

•

9 years ago

(In reply to James Long (:jlongster) from comment #242)
> @bhackett There was a frontend devtools patch that I r-'ed above. This bug
> already is huge, and I think it would be best if you landed this feature
> without any frontend changes. After all of the implementation lands, we can
> land the UI to expose it in a separate bug. That way this bug is more
> focused, and it'll be easier to help you land the frontend changes without
> the discussion about backend details.
>
> I say this also because we're developing the debugger outside of m-c now, so
> it'll be easier to coordinate those changes in a more focused bug. We also
> need to work with our designer to implement the right UX.

Hi, my main concern is that without some changes to the debugger client, a lot of the features in this bug can't be tested at all. The automated test in part 17 needs new functionality from the client to detect when the replay has finished, and future automated tests that exercise rewinding etc. will need to use the debugger. I guess I don't see what's so bad about landing some changes to the existing deprecated client when it is the only in-tree way to access most of what this bug is doing. All user visible changes to the UI which this bug makes are gated on the devtools.recordreplay.enabled pref, which is, yeah, off by default (and for now only available at all on nightly mac builds).

Brian Hackett [Laid off!]

Assignee

Comment 253

•

9 years ago

(In reply to Nicolas Silva [:nical] from comment #251)
> I might be misunderstanding the big picture, here, but it looks like
> WebReplay records absolutely everything in the content process. All graphics
> operations are going in the DrawTarget recording, infra, all atomic reads
> and writes that graphics do to synchronize with the compositor are
> instrumented, etc.

Not quite everything in the content process is recorded/replayed exactly. Besides the allowed non-determinism discussed in the design document (see the URL field at the top of the bug) we generally replay all behavior except for what goes on under system/library calls which we have redirected. There is an exception for draw targets, though. While we are replaying we never create a native draw target (e.g. DrawTargetSkia) but rather create a DrawTargetRecording which wraps a DrawTargetRecordReplay (which wraps nothing). The recorded draw events are sent to the middleman process for rendering in a native draw target, which then sends the rendered data back to the replaying process in case any content needs it.

> What surprises me is that graphics (canvas aside) is purely a result of
> layout. If you can replay layout, you should be able to replay graphics
> without recording it. Layout itself can probably be replayed from a limited
> set of inputs.

Yes, layout and the commands issued to draw targets are being replayed exactly. Most of the gfx changes are related to the special record/replay draw targets described above.

> Is it possible to reduce the amount of things that need to be recorded and
> replayed in order for this to work under this design?

As far as graphics code goes, yes, before the patch in comment 81 we did not modify the draw target code at all and just redirected calls to native drawing functions (e.g. CGContextDrawImage). But since even then the replaying process does not reproduce what goes on inside those native drawing functions, we need some way of getting the rendered graphics data. Just including the graphics data in the recording consumes an enormous amount of space (per comment 77, up to 95% of the recording on wikipedia). We could redirect the native drawing functions and then replay them in the middleman process, which is what the patches in comments 77 and 80 do, but this approach is effectively duplicating what DrawTargetRecording does, except in a platform specific way, so later patches started using DrawTargetRecording.

> I am very worried that this amount of instrumentation may be fragile and may
> become a large maintenance burden on everything that lives in the content
> process.

Which instrumentation are you mainly concerned about? I've been trying to minimize the amount of instrumentation needed (FWIW it used to be a lot worse, especially for hash tables and objective C messages), but there are more steps I can take to reduce the changes to existing code. Like the draw target issue that motivated the redesign in comment 81, though, such steps might end up duplicating functionality within the code base and not really be good design improvements.

> The bug title says "prototype". Are these patches intended to land or are
> they more intended as requests for feedback?

These patches are intended to land. I'll change the bug title.
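To make the draw target arrangement in this comment concrete, here is a heavily simplified sketch. None of these classes are the real gfx classes: `SketchDrawTarget`, `NullTarget`, `RecordingTarget`, `CountingNativeTarget` and `ReplayEvents` are hypothetical stand-ins, and a single `FillRect` call stands in for the whole drawing API.

```cpp
#include <cassert>
#include <string>
#include <vector>

// One logged drawing call. The real event format is not shown in this bug.
struct DrawEvent {
  std::string op;
  int x, y, w, h;
};

class SketchDrawTarget {
 public:
  virtual ~SketchDrawTarget() = default;
  virtual void FillRect(int x, int y, int w, int h) = 0;
};

// Plays the role of DrawTargetRecordReplay: wraps nothing, renders nothing.
class NullTarget : public SketchDrawTarget {
 public:
  void FillRect(int, int, int, int) override {}
};

// Plays the role of DrawTargetRecording: logs each call, then forwards it.
class RecordingTarget : public SketchDrawTarget {
 public:
  explicit RecordingTarget(SketchDrawTarget& aInner) : mInner(aInner) {}
  void FillRect(int x, int y, int w, int h) override {
    mEvents.push_back({"FillRect", x, y, w, h});
    mInner.FillRect(x, y, w, h);
  }
  const std::vector<DrawEvent>& Events() const { return mEvents; }

 private:
  SketchDrawTarget& mInner;
  std::vector<DrawEvent> mEvents;
};

// Plays the role of a native target in the middleman; "rendering" is
// modelled as counting filled pixels.
class CountingNativeTarget : public SketchDrawTarget {
 public:
  void FillRect(int, int, int w, int h) override { mPixelsFilled += w * h; }
  long PixelsFilled() const { return mPixelsFilled; }

 private:
  long mPixelsFilled = 0;
};

// Middleman side: translate received events into calls on a native target.
inline void ReplayEvents(const std::vector<DrawEvent>& aEvents,
                         SketchDrawTarget& aNative) {
  for (const DrawEvent& e : aEvents) {
    if (e.op == "FillRect") {
      aNative.FillRect(e.x, e.y, e.w, e.h);
    }
  }
}
```

In this picture the replaying process only ever holds a `RecordingTarget` over a `NullTarget`, so it issues the same drawing commands on every replay without touching platform drawing code, and the only place pixels are produced is wherever `ReplayEvents` runs.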

Flags: needinfo?(bhackett1024)

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Summary: Build a web replay prototype → Web replay initial landing

David Anderson [:dvander] - inactive, e-mail if emergency

Updated

•

9 years ago

Attachment #8790823 - Flags: review?(dvander) → review+

Georg Fritzsche [:gfritzsche]

Comment 254

•

9 years ago

Comment on attachment 8790818 [details] [diff] [review] Part 5n - Don't perform telemetry while recording or replaying.

Review of attachment 8790818 [details] [diff] [review]:
-----------------------------------------------------------------

::: toolkit/components/telemetry/TelemetryHistogram.cpp
@@ +1726,5 @@
> // it is used, and (2) because it is never de-initialised, and
> // a normal Mutex would show up as a leak in BloatView. StaticMutex
> // also has the "OffTheBooks" property, so it won't show as a leak
> // in BloatView.
> +static StaticMutexNotRecorded gTelemetryHistogramMutex;

Do we need to do the same for TelemetryScalar.cpp?

@@ +1762,5 @@
>   MOZ_ASSERT(!gInitDone, "TelemetryHistogram::InitializeGlobalState "
>              "may only be called once");
>
> +  gCanRecordBase = canRecordBase && !PR_IsRecordingOrReplaying();
> +  gCanRecordExtended = canRecordExtended && !PR_IsRecordingOrReplaying();

What problems specifically is this solving?
(1) That we shouldn't collect Telemetry from the record/replay content processes?
Or (2) does it solve specific implementation issues with the patch here?

I assume we should do (1), as rr content processes seem to be a special case that would bias our measurements. We should do that one level higher in Telemetry.cpp instead of here though, to also cover TelemetryScalar etc.

Attachment #8790818 - Flags: review?(gfritzsche)

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Attachment #8790724 - Flags: review?(wtc)

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Attachment #8790736 - Flags: review?(wtc)

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Attachment #8790764 - Flags: review?(wtc)

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Attachment #8790840 - Flags: review?(wtc)

Jason Orendorff [:jorendorff]

•

9 years ago

(In reply to Jan de Mooij [:jandem] from comment #247)
> Comment on attachment 8790726 [details] [diff] [review]
> Part 1c - Redirections.
>
> Review of attachment 8790726 [details] [diff] [review]:
> -----------------------------------------------------------------
>
> Sorry, I'm not very comfortable signing off on this. I don't know much about
> the low-level Darwin/Windows bits and non-SM code has its own style,
> conventions, etc. Nathan, maybe you can take a look or know who's a good
> reviewer?

Sorry, I thought I replied to this yesterday, but I must have forgotten to hit submit or something. I can at least give things a good once-over, but I won't be able to give this the chunk of time it deserves until early next week.

Nicolas Silva [:nical]

Comment 259

•

9 years ago

(In reply to Brian Hackett (:bhackett) from comment #256)
> (In reply to Nicolas Silva [:nical] from comment #255)
> > Ok, so graphics is not actually recorded and replayed. The replaying process
> > replays some inputs which eventually cause layout to be calculated and
> > graphics is naturally executed as a result of that. The use of
> > DrawTargetRecording is purely to remote the actual drawing to the middleman
> > process, and not actually part of the recording. Am I right?
>
> Yes.

Ok great.

> Unfortunately, this would introduce a level of non-determinism which we
> wouldn't be able to cope with. [...]

This is unfortunate. If we do this it means we'll have to strive to remove as much logic from the content process as possible and this type of constraint will give us more to worry about as we add parallelism to our code à la servo.

We should look into the configurations that limit the dependencies on ipc, cross-process atomics and the likes. There are certainly a few knobs we can turn on the gfx side to use the slower but simpler paths during recording and replay.

Brian Hackett [Laid off!]

Assignee

Comment 260

•

9 years ago

(In reply to Nicolas Silva [:nical] from comment #259)
> This is unfortunate. If we do this it means we'll have to strive to remove
> as much logic from the content process as possible and this type of
> constraint will give us more to worry about as we add parallelism to our
> code à la servo.

I don't understand your concerns. The presence of the record/replay system shouldn't affect plans for adding parallelism anywhere in m-c. We can record/replay parallel code as easily as sequential code, provided it is race free. Rust code isn't handled yet, but as soon as (or before) it is used in m-c I will fix that; I haven't looked at Rust's internals but as long as it uses the standard pthreads etc. primitives for synchronization the only required changes should be for instrumenting atomics, as in part 2a of the patches here.

As things stand now, the only instrumentation anywhere in gfx which is necessary for correct replay is for the shared memory atomics. This instrumentation is straightforward.

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 261

•

9 years ago

Comment on attachment 8790827 [details] [diff] [review] Part 7 - Ensure deterministic interaction of GC with CC and object references.

> nsWrapperCache::CheckCCWrapperTraversal(void* aScriptObjectHolder,
>                                         nsScriptObjectTracer* aTracer)
> {
>-  JSObject* wrapper = GetWrapper();
>+  JSObject* wrapper = GetWrapperForGC();

This is a major change to behavior. GetWrapper() does unmark graying, but this new GetWrapperForGC() does not. But OK, in this case it should be fine.

>   nsWrapperCache() : mWrapper(nullptr), mFlags(0)
>   {
>+    PR_RecordReplayRegisterWeakPointer(this);
>   }
>   ~nsWrapperCache()
>   {
>+    PR_RecordReplayUnregisterWeakPointer(this);

What do these new PR_ methods do? I don't see their definition in the full patch. This code can be a bit hot in some microbenchmarks, so I'd like to see what the methods actually do.

>+  /**
>+   * Get the underlying wrapper object for use during GC.
>+   */
>+  JSObject* GetWrapperForGC() const
>+  {
>+    return mWrapper;
>+  }

Please don't add this. We really should use GetWrapperPreserveColor(), since its name indicates clearly how it is different from GetWrapper(). Or at least call it GetWrapperPreserveColorForGC() or some such. Though it is totally unclear when this method should be used and when GetWrapperPreserveColor() should be, so this needs some documentation about expected usage.

>+  static void RecordReplayRun(void* aObj, void*, void*)
>+  {
>+    AsyncFreeSnowWhite* freer = reinterpret_cast<AsyncFreeSnowWhite*>(aObj);
>+    freer->mActive = PR_RecordReplayValue(freer->mActive);
>+    freer->mPurge = PR_RecordReplayValue(freer->mPurge);
>+    freer->mContinuation = PR_RecordReplayValue(freer->mContinuation);
>+    freer->Run();
>+  }
>+
>   void Dispatch(bool aContinuation = false, bool aPurge = false)
>   {
>     if (mContinuation) {
>       mContinuation = aContinuation;
>     }
>     mPurge = aPurge;
>-    if (!mActive && NS_SUCCEEDED(NS_DispatchToCurrentThread(this))) {
>-      mActive = true;
>+    if (!mActive) {
>+      if (PR_IsRecordingOrReplaying()) {
>+        PR_RecordReplayActivateTrigger(this);
>+        mActive = true;
>+      } else {
>+        if (NS_SUCCEEDED(NS_DispatchToCurrentThread(this))) {
>+          mActive = true;
>+        }
>+      }
>     }
>   }
>
>-  AsyncFreeSnowWhite() : mContinuation(false), mActive(false), mPurge(false) {}
>+  AsyncFreeSnowWhite() : mContinuation(false), mActive(false), mPurge(false)
>+  {
>+    PR_RecordReplayRegisterTrigger(this, RecordReplayRun, nullptr, nullptr);
>+  }
>+
>+  ~AsyncFreeSnowWhite()
>+  {
>+    PR_RecordReplayUnregisterTrigger(this);
>+  }

To keep this code somewhat maintainable, the PR_Record stuff needs comments explaining why the calls are here and what they are doing. That applies to the whole patch.

>+++ b/xpcom/base/nsCycleCollector.cpp
>@@ -3824,16 +3824,19 @@ nsCycleCollector::BeginCollection(ccType
>
>   // BeginCycleCollectionCallback() might have started an IGC, and we need
>   // to finish it before we run FixGrayBits.
>   FinishAnyIncrementalGCInProgress();
>   timeLog.Checkpoint("Pre-FixGrayBits finish IGC");
>
>   FixGrayBits(forceGC, timeLog);
>
>+  if (PR_IsRecordingOrReplaying())
>+    PR_RecordReplayExecuteTriggers();

Nit: always use {} with 'if' (js/* may still use an unusual coding style, but this is xpcom).

>+++ b/xpcom/threads/nsThread.cpp
>@@ -989,16 +989,20 @@ void canary_alarm_handler(int signum)
>   PR_END_MACRO
>
> NS_IMETHODIMP
> nsThread::ProcessNextEvent(bool aMayWait, bool* aResult)
> {
>   LOG(("THRD(%p) ProcessNextEvent [%u %u]\n", this, aMayWait,
>        mNestedEventLoopDepth));
>
>+  if (PR_IsRecordingOrReplaying() && NS_IsMainThread()) {

Use (mIsMainThread == MAIN_THREAD) and not NS_IsMainThread(), and put PR_IsRecordingOrReplaying() after the thread check.

r- mainly because this all needs documentation. Developers modifying this code after these patches have landed need to understand what the PR_*Record* code is doing there.

Attachment #8790827 - Flags: review?(bugs) → review-

Brian Hackett [Laid off!]

Assignee

Comment 265

•

9 years ago

Attached patch Part 1a - Record/replay/rewind infrastructure. (obsolete) — Details — Splinter Review

The core record/replay/rewind code and public API. The public API is now in mfbt (there is also a much smaller NSPR API in bug 1303779).

•

9 years ago

Comment on attachment 8790764 [details] [diff] [review] Part 4i - Don't record NSPR thread/lock-management atomics. This part has been moved entirely into bug 1303779.

Attachment #8790764 - Attachment is obsolete: true

Brian Hackett [Laid off!]

Assignee

Comment 269

•

9 years ago

Comment on attachment 8790839 [details] [diff] [review] Part 8e - Don't incorporate environment into random number seed when recording or replaying. This part has moved into bug 1303785.

Attachment #8790839 - Attachment is obsolete: true

Brian Hackett [Laid off!]

Assignee

Comment 270

•

9 years ago

Attached patch Part 8f - Ensure that PLD hashtables have consistent iteration order when recording/replaying. — Details — Splinter Review

The PL hashtable instrumentation has moved into bug 1303779, and the PLD hashtable instrumentation has been rewritten to have much less overhead.

Attachment #8790840 - Attachment is obsolete: true

Attachment #8792592 - Flags: review?(nfroyd)

James Long (:jlongster)

Comment 271

•

9 years ago

(In reply to Brian Hackett (:bhackett) from comment #252)
> (In reply to James Long (:jlongster) from comment #242)
> > @bhackett There was a frontend devtools patch that I r-'ed above. This bug
> > already is huge, and I think it would be best if you landed this feature
> > without any frontend changes. After all of the implementation lands, we can
> > land the UI to expose it in a separate bug. That way this bug is more
> > focused, and it'll be easier to help you land the frontend changes without
> > the discussion about backend details.
> >
> > I say this also because we're developing the debugger outside of m-c now, so
> > it'll be easier to coordinate those changes in a more focused bug. We also
> > need to work with our designer to implement the right UX.
>
> Hi, my main concern is that without some changes to the debugger client, a
> lot of the features in this bug can't be tested at all. The automated test
> in part 17 needs new functionality from the client to detect when the replay
> has finished, and future automated tests that exercise rewinding etc. will
> need to use the debugger. I guess I don't see what's so bad about landing
> some changes to the existing deprecated client when it is the only in-tree
> way to access most of what this bug is doing. All user visible changes to
> the UI which this bug makes are gated on the devtools.recordreplay.enabled
> pref, which is, yeah, off by default (and for now only available at all on
> nightly mac builds).

It's not terrible, but if there are a bunch of tests being written against the old debugger it's going to make it that much harder to implement it in the new debugger. It would be great if we could just implement it in the new debugger now. The new debugger is already on by default in nightly (not riding trains yet, but it will soon).

Many of the frontend changes can land regardless of which debugger is used: the menu items, the debugger client update (I'm assuming there are new protocol methods), etc. At that point all we need to do on our side is add buttons to call the protocol methods, correct?

Brian Hackett [Laid off!]

Assignee

•

9 years ago

Attachment #8790884 - Flags: review- → review?(jmuizelaar)

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Depends on: 1303901

Bill McCloskey [inactive unless it's an emergency] (:billm)

Comment 276

•

9 years ago

Hi Brian, Can you explain more why you took this approach instead of what WebKit does? I read your design document (which is really great), but I didn't understand that section. I'm still catching up on a lot of reviews, but I should have time for this this week.

Flags: needinfo?(bhackett1024)

Brian Hackett [Laid off!]

Assignee

Comment 277

•

9 years ago

(In reply to Bill McCloskey (:billm) from comment #276) > Hi Brian, > Can you explain more why you took this approach instead of what WebKit does? > I read your design document (which is really great), but I didn't understand > that section. Hi Bill, I updated the "Comparison with other projects" section of the design document with some more information on this. The system library APIs are much better defined and more stable than anything we have internally in Gecko or, I assume, WebKit. Any place where internal browser APIs are instrumented to act as a recording boundary will impose a maintenance burden. The only point we do that in this project is for the DrawTarget API, and I'm wondering now if that is the best strategy and whether to go back to an approach to graphics rendering like comment 77. (Also, FWIW, way back at the start of this project I wanted to pursue an approach similar to WebKit's, but roc talked me out of it.)

Flags: needinfo?(bhackett1024)

Jeff Walden [:Waldo]

Comment 278

•

9 years ago

Comment on attachment 8790738 [details] [diff] [review] Part 2b - Don't record activity in atomics unit tests. Review of attachment 8790738 [details] [diff] [review]: ----------------------------------------------------------------- ::: mfbt/tests/TestAtomics.cpp @@ +10,5 @@ > #include <stdint.h> > > using mozilla::Atomic; > using mozilla::MemoryOrdering; > +using mozilla::AtomicRecording; Alphabetical.

Attachment #8790738 - Flags: review?(jwalden+bmo) → review+

Jeff Walden [:Waldo]

Comment 279

•

9 years ago

Comment on attachment 8792586 [details] [diff] [review] Part 2a - Atomics interface changes.

Review of attachment 8792586 [details] [diff] [review]:
-----------------------------------------------------------------

::: mfbt/Atomics.h
@@ +184,5 @@
> +namespace detail {
> +
> +template <AtomicRecording Recording> struct AutoRecordAtomicAccess;
> +
> +template <>

<> and such go immediately adjacent to template, throughout.

@@ +193,5 @@
> +
> +template <>
> +struct AutoRecordAtomicAccess<AtomicRecording::PreserveOrdering> {
> +  AutoRecordAtomicAccess() { recordreplay::BeginOrderedAtomicAccess(); }
> +  ~AutoRecordAtomicAccess() { recordreplay::EndOrderedAtomicAccess(); }

So the theory is a global load of a bool will be super-well-predicted or something, and then there's ~zero cost to this at all times in the future, unless recording (in which case there's an indirect function pointer call, an NSPR lock, and untold other gunk)? I'm as leery as the next person of this being true, but I guess at worst there's only shipping record/replay in dev edition or something. :-\

@@ +653,5 @@
>    }
>
>  private:
> +  template<MemoryOrdering AnyOrder, AtomicRecording AnyRecording>
> +  AtomicBase(const AtomicBase<T, AnyOrder, AnyRecording>& aCopy) = delete;

While you're here, make this

  AtomicBase(const AtomicBase& aCopy) = delete;

Templated copy constructors actually *aren't* formally copy constructors. And while they do participate in overloading and such, I'm not 100% confident this inhibits creation of the implicit copy constructor. Better to remove all the templating to be 100% confident.

@@ +675,5 @@
>    T operator--() { return Base::Intrinsics::dec(Base::mValue) - 1; }
>
>  private:
> +  template<MemoryOrdering AnyOrder, AtomicRecording AnyRecording>
> +  AtomicBaseIncDec(const AtomicBaseIncDec<T, AnyOrder, AnyRecording>& aCopy) = delete;

Make this non-templated, too.

@@ +751,5 @@
>      return Base::Intrinsics::and_(Base::mValue, aVal) & aVal;
>    }
>
>  private:
> +  Atomic(Atomic<T, Order, Recording>& aOther) = delete;

Make this an untemplated copy constructor, too.

@@ +788,1 @@
>    Atomic(Atomic<T*, Order>& aOther) = delete;

And this while you're in the area.

@@ +807,5 @@
>
>    using Base::operator=;
>
>  private:
> +  Atomic(Atomic<T, Order, Recording>& aOther) = delete;

And this.

@@ +858,5 @@
>      return Base::compareExchange(aOldValue, aNewValue);
>    }
>
>  private:
> +  Atomic(Atomic<bool, Order, Recording>& aOther) = delete;

And this. And if there were any others in this file needing changes, please change them.

Feel free to make all these template-tweaks in a distinct rev if you want, but do make them.
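
The reviewer's doubt about templated "copy constructors" can be checked with standard type traits. The following is a minimal sketch, not code from the patch: `TemplatedDelete` and `PlainDelete` are hypothetical stand-ins for a class shaped like `AtomicBase`, with an `int` parameter in place of `MemoryOrdering`. It suggests that deleting only the constructor template leaves same-type copies allowed, which supports the request to use a non-template deleted copy constructor.

```cpp
#include <type_traits>

namespace sketch {

// Hypothetical stand-in for a class such as AtomicBase<T, Order>; only the
// shape of the constructors matters for this illustration.
template <typename T, int Order>
struct TemplatedDelete {
  TemplatedDelete() = default;

  // A constructor template is never a copy constructor, so the compiler still
  // declares the implicit TemplatedDelete(const TemplatedDelete&), and
  // overload resolution prefers that non-template for same-type copies.
  template <int AnyOrder>
  TemplatedDelete(const TemplatedDelete<T, AnyOrder>&) = delete;
};

template <typename T, int Order>
struct PlainDelete {
  PlainDelete() = default;

  // A real (non-template) copy constructor: deleting it suppresses the
  // implicit one.
  PlainDelete(const PlainDelete&) = delete;
};

// Same-type copy of the templated variant is still permitted.
constexpr bool kTemplatedIsCopyable =
    std::is_copy_constructible<TemplatedDelete<int, 0>>::value;

// Same-type copy of the plain variant is rejected.
constexpr bool kPlainIsCopyable =
    std::is_copy_constructible<PlainDelete<int, 0>>::value;

// Copying across different Order values selects the deleted template.
constexpr bool kCrossOrderIsCopyable =
    std::is_constructible<TemplatedDelete<int, 0>,
                          const TemplatedDelete<int, 1>&>::value;

static_assert(kTemplatedIsCopyable, "template deletion does not block copies");
static_assert(!kPlainIsCopyable, "plain deletion blocks copies");
static_assert(!kCrossOrderIsCopyable, "cross-order copies hit the template");

}  // namespace sketch
```

If both behaviours are wanted (no same-type copies and no cross-parameter copies), a class would presumably need both deleted declarations.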

Attachment #8792586 - Flags: review?(jwalden+bmo) → review+

Nathan Froyd [:froydnj]

Comment 280

•

9 years ago

Comment on attachment 8792592 [details] [diff] [review] Part 8f - Ensure that PLD hashtables have consistent iteration order when recording/replaying. Review of attachment 8792592 [details] [diff] [review]: ----------------------------------------------------------------- I like this approach much better. r=me ::: xpcom/glue/PLDHashTable.h @@ +282,1 @@ > // destructor, which the move assignment operator does. This comment doesn't seem entirely correct; a table with an empty mEntryStore, as we're doing below, won't execute any code that depends on mOps or mEntrySize. Ah, I see, we depend on some of these fields in the move assignment operator. Can you rewrite the comment to clarify that?

Attachment #8792592 - Flags: review?(nfroyd) → review+

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Blocks: 1304146

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Blocks: 1304147

Brian Hackett [Laid off!]

Assignee

•

9 years ago

Attached patch Part 5n - Don't perform telemetry while recording or replaying. — Details — Splinter Review

Updated patch per review comments.

Attachment #8790818 - Attachment is obsolete: true

Attachment #8793044 - Flags: review?(gfritzsche)

Nathan Froyd [:froydnj]

Comment 284

•

9 years ago

Comment on attachment 8790726 [details] [diff] [review] Part 1c - Redirections.

Review of attachment 8790726 [details] [diff] [review]:
-----------------------------------------------------------------

First pass: just ProcessRedirect.*, plus one or two other things I happened to notice while trying to figure out how all the pieces work. I didn't review the mechanics of crafting the redirections too closely, or double-check all the opcodes being used, etc.

Block comments in ProcessRedirect.h (and ProcessRedirect.cpp) that talk about what we're doing, why we're doing it, and maybe even some of the mechanics for the different architectures, would be great.

Jan's comments all seem appropriate. Leaving the review open to remind me that I have to look at the OS X and Windows redirections.

::: toolkit/recordreplay/ProcessRedirect.cpp
@@ +29,5 @@
> +
> +static uint8_t*
> +PageStart(uint8_t* aPtr)
> +{
> +  return (uint8_t*)((size_t)aPtr & ~(PageSize - 1));

Nit: uintptr_t instead of size_t.

I would cheer if we used reinterpret_cast and static_cast everywhere appropriate, but given such low-level code, enforcing that restriction here might lead to quite some verbosity. WDYT?

@@ +47,5 @@
> +  DWORD oldProtect;
> +  if (!VirtualProtect(pageStart, pageEnd - pageStart, PAGE_EXECUTE_READWRITE, &oldProtect))
> +    MOZ_CRASH();
> +#else
> +  #error "Unknown platform"

I'm assuming there are appropriate moz.build files or configury magic that ensures we're not compiling this on unsupported platforms?

@@ +132,5 @@
> +  uint8_t* mStart;
> +  uint8_t* mTarget;
> +  bool mShort;
> +};
> +static Vector<JumpPatch> gJumpPatches;

The static global vectors throughout are going to add static constructors to libxul on some platforms (notably Android), which we try to avoid. Could we gather them up into some sort of global data structure that gets allocated prior to the patching process, and then deleted afterwards?

@@ +144,5 @@
> +  patch.mStart = aStart;
> +  patch.mTarget = aTarget;
> +  patch.mShort = aShort;
> +  if (!gJumpPatches.append(patch))
> +    MOZ_CRASH();

These helper functions (here and elsewhere) would be shorter and possibly clearer if they were just:

  if (!gJumpPatches.emplaceBack(aStart, aTarget, aShort)) {
    MOZ_CRASH();
  }

though I guess that requires writing constructors for them...the generated code might be better, too?

@@ +168,5 @@
> +
> +#if defined(XP_MACOSX)
> +#define HAS_CPU_X64
> +#elif defined(WIN32)
> +#define HAS_CPU_X86

How hard is it to support 64-bit Windows in all of this? I don't think we'd want to land this without 64-bit Windows support; we're trying to get to the point where we really recommend 64-bit Firefox to developers, and it's not going to be very nice to have a cool debugging tool that's not supported there.

@@ +184,5 @@
> +  // movq $aTarget, rax
> +  *(aIp++) = 0x40 | (1 << 3);
> +  *(aIp++) = 0xB8;
> +  *(void**)aIp = aTarget;
> +  aIp += 8;

Nit: this looks like dead code.

@@ +213,5 @@
> +
> +  *(aIp++) = 0x66;
> +  *(aIp++) = 0x68;
> +  *(uint16_t*)aIp = ntarget >> 48;
> +  aIp += 2;

All of this is essentially doing:

  pushq $aTarget
  ret

right (assuming we actually had a 64-bit pushq)? Would be good to document, since we've got all these magic opcodes floating about. Might be worth a comment saying that we break this up into 16-bit immediates, as pushing 32-bit immediates would be sign-extended in 64-bit mode and pushed as 64-bit quantities.

@@ +369,5 @@
> +  return -1;
> +}
> +
> +static bool
> +ByteMatch(uint8_t aByte, const char* aStr, ...)

This function is...wow. I think you could get much more idiomatic code if you wrote it with variable-argument template functions, something like:

  // Base case, declared first so the recursive call below can find it.
  bool
  ByteMatch(uint8_t aByte)
  {
    // Nothing to match.
    return false;
  }

  template<typename Head, typename... Tail>
  bool
  ByteMatch(uint8_t aByte, Head aValue, Tail... aMoreValues)
  {
    if (aByte == aValue) {
      return true;
    }
    return ByteMatch(aByte, aMoreValues...);
  }

Admittedly gnarly, but at least this way, we're not parsing strings at runtime...

@@ +402,5 @@
> +  size_t nbytes = 0;
> +  uint8_t nconsumed = 0;
> +
> +  // Watch for prefaces to branch instructions that give a hint to the branch
> +  // predictor. Note that this preface might appear before other instructions

Nit: I think the usual terminology here is "prefixes" and "prefix".

@@ +707,5 @@
> +
> +  // Don't copy call and jump instructions. We should have special cased these.
> +  ud_mnemonic_code_t mnemonic = ud_insn_mnemonic(&ud);
> +  if (mnemonic == UD_Icall || (mnemonic >= UD_Ijo && mnemonic <= UD_Ijmp))
> +    return UnknownInstruction(aName, aIp);

Isn't it more appropriate to assert here, rather than returning something unknown, given the comment above?

@@ +921,5 @@
> +    MakeMemoryAccessible(functionStart, JumpBytesClobberRax);
> +  }
> +
> +  uint8_t* cursor, *end;
> +  AllocateRedirectionBuffer(&cursor, &end);

This leaks the redirection buffer, yes? I see a couple other things that look like deliberate leaks that we're going to have to handle appropriately for our ASan builds, etc.

::: toolkit/recordreplay/ProcessRedirect.h
@@ +32,5 @@
> +  // DLL containing this function.
> +  const char* mDllName;
> +#endif
> +
> +  // Function for which calls should be redirected to targetFunction.

Nit: mTargetFunction.

::: toolkit/recordreplay/ProcessRedirectDarwin.cpp
@@ +392,5 @@
> +
> +  RecordReplayFunction(recvmsg, ssize_t, (aSockFd, aMsg, aFlags));
> +
> +  for (size_t i = 0; i < 2 + initialLengths[1]; i++)
> +    events.CheckInput(initialLengths[i]);

What happens here when PR_AreThreadEventsPassedThrough() is true, and we don't initialize initialLengths?

@@ +472,5 @@
> +
> +static ssize_t
> +RR_mprotect(void* aAddress, size_t aSize, int aFlags)
> +{
> +  // Ignore calls to mprotect while replaying. This function interferes with

Are there security implications from this? Being able to potentially scribble on JIT'd code makes me nervous.

::: toolkit/recordreplay/ProcessRedirectWindows.cpp
@@ +4192,5 @@
> +  FOR_EACH_KERNEL32_REDIRECTION(MAKE_KERNEL32_REDIRECTION_ENTRY)
> +  FOR_EACH_SHELL32_REDIRECTION(MAKE_SHELL32_REDIRECTION_ENTRY)
> +  FOR_EACH_USER32_REDIRECTION(MAKE_USER32_REDIRECTION_ENTRY)
> +  FOR_EACH_DLL_REDIRECTION(MAKE_DLL_REDIRECTION_ENTRY)
> +};

Nit: it looks like the Windows version is missing a sentinel entry.
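
To make the "magic opcodes" remark in the review above concrete, here is a minimal sketch of the push-in-16-bit-pieces-then-ret idiom the reviewer describes. It is an illustration under assumptions, not the patch's code: `EncodePushRetJump` is a hypothetical helper, and only the first chunk (`ntarget >> 48` after bytes `0x66 0x68`) is visible in the quoted hunk; the remaining three chunks are inferred from the reviewer's description.

```cpp
#include <cstdint>
#include <vector>

namespace sketch {

// Hypothetical helper: emit x86-64 machine code that jumps to aTarget without
// clobbering any register, by pushing the address in four 16-bit pieces and
// then executing `ret`.
//
// 0x66 0x68 imm16 is `push imm16` (operand-size prefix + PUSH imm). A plain
// 0x68 imm32 would be sign-extended to 64 bits in 64-bit mode, so it cannot
// be used to build an arbitrary 64-bit address.
inline std::vector<uint8_t> EncodePushRetJump(uint64_t aTarget) {
  std::vector<uint8_t> code;
  // Push the most significant chunk first: the stack grows downward, so the
  // least significant chunk ends up at the lowest address, which yields a
  // little-endian 64-bit return address at the top of the stack.
  for (int shift = 48; shift >= 0; shift -= 16) {
    uint16_t chunk = static_cast<uint16_t>(aTarget >> shift);
    code.push_back(0x66);  // operand-size override prefix
    code.push_back(0x68);  // push imm
    code.push_back(static_cast<uint8_t>(chunk & 0xFF));  // imm16 low byte
    code.push_back(static_cast<uint8_t>(chunk >> 8));    // imm16 high byte
  }
  code.push_back(0xC3);  // ret: pops the assembled address and jumps to it
  return code;
}

}  // namespace sketch
```

The sequence is 17 bytes long, which is presumably why a shorter form that clobbers a register is preferred where that is safe.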

Brian Hackett [Laid off!]

Assignee

Comment 285

•

9 years ago

Attached patch Part 7 - Ensure deterministic interaction of GC with CC and object references. (obsolete) — Details — Splinter Review

Updated patch per review comments.

Attachment #8790827 - Attachment is obsolete: true

Attachment #8793108 - Flags: review?(bugs)

Brian Hackett [Laid off!]

Assignee

Comment 286

•

9 years ago

(In reply to Nick Fitzgerald [:fitzgen] [⏰PDT; UTC-7] from comment #263)
> ::: js/src/threading/posix/Mutex.cpp
> @@ +31,5 @@
> >    if (!platformData_)
> >      oom.crash("js::Mutex::Mutex");
> >
> > +  // There are no JS mutexes which need to have their usage recorded/replayed.
> > +  mozilla::recordreplay::AutoPassThroughThreadEvents pt;
>
> Does this type skip recording of thread events while instantiated? If so,
> then I think you want to put this in the guard rather than the mutex,
> because we have mutexes instantiated during the whole JSRuntime's lifetime.

AutoPassThroughThreadEvents causes thread events to be passed through to the underlying system without being recorded or replayed. If events are being passed through when a mutex is created, though, then no locking events on that mutex will be recorded/replayed. i.e. for a lock event to be recorded both a) the mutex had to have been created when events were not passed through, and b) the lock event itself has to happen when events are not passed through. I can beef up the comment here if you want.
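
The two-part rule in the reply above (a lock is recorded only if the mutex was created outside pass-through and the lock call also happens outside pass-through) can be sketched as follows. This is a toy model under assumptions, not the patch's implementation: everything in the `sketch` namespace, including the `gRecording` vector standing in for the recording, is hypothetical, and the demo is single-threaded.

```cpp
#include <cstddef>
#include <mutex>
#include <string>
#include <utility>
#include <vector>

namespace sketch {

// Hypothetical per-thread nesting depth of pass-through guards.
thread_local int gPassThroughDepth = 0;

inline bool AreThreadEventsPassedThrough() { return gPassThroughDepth > 0; }

// RAII guard modeled on the name used in the thread: while an instance is
// alive on this thread, events go straight to the underlying system.
class AutoPassThroughThreadEvents {
 public:
  AutoPassThroughThreadEvents() { ++gPassThroughDepth; }
  ~AutoPassThroughThreadEvents() { --gPassThroughDepth; }
  AutoPassThroughThreadEvents(const AutoPassThroughThreadEvents&) = delete;
  AutoPassThroughThreadEvents& operator=(const AutoPassThroughThreadEvents&) = delete;
};

// Stand-in for the recording; a real one would be a per-thread event stream.
std::vector<std::string> gRecording;

class RecordedMutex {
 public:
  // Condition (a): remember whether events were passed through at creation.
  explicit RecordedMutex(std::string aName)
      : mName(std::move(aName)), mTracked(!AreThreadEventsPassedThrough()) {}

  void Lock() {
    mMutex.lock();
    // Condition (b): the lock itself must also happen outside pass-through.
    if (mTracked && !AreThreadEventsPassedThrough()) {
      gRecording.push_back("lock " + mName);
    }
  }

  void Unlock() { mMutex.unlock(); }

 private:
  std::mutex mMutex;
  std::string mName;
  bool mTracked;
};

// Exercises all three interesting cases and returns how many lock events
// ended up in the recording.
inline std::size_t DemoRecordedLockCount() {
  gRecording.clear();

  RecordedMutex tracked("tracked");
  tracked.Lock();  // (a) and (b) both hold: recorded
  tracked.Unlock();

  {
    AutoPassThroughThreadEvents pt;
    tracked.Lock();  // (b) fails: not recorded
    tracked.Unlock();
  }

  RecordedMutex* untracked = nullptr;
  {
    AutoPassThroughThreadEvents pt;
    untracked = new RecordedMutex("untracked");  // created under pass-through
  }
  untracked->Lock();  // (a) fails: not recorded
  untracked->Unlock();
  delete untracked;

  return gRecording.size();
}

}  // namespace sketch
```

Under this model, placing the guard in the mutex constructor (as the patch does for js::Mutex) excludes every later lock on that mutex, which seems to be the behaviour the reply describes.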

Brian Hackett [Laid off!]

Assignee

•

9 years ago

Comment on attachment 8790898 [details] [diff] [review] Part 16 - Server side devtools changes.

Review of attachment 8790898 [details] [diff] [review]:
-----------------------------------------------------------------

::: devtools/server/actors/object.js
@@ +115,5 @@
>      let raw = this.obj.unsafeDereference();
>
>      // If Cu is not defined, we are running on a worker thread, where xrays
>      // don't exist.
> +    if (raw && Cu) {

This needs a comment at the unsafeDereference call explaining that, in replay, it can return null or undefined or whatever it returns.

It seems like there are a lot of other calls to unsafeDereference in this file, some of which refer to the raw object outside of a `try` block. Are they not covered by the tests?

@@ +1487,5 @@
>      return true;
>    }
>
>    let raw = obj.unsafeDereference();
> +  if (raw) {

Similarly.

@@ +1826,5 @@
>      let items = grip.preview.items = [];
>
>      let i = 0;
>      for (let key of keys) {
> +      if (rawObj && rawObj.hasOwnProperty(key) && i++ < OBJECT_PREVIEW_MAX_ITEMS) {

It seems like you could put the `if (rawObj)` around the entire `for` loop.

::: devtools/server/actors/script.js
@@ +471,5 @@
>      return this._dbg;
>    },
>
>    get globalDebugObject() {
> +    if (!this._parent.window || this.dbg.replaying) {

How does this affect uses of globalDebugObject like the one here?

http://searchfox.org/mozilla-central/rev/c31ad35f39c6187b2e121aa6d3a39b7f67397010/devtools/server/actors/script.js#1132

@@ +934,5 @@
>      if (stepFrame) {
>        switch (steppingType) {
>          case "step":
> +          if (rewinding) {
> +            this.dbg.onPopFrame = onEnterFrame;

I sort of get it, but it would be nice to have a comment about how rewinding is being handled here.

Attachment #8790898 - Flags: review?(jimb) → review+

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 293

•

9 years ago

Comment on attachment 8793108 [details] [diff] [review] Part 7 - Ensure deterministic interaction of GC with CC and object references.

> nsWrapperCache::CheckCCWrapperTraversal(void* aScriptObjectHolder,
>                                         nsScriptObjectTracer* aTracer)
> {
>-  JSObject* wrapper = GetWrapper();
>+  JSObject* wrapper = GetWrapperPreserveColorForGC();

This is not GC related code. So, I don't think you need any change here.

> class nsWrapperCache
> {
> public:
>   NS_DECLARE_STATIC_IID_ACCESSOR(NS_WRAPPERCACHE_IID)
>
>   nsWrapperCache() : mWrapper(nullptr), mFlags(0)
>   {
>+    mozilla::recordreplay::RegisterWeakPointer(this);

Ok, the method is inline and doesn't do much if IsRecordingOrReplaying() returns false. Some microbenchmarking will be needed anyhow.

>+  /**
>+   * Get the cached wrapper.
>+   *
>+   * As for GetWrapperPreserveColor, this getter does not change the color of
>+   * the JSObject. Additionally, when recording or replaying it might not
>+   * behave consistently between the two executions. Because this does not
>+   * interact with the recording, it is the only getter which should be used
>+   * during a GC.
>+   */

I don't understand the setup here; in particular, I don't understand how you guarantee this is used during GC but not elsewhere, since the same code paths, which then end up calling this or the non-ForGC case, might be used in some helper methods. FWIW, my guess is that this all will be broken as soon as it lands, since it is hard to understand when one is supposed to do what. We need more documentation on this stuff, and _tests_.

 static void
 sandbox_finalize(js::FreeOp* fop, JSObject* obj)
 {
     nsIScriptObjectPrincipal* sop =
         static_cast<nsIScriptObjectPrincipal*>(xpc_GetJSPrivate(obj));
     if (!sop) {
         // sop can be null if CreateSandboxObject fails in the middle.
         return;
     }

     static_cast<SandboxPrivate*>(sop)->ForgetGlobalObject();
-    NS_RELEASE(sop);
+    if (recordreplay::IsRecordingOrReplaying()) {
+        // Trigger a later call to RecordReplayReleaseSandbox.
+        recordreplay::ActivateTrigger(sop);
+    } else {
+        // Release the principal reference immediately.
+        NS_RELEASE(sop);
+    }

Oh, this stuff looks buggy to me, I mean even without web replay. Could you file a bug to make sandbox_finalize release asynchronously? CC :gabor and :bholley to that bug.

>+// When recording or replaying we can't directly release references on the
>+// sandbox principal during GC finalization (since finalization happens at
>+// non-deterministic points and destroying the principal can interact with the
>+// recording). Use the record/replay trigger mechanism to release the principal
>+// reference at a consistent point, similar to what is done with deferred
>+// finalizers.

So, I think we need to fix sandbox finalization in general, and that will hopefully simplify this patch.

 void
 CycleCollectedJSRuntime::JSObjectsTenured()
 {
   MOZ_ASSERT(mJSContext);

   for (auto iter = mNurseryObjects.Iter(); !iter.Done(); iter.Next()) {
     nsWrapperCache* cache = iter.Get();
-    JSObject* wrapper = cache->GetWrapperPreserveColor();
-    MOZ_ASSERT(wrapper);
+    JSObject* wrapper = cache->GetWrapperPreserveColorForGC();
+    MOZ_ASSERT_IF(!recordreplay::IsReplaying(), wrapper);

So is recordreplay::IsReplaying() threadsafe?

So, I think it is better to fix sandbox handling in a separate bug, and that would simplify this stuff. Other than that, rs+, I guess, but since this patch shouldn't land, marking still r-.
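
The "trigger" idea discussed in this review (mark work as pending at a non-deterministic point such as a finalizer, then run it later at a point that is the same when recording and replaying) can be sketched roughly as below. This is a simplified illustration under assumptions, not the patch's implementation: `TriggerRegistry` and its methods are hypothetical, it is single-threaded, and it omits the part a real system would presumably need, namely recording which triggers fired so that replay can repeat the same set.

```cpp
#include <cstdint>
#include <functional>
#include <map>
#include <utility>
#include <vector>

namespace sketch {

class TriggerRegistry {
 public:
  // Associates a callback with an object. Registration is assumed to happen
  // at a deterministic point (e.g. in a constructor).
  void Register(const void* aObj, std::function<void()> aCallback) {
    mTriggers[aObj] = Entry{mNextId++, std::move(aCallback), false};
  }

  void Unregister(const void* aObj) { mTriggers.erase(aObj); }

  // May be called at a non-deterministic point (e.g. during GC
  // finalization). It only sets a flag; no observable work happens here.
  void Activate(const void* aObj) {
    auto it = mTriggers.find(aObj);
    if (it != mTriggers.end()) {
      it->second.mActive = true;
    }
  }

  // Called at a deterministic point. Runs pending callbacks in registration
  // order rather than pointer order, since addresses can differ between runs.
  void ExecuteTriggers() {
    std::map<uint64_t, std::function<void()>> ready;
    for (auto& kv : mTriggers) {
      if (kv.second.mActive) {
        kv.second.mActive = false;
        ready[kv.second.mId] = kv.second.mCallback;
      }
    }
    for (auto& kv : ready) {
      kv.second();
    }
  }

 private:
  struct Entry {
    uint64_t mId;
    std::function<void()> mCallback;
    bool mActive;
  };

  std::map<const void*, Entry> mTriggers;
  uint64_t mNextId = 0;
};

// Demo: a "release" requested from a finalizer-like context is deferred until
// ExecuteTriggers() runs, and runs exactly once. Returns the observed counts.
inline std::vector<int> DemoDeferredRelease() {
  TriggerRegistry registry;
  int released = 0;
  int principal = 0;  // stand-in for the object whose reference is released

  registry.Register(&principal, [&released] { ++released; });

  std::vector<int> observed;
  registry.Activate(&principal);  // e.g. inside a finalizer
  observed.push_back(released);   // still 0: nothing ran yet
  registry.ExecuteTriggers();     // deterministic point
  observed.push_back(released);   // now 1
  registry.ExecuteTriggers();     // nothing pending
  observed.push_back(released);   // still 1
  return observed;
}

}  // namespace sketch
```

The same shape would apply to the reviewer's suggestion of releasing the sandbox principal asynchronously even outside record/replay: the finalizer only requests the release, and a later, well-defined point performs it.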

Attachment #8793108 - Flags: review?(bugs) → review-

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Depends on: 1306280

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Depends on: 1306281

Brian Hackett [Laid off!]

Assignee

Comment 294

•

Updated

•

9 years ago

Attachment #8792583 - Flags: review?(wmccloskey)

Brian Hackett [Laid off!]

Assignee

Comment 304

•

9 years ago

Comment on attachment 8790843 [details] [diff] [review] Part 9a - PReplay protocol and parent side implementation. Canceling per comment 303.

Attachment #8790843 - Flags: review?(wmccloskey)

Attachment #8790843 - Flags: review?(nical.bugzilla)

Attachment #8790843 - Flags: review?(bas)

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Attachment #8790854 - Flags: review?(wmccloskey)

Attachment #8790854 - Flags: review?(nical.bugzilla)

Attachment #8790854 - Flags: review?(bas)

Bill McCloskey [inactive unless it's an emergency] (:billm)

Comment 305

•

9 years ago

Thanks!

Nathan Froyd [:froydnj]

Comment 306

•

9 years ago

Comment on attachment 8790726 [details] [diff] [review] Part 1c - Redirections.

Review of attachment 8790726 [details] [diff] [review]:
-----------------------------------------------------------------

Apologies for the delay; I've looked at this several times over the past week, but each time I come to the conclusion that reviewing the record/replay wrappers themselves is not terribly useful. The code works, after all, and it's not like you can say, "Oh, you should take this wrapper out"--it's obviously there for a reason.

I'd echo Bill's comments on other patches: ProcessRedirect.h could use a lot more commentary about the pieces it defines and the macros provided therein, maybe with a bit of guidance on "here's what to do if you have to wrap a previously unwrapped function." I don't have much substantive to offer here except for some thoughts on how the machinery could be made friendlier.

::: toolkit/recordreplay/ProcessRedirect.h
@@ +139,5 @@
> +    return rrf.mRval; \
> +  rrf.StartRecordReplay(); \
> +  File& events = rrf.mThread->mEvents; \
> +  (void) events; \
> +  aReturnType& rval = rrf.mRval

I wonder if it's just as easy to get aReturnType from decltype(aName) here. Similarly for argument types, if we have to pass them in.

@@ +142,5 @@
> +  (void) events; \
> +  aReturnType& rval = rrf.mRval
> +
> +#define RecordReplayFunctionABI(aName, aReturnType, aABI, aActuals) \
> +  RecordReplayFunctionFormals(aName, aReturnType, aABI, (...), aActuals)

varargs here actually works? Ugh.

@@ +144,5 @@
> +
> +#define RecordReplayFunctionABI(aName, aReturnType, aABI, aActuals) \
> +  RecordReplayFunctionFormals(aName, aReturnType, aABI, (...), aActuals)
> +
> +#define RecordReplayFunction(aName, aReturnType, aActuals) \

It was a little confusing reading code and realizing that this had to be a macro, because various locals were springing to life without being declared. I'm curious whether all of these macros could somehow be rewritten using better C++ features (lambdas?).

@@ +166,5 @@
> +#define RecordReplayFunctionVoid(aName, aActuals) \
> +  RecordReplayFunctionVoidABI(aName, DEFAULTABI, aActuals)
> +
> +// Note: Do not use any of the macros below when the redirected function takes
> +// arguments that are not scalar values. Use RecordReplayFunction directly.

What are "scalar values" in this context? Integers only?

@@ +169,5 @@
> +// Note: Do not use any of the macros below when the redirected function takes
> +// arguments that are not scalar values. Use RecordReplayFunction directly.
> +
> +// The following macros are used for functions that return a scalar value and
> +// do not record an error anywhere (i.e. with errno or SetLastError).

Can we static_assert that the function returns a scalar value (e.g. via decltype) and that it has scalar parameters, at least?

@@ +176,5 @@
> +  static size_t DEFAULTABI \
> +  RR_ ##aName () \
> +  { \
> +    RecordReplayFunction(aName, size_t, ()); \
> +    events.ReadOrWriteValue(&rval); \

I wonder if RecordOrReplayValue might be a better name for this. (And similarly for what's currently called ReadOrWriteBytes.)

::: toolkit/recordreplay/ProcessRedirectDarwin.cpp
@@ +907,5 @@
> +// pthreads redirections
> +///////////////////////////////////////////////////////////////////////////////
> +
> +static ssize_t
> +RR_pthread_cond_init(pthread_cond_t* aCond, const pthread_condattr_t* aAttr)

Shouldn't this return the same size thing as pthread_cond_init, and similarly for all the other functions below? I guess it probably doesn't matter thanks to the ABI details.
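
Two of the suggestions in the review above (derive the return type from `decltype(aName)`, and `static_assert` that return and parameter types are scalar) can be sketched with standard type traits. This is only an illustration under assumptions: `FunctionTraits`, `AllTrue` and the `Hypothetical*` functions are invented for the example and are not part of the patch. Note that `std::is_scalar` also accepts pointers and enums, which may be broader than what the macros' comment means by "scalar values".

```cpp
#include <cstddef>
#include <type_traits>

namespace sketch {

// C++11-compatible conjunction over a pack of bools.
template <bool... Bs>
struct AllTrue;

template <>
struct AllTrue<> : std::true_type {};

template <bool B, bool... Rest>
struct AllTrue<B, Rest...>
    : std::integral_constant<bool, B && AllTrue<Rest...>::value> {};

// Extracts the return type of a plain function type and checks whether the
// return type and every parameter type are scalar.
template <typename Fn>
struct FunctionTraits;

template <typename R, typename... Args>
struct FunctionTraits<R(Args...)> {
  using ReturnType = R;
  static constexpr bool kAllScalar =
      std::is_scalar<R>::value && AllTrue<std::is_scalar<Args>::value...>::value;
};

// Hypothetical functions standing in for redirected system calls. They are
// only declared, because they are used in unevaluated contexts.
std::size_t HypotheticalGetPageSize();
int HypotheticalClose(int aFd);
struct HypotheticalStat { long mSize; };
HypotheticalStat HypotheticalGetStat(int aFd);

// The return type comes from decltype of the function name, so a macro would
// not need a separate aReturnType argument.
constexpr bool kCloseReturnsInt =
    std::is_same<FunctionTraits<decltype(HypotheticalClose)>::ReturnType,
                 int>::value;

constexpr bool kPageSizeAllScalar =
    FunctionTraits<decltype(HypotheticalGetPageSize)>::kAllScalar;
constexpr bool kCloseAllScalar =
    FunctionTraits<decltype(HypotheticalClose)>::kAllScalar;
constexpr bool kGetStatAllScalar =
    FunctionTraits<decltype(HypotheticalGetStat)>::kAllScalar;

// A scalar-only wrapper macro could contain a check like this one.
static_assert(kCloseAllScalar, "scalar-only wrapper used on non-scalar function");

}  // namespace sketch
```

Functions with non-default calling conventions (such as `__stdcall` on 32-bit Windows) would need additional partial specializations, which this sketch leaves out.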

Attachment #8790726 - Flags: review?(nfroyd) → feedback+

Nicolas Silva [:nical]

Comment 307

•

9 years ago

Comment on attachment 8790847 [details] [diff] [review] Part 9b - Handle separate PCompositorChild used in middleman processes. Review of attachment 8790847 [details] [diff] [review]: ----------------------------------------------------------------- ::: dom/ipc/TabChild.cpp @@ +2779,5 @@ > sTabChildren->Put(aLayersId, this); > mLayersId = aLayersId; > } > > + if (!PR_IsMiddleman()) Please brace the if blocks.

Attachment #8790847 - Flags: review?(nical.bugzilla) → review+

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Depends on: 1309552

Brian Hackett [Laid off!]

Assignee

Comment 308

•

9 years ago

Attached patch patch — Details — Splinter Review

Updated rolled up patch. This is larger than the last patch but the record/replay code is structured much better now and has about twice as many comments.

Attachment #8792581 - Attachment is obsolete: true

Brian Hackett [Laid off!]

Assignee

Comment 309

•

9 years ago

Attached patch Part 1a - Public record/replay API. — Details — Splinter Review

This patch has the public record/replay API and its linkage strangeness.

Attachment #8792583 - Attachment is obsolete: true

Attachment #8800305 - Flags: review?(wmccloskey)

Attachment #8800305 - Flags: review?(nfroyd)

Brian Hackett [Laid off!]

Assignee

Comment 310

•

9 years ago

Attached patch Part 1c - Record/replay utilities. — Details — Splinter Review

This patch has some simple utility classes which do not directly interact with the record/replay infrastructure but which the infrastructure needs.

Attachment #8790726 - Attachment is obsolete: true

Attachment #8800306 - Flags: review?(wmccloskey)

Brian Hackett [Laid off!]

Assignee

Comment 311

•

9 years ago

Attached patch Part 1f - Execution recording/replaying. — Details — Splinter Review

This patch has the core infrastructure needed to record/replay an execution.

Attachment #8800311 - Flags: review?(wmccloskey)

Brian Hackett [Laid off!]

Assignee

Comment 312

•

9 years ago

Attached patch Part 1g - Execution rewinding. — Details — Splinter Review

This patch has almost all the infrastructure needed for rewinding an execution (there is also some in Thread.cpp in part 1f).

Attachment #8800312 - Flags: review?(wmccloskey)

Brian Hackett [Laid off!]

Assignee

Comment 313

•

9 years ago

Attached patch Part 1h - Redirections infrastructure. — Details — Splinter Review

This patch has the infrastructure needed for redirecting functions. The redirection header is better documented, the redirection macros have changed somewhat (it turns out that using varargs on windows doesn't work: msvc treats __stdcall varargs function pointers as __cdecl), and the actual redirection code is cleaner now.

Attachment #8800313 - Flags: review?(nfroyd)

Brian Hackett [Laid off!]

Assignee

Comment 314

•

9 years ago

Attached patch Part 1i - Platform specific redirections. — Details — Splinter Review

The actual platform specific function redirections. This hasn't changed much from the last version of this code, other than some new windows redirections, and doesn't need an in depth review I think.

Attachment #8800318 - Flags: review?(nfroyd)

Brian Hackett [Laid off!]

Assignee

Comment 315

•

9 years ago

Attached patch Part 7 - Ensure deterministic interaction of GC with CC and object references. — Details — Splinter Review

This is a little cleaner now that RegisterTrigger takes an std::function.

Attachment #8796191 - Attachment is obsolete: true

Attachment #8800320 - Flags: review?(bugs)

Brian Hackett [Laid off!]

Assignee

Comment 316

•

9 years ago

Attached patch Part 9a - PReplay protocol and parent side implementation. — Details — Splinter Review

This patch has the middleman process side of IPC with the replaying process, as before, though the compositor/graphics logic is now abstracted into separate files/patches.

Attachment #8800322 - Flags: review?(wmccloskey)

Brian Hackett [Laid off!]

Assignee

Comment 317

•

9 years ago

Attached patch Part 9e - Parent side of compositor management. — Details — Splinter Review

The middleman process side of compositor IPDL marshaling.

Attachment #8790843 - Attachment is obsolete: true

Attachment #8800327 - Flags: review?(nical.bugzilla)

Brian Hackett [Laid off!]

Assignee

Comment 318

•

9 years ago

Attached patch Part 9f - Parent side of DrawTarget management. — Details — Splinter Review

DrawEvent translation and DrawTarget rendering performed in the middleman process.

Attachment #8800328 - Flags: review?(bas)

Brian Hackett [Laid off!]

Assignee

•

9 years ago

Attached patch Part 10i - Child side of compositor management. — Details — Splinter Review

This patch has the replaying process side of compositor state management.

Attachment #8800338 - Flags: review?(nical.bugzilla)

Brian Hackett [Laid off!]

Assignee

Comment 323

•

9 years ago

Attached patch Part 10j - Child side of DrawTarget management. — Details — Splinter Review

This patch has the child side of graphics state management, for keeping track of draw targets and sending draw events to the middleman process.

Attachment #8800339 - Flags: review?(bas)

Brian Hackett [Laid off!]

Assignee

Comment 324

•

9 years ago

Attached patch Part 14f - Add DrawTargetRecordReplay. — Details — Splinter Review

This is similar to the last version of this patch, but has better documentation.

Attachment #8790888 - Attachment is obsolete: true

Attachment #8790888 - Flags: review?(bas)

Attachment #8800342 - Flags: review?(bas)

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 325

•

9 years ago

Comment on attachment 8800320 [details] [diff] [review]
Part 7 - Ensure deterministic interaction of GC with CC and object references.

So, how does this all work in worker threads? Is recording disabled there or are workers disabled when recording?

Attachment #8800320 - Flags: review?(bugs) → review+

Brian Hackett [Laid off!]

Assignee

Comment 326

•

9 years ago

(In reply to Olli Pettay [:smaug] from comment #325)
> Comment on attachment 8800320 [details] [diff] [review]
> Part 7 - Ensure deterministic interaction of GC with CC and object
> references.
>
> So, how does this all work in worker threads? Is recording disabled there or
> are workers disabled when recording?

For now, DOM workers are disabled when recording or replaying.

Brian Hackett [Laid off!]

Assignee

Updated

•

9 years ago

Blocks: 1310271

Bill McCloskey [inactive unless it's an emergency] (:billm)

Comment 327

•

9 years ago

Comment on attachment 8800305 [details] [diff] [review]
Part 1a - Public record/replay API.

Review of attachment 8800305 [details] [diff] [review]:
-----------------------------------------------------------------

::: mfbt/RecordReplay.h
@@ +79,5 @@
> +// Whether the current process is a middleman between a replaying process and
> +// chrome process.
> +static inline bool IsMiddleman() { return gIsMiddleman; }
> +
> +// Mark a region which occurs atomically wrt the recording. No two threads can

Do these functions enforce the fact that the region is atomic, or merely note it for replay?

@@ +192,5 @@
> +static inline void RemoveHashTableItem(void* aTable, void* aThing);
> +static inline void RemoveAllHashTableItems(void* aTable);
> +static inline void MoveHashTableItems(void* aFirstTable, void* aSecondTable);
> +static inline void SortHashTableItems(void* aTable, void** aArray, size_t aCount);
> +MFBT_API void GetOrderedHashTableItems(void* aTable, void** aArray, size_t aCount);

What's the difference between Sort and GetOrdered?

@@ +202,5 @@
> +// not exposed here.)
> +static inline const PLDHashTableOps* GeneratePLDHashTableCallbacks(const PLDHashTableOps* aOps);
> +static inline const PLDHashTableOps* UnwrapPLDHashTableCallbacks(const PLDHashTableOps* aOps);
> +static inline void DestroyPLDHashTableCallbacks(const PLDHashTableOps* aOps);
> +static inline void MovePLDHashTableContents(const PLDHashTableOps* aFirstOps,

What does this do?

@@ +290,5 @@
> +// void DestroyContents();
> +// };
> +MFBT_API void RegisterTrigger(void* aObj, std::function<void()> aCallback);
> +MFBT_API void UnregisterTrigger(void* aObj);
> +MFBT_API void ActivateTrigger(void* aObj);

Could we call this EnableTrigger instead? Activate makes it sound like it will run right away.

@@ +322,5 @@
> +//
> +// The callback passed to NotifyUnrecordedWait will be invoked at most once
> +// by the main thread whenever the main thread is waiting for other threads to
> +// become idle, and at most once after the call to NotifyUnrecordedWait if the
> +// main thread is already waiting for other threads to become idle.

I'm still confused when the callback is called. How do you know when you should no longer call the callback? What if the callback references state that will be deleted at some point in the future? An example or something would really help here.

@@ +368,5 @@
> +// FinishReplaying has been called.
> +MFBT_API bool LastSnapshotIsFinal();
> +
> +// Return whether the last snapshot encountered is an interim one. If the
> +// middleman asked us to rewind to snapshot N, but we did not record N but

What does it mean for a snapshot not to be recorded? Why would you take a snapshot without recording it? How could the middleman reference such a snapshot?

Attachment #8800305 - Flags: review?(wmccloskey) → review+

Brian Hackett [Laid off!]

Assignee

Comment 328

•

9 years ago

(In reply to Bill McCloskey (:billm) from comment #327)
> Comment on attachment 8800305 [details] [diff] [review]
> Part 1a - Public record/replay API.
>
> Review of attachment 8800305 [details] [diff] [review]:
> -----------------------------------------------------------------
>
> ::: mfbt/RecordReplay.h
> @@ +79,5 @@
> > +// Whether the current process is a middleman between a replaying process and
> > +// chrome process.
> > +static inline bool IsMiddleman() { return gIsMiddleman; }
> > +
> > +// Mark a region which occurs atomically wrt the recording. No two threads can
>
> Do these functions enforce the fact that the region is atomic, or merely
> note it for replay?

These functions enforce that the region is atomic, but will only do so when recording or replaying. Otherwise they are no-ops.

> @@ +192,5 @@
> > +static inline void RemoveHashTableItem(void* aTable, void* aThing);
> > +static inline void RemoveAllHashTableItems(void* aTable);
> > +static inline void MoveHashTableItems(void* aFirstTable, void* aSecondTable);
> > +static inline void SortHashTableItems(void* aTable, void** aArray, size_t aCount);
> > +MFBT_API void GetOrderedHashTableItems(void* aTable, void** aArray, size_t aCount);
>
> What's the difference between Sort and GetOrdered?

SortHashTableItems sorts an existing array, whereas GetOrderedHashTableItems allocates an array whose contents are sorted. I'll update the comment.

> @@ +202,5 @@
> > +// not exposed here.)
> > +static inline const PLDHashTableOps* GeneratePLDHashTableCallbacks(const PLDHashTableOps* aOps);
> > +static inline const PLDHashTableOps* UnwrapPLDHashTableCallbacks(const PLDHashTableOps* aOps);
> > +static inline void DestroyPLDHashTableCallbacks(const PLDHashTableOps* aOps);
> > +static inline void MovePLDHashTableContents(const PLDHashTableOps* aFirstOps,
>
> What does this do?

We ensure that PLDHashTables behave consistently during iteration by generating custom ops for each table that produce the same hash numbers between recording and replay. Each of these custom ops wraps the original ops for the table, and these functions are used for managing the custom ops. I'll update the comment to describe in more detail what these APIs are doing.

> @@ +322,5 @@
> > +//
> > +// The callback passed to NotifyUnrecordedWait will be invoked at most once
> > +// by the main thread whenever the main thread is waiting for other threads to
> > +// become idle, and at most once after the call to NotifyUnrecordedWait if the
> > +// main thread is already waiting for other threads to become idle.
>
> I'm still confused when the callback is called. How do you know when you
> should no longer call the callback? What if the callback references state
> that will be deleted at some point in the future? An example or something
> would really help here.

Once NotifyUnrecordedWait has been called, the callback can be invoked any number of times, at any point in the future. This API is used by threads that are servicing some sort of event loop (be it posted runnables, incoming/outgoing data on a pipe, JS helper thread tasks, ...) and that will never terminate when replaying.

> @@ +368,5 @@
> > +// FinishReplaying has been called.
> > +MFBT_API bool LastSnapshotIsFinal();
> > +
> > +// Return whether the last snapshot encountered is an interim one. If the
> > +// middleman asked us to rewind to snapshot N, but we did not record N but
>
> What does it mean for a snapshot not to be recorded? Why would you take a
> snapshot without recording it? How could the middleman reference such a
> snapshot?

Hmm, the terminology here is pretty bad. Right now there are two related concepts:

Snapshot: a point in execution which has been assigned an identifier.

Recorded snapshot: A snapshot where we have captured enough information that we can restore the process' state at this point sometime in the future.

I think it would be better to rename 'snapshot' to 'checkpoint', and 'recorded snapshot' to just plain 'snapshot'. Does this make more sense? (The existing terminology dates back to the time when all checkpoints/snapshots were recorded so there wasn't any need to distinguish the two.)

Anyways, using this new terminology, whenever the replaying process reaches a checkpoint it notifies the middleman, so that the middleman knows what happens between each checkpoint (and can, say, compute how many times each JS debugger breakpoint was hit). When the JS debugger (or whichever other client exists in the middleman) wants the replaying process to rewind, it asks the replaying process to go backwards one checkpoint at a time. If the checkpoint it wants to rewind to does not have an associated snapshot, the replaying process rewinds to the previous snapshot and runs forward to the desired checkpoint from there. All the checkpoints hit before the desired checkpoint is reached are interim ones.
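To make the checkpoint/snapshot distinction concrete, here is a minimal sketch of the rewinding scheme described above. It is only a model, not the code in the patches: the `Replayer` class, its members, the fixed snapshot interval, and the integer standing in for process state are all hypothetical. It also assumes that "interim" means the checkpoints passed while running forward from the restored snapshot, excluding the target itself.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <vector>

// Hypothetical model of a replaying process. A checkpoint is an identified
// point in execution; a snapshot is a checkpoint at which the state was saved.
class Replayer {
public:
  // aSnapshotInterval is an assumption: save a snapshot every Nth checkpoint.
  explicit Replayer(size_t aSnapshotInterval) : mInterval(aSnapshotInterval) {
    assert(mInterval > 0);
    mSnapshots[0] = 0;  // The initial state always counts as a snapshot.
  }

  size_t Checkpoint() const { return mCheckpoint; }
  int State() const { return mState; }

  // Run forward to the next checkpoint, saving a snapshot if one is due.
  void RunToNextCheckpoint() {
    mState += 10;  // Stand-in for deterministic execution.
    mCheckpoint++;
    if (mCheckpoint % mInterval == 0) {
      mSnapshots[mCheckpoint] = mState;
    }
  }

  // Rewind to aTarget and return the interim checkpoints hit on the way.
  std::vector<size_t> RewindTo(size_t aTarget) {
    assert(aTarget <= mCheckpoint);
    // Restore the latest snapshot at or before the target checkpoint.
    auto iter = mSnapshots.upper_bound(aTarget);
    --iter;
    mCheckpoint = iter->first;
    mState = iter->second;
    // Run forward; checkpoints passed before the target are interim ones.
    std::vector<size_t> interim;
    while (mCheckpoint < aTarget) {
      RunToNextCheckpoint();
      if (mCheckpoint < aTarget) {
        interim.push_back(mCheckpoint);
      }
    }
    return interim;
  }

private:
  size_t mInterval;
  size_t mCheckpoint = 0;
  int mState = 0;
  std::map<size_t, int> mSnapshots;  // checkpoint id -> saved state
};
```

With a snapshot every 4 checkpoints, rewinding from checkpoint 10 to checkpoint 7 in this model restores the snapshot at 4 and reports 5 and 6 as interim checkpoints. Rewinding to a checkpoint that has its own snapshot reports none.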

Bill McCloskey [inactive unless it's an emergency] (:billm)

Comment 329

•

9 years ago

(In reply to Brian Hackett (:bhackett) from comment #328)
> I think it would be better to rename 'snapshot' to 'checkpoint', and
> 'recorded snapshot' to just plain 'snapshot'. Does this make more sense?
> (The existing terminology dates back to the time when all
> checkpoints/snapshots were recorded so there wasn't any need to distinguish
> the two.)

Yes, I like this much better.

Bill McCloskey [inactive unless it's an emergency] (:billm)

Comment 330

•

9 years ago

Comment on attachment 8800306 [details] [diff] [review]
Part 1c - Record/replay utilities.

Review of attachment 8800306 [details] [diff] [review]:
-----------------------------------------------------------------

I'm not too happy about these data structures since they mostly duplicate stuff we already have. I can understand the need for spinlocks when you don't want to call into the OS. Maybe, as I get further into the patches and see how this stuff is used, I'll feel better about it.

::: toolkit/recordreplay/ChunkAllocator.h
@@ +19,5 @@
> +// cost of O(n) lookup.
> +//
> +// ChunkAllocator contents are never destroyed.
> +template <typename T>
> +class ChunkAllocator

Can this be marked MOZ_ONLY_USED_TO_AVOID_STATIC_CONSTRUCTORS?

@@ +25,5 @@
> + // A page sized block holding a next pointer and an array of as many things
> + // as possible.
> + struct Chunk
> + {
> + uint8_t mStorage[PageSize - sizeof(Chunk*)];

It would be nicer to use AlignedStorage for this.

@@ +67,5 @@
> +
> + // Create a new entry with the specified ID. This must not be called on IDs
> + // that have already been used with this allocator.
> + inline T* Create(size_t aId) {
> + if (aId < mCapacity) {

This isn't really safe. You're reading from mCapacity without a lock, and it could be on a different cache line from the chunk itself. So you could end up getting a more recent value for mCapacity than you do for the chunk data (and, crucially, the chunk next pointer). In that case you'll crash. I think this will be safe if you use a release-acquire Atomic for mCapacity. I'd feel a lot better if you just used the spinlock around the whole thing though. Is it really that important to be lockless?

::: toolkit/recordreplay/InfallibleVector.h
@@ +91,5 @@
> +template<typename T,
> + size_t MinInlineCapacity = 0,
> + class AllocPolicy = MallocAllocPolicy>
> +class InfallibleVector
> + : public InfallibleVectorOperations<InfallibleVector<T, MinInlineCapacity, AllocPolicy>,

Instead of doing this, could you just use a normal vector with an infallible alloc policy? http://searchfox.org/mozilla-central/source/memory/mozalloc/mozalloc.h#289

@@ +105,5 @@
> +
> +template<typename T,
> + size_t MinInlineCapacity = 0,
> + class AllocPolicy = MallocAllocPolicy>
> +class StaticInfallibleVector

This is kind of nice, but it seems like we'll leak the memory on shutdown. I'm not sure if that will cause problems for LSan.

::: toolkit/recordreplay/Monitor.h
@@ +14,5 @@
> +
> +// Simple wrapper around a PRLock and PRCondVar. This is a lighter weight
> +// abstraction than mozilla::Monitor and has simpler interactions with the
> +// record/replay system.
> +class Monitor

This seems pretty similar to mozilla::Monitor except that it lacks deadlock detection and it does this pass-through thing. It seems like deadlock detection would be deterministic. Why not use mozilla::Monitor?

::: toolkit/recordreplay/SpinLock.h
@@ +25,5 @@
> +// locking APIs. These locks are used in places where reentrance into APIs
> +// needs to be avoided, or where writes to heap memory are not allowed.
> +
> +// A basic spin lock.
> +class SpinLock

Can this be marked MOZ_ONLY_USED_TO_AVOID_STATIC_CONSTRUCTORS?

@@ +32,5 @@
> + inline void Lock();
> + inline void Unlock();
> +
> +private:
> + int32_t mLocked;

This should be Atomic with release-acquire semantics.

@@ +37,5 @@
> +};
> +
> +// A basic read/write spin lock. This lock permits either multiple readers and
> +// no writers, or one writer.
> +class ReadWriteSpinLock

MOZ_ONLY_USED_TO_AVOID_STATIC_CONSTRUCTORS?

@@ +147,5 @@
> + AutoSpinLock ex(mLock);
> + done = aRead ? (mReaders != -1) : (mReaders == 0);
> + if (done) {
> + mReaders = aRead ? (mReaders + 1) : -1;
> + }

Maybe ThreadYield outside of the spin lock if you fail to take the lock?
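For reference, the release-acquire suggestion for SpinLock can be sketched roughly as below. This is an illustration under assumptions, not the code in the patch: it uses standard `std::atomic` rather than `mozilla::Atomic`, the `TryLock` method is added only for the example, and it yields with `std::this_thread::yield()`, which a real record/replay lock might be unable to use if it must avoid calling into redirected system APIs.

```cpp
#include <atomic>
#include <cassert>
#include <thread>

// Sketch of a spin lock whose flag uses acquire/release ordering.
class SpinLock {
public:
  // Try to take the lock once; returns whether it was acquired.
  bool TryLock() {
    // Acquire: reads after this see writes made before the matching Unlock().
    return !mLocked.exchange(true, std::memory_order_acquire);
  }

  void Lock() {
    while (!TryLock()) {
      // Yield instead of burning the CPU while another thread holds the lock.
      std::this_thread::yield();
    }
  }

  void Unlock() {
    // Release: writes made while holding the lock are published to the next
    // thread that acquires it.
    mLocked.store(false, std::memory_order_release);
  }

private:
  std::atomic<bool> mLocked{false};
};

// RAII guard in the spirit of the AutoSpinLock used in the patch.
class AutoSpinLock {
public:
  explicit AutoSpinLock(SpinLock& aLock) : mLock(aLock) { mLock.Lock(); }
  ~AutoSpinLock() { mLock.Unlock(); }
  AutoSpinLock(const AutoSpinLock&) = delete;
  AutoSpinLock& operator=(const AutoSpinLock&) = delete;

private:
  SpinLock& mLock;
};
```

Because `std::atomic<bool>` is constant-initialized here, a global `SpinLock` of this shape should not need a static constructor. That property is what the MOZ_ONLY_USED_TO_AVOID_STATIC_CONSTRUCTORS question above is about.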

Attachment #8800306 - Flags: review?(wmccloskey)

Bill McCloskey [inactive unless it's an emergency] (:billm)

Comment 331

•

9 years ago

Comment on attachment 8800313 [details] [diff] [review]
Part 1h - Redirections infrastructure.

Review of attachment 8800313 [details] [diff] [review]:
-----------------------------------------------------------------

This all seems much nicer than the previous version, thanks for the revisions. Apologies for the delays in reviewing all of these.

::: toolkit/recordreplay/CallFunction.h
@@ +29,5 @@
> +//
> +// And so forth.
> +#define DefineCallFunction(aABI, aReturnType, aFormals, aFormalTypes, aActuals) \
> + static inline aReturnType CallFunction ##aABI aFormals { \
> + return BitwiseCast<aReturnType (aABI *) aFormalTypes>(aFn) aActuals; \

I'm pretty sure you could do this all with variadic templates, but somehow I think that would be more pain than it's worth...

::: toolkit/recordreplay/ProcessRedirect.cpp
@@ +678,5 @@
> + UnprotectExecutableMemory(patch.mStart, patch.mShort ? ShortJumpBytes : JumpBytesClobberRax);
> + }
> + for (size_t i = 0; i < gClobberPatches.length(); i++) {
> + const ClobberPatch& patch = gClobberPatches[i];
> + UnprotectExecutableMemory(patch.mStart, patch.mEnd - patch.mStart);

I think we want to reprotect the patched memory from these two loops...or do we require these regions to be unprotected while things are running?

::: toolkit/recordreplay/ProcessRedirect.h
@@ +3,5 @@
> +/* This Source Code Form is subject to the terms of the Mozilla Public
> + * License, v. 2.0. If a copy of the MPL was not distributed with this
> + * file, You can obtain one at http://mozilla.org/MPL/2.0/. */
> +
> +#ifndef mozilla_toolkit_recordreplay_ProcessRedirect_h

Single-line comment prior to this for DXR et al would be great. Here and the other headers as well.
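As an illustration of the variadic-template idea mentioned for CallFunction.h, a rough sketch might look like the following. It is hypothetical and deliberately incomplete: it ignores the `aABI` calling-convention parameter that the macros handle (which is the part that caused trouble on Windows), it uses `std::memcpy` as a stand-in for `BitwiseCast`, and `AddInts` exists only for the example.

```cpp
#include <cassert>
#include <cstring>

// Hypothetical variadic replacement for the DefineCallFunction macros.
// Assumption: the default calling convention; the aABI parameter is omitted.
// The argument types are deduced from the call site, so callers must pass
// arguments whose types match the real function's parameters exactly.
template <typename ReturnType, typename... Args>
inline ReturnType CallFunction(void* aFn, Args... aArgs) {
  using FnType = ReturnType (*)(Args...);
  static_assert(sizeof(FnType) == sizeof(void*),
                "function and object pointers must have the same size");
  FnType fn;
  std::memcpy(&fn, &aFn, sizeof(fn));  // Bitwise cast of the raw pointer.
  return fn(aArgs...);
}

// Example target function for the usage shown below.
inline int AddInts(int aX, int aY) { return aX + aY; }
```

A call such as `CallFunction<int>(reinterpret_cast<void*>(&AddInts), 2, 3)` then invokes `AddInts` through the untyped pointer. The exact-type requirement on the deduced arguments is one reason this approach may indeed be "more pain than it's worth".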

Attachment #8800313 - Flags: review?(nfroyd) → review+

Brian Hackett [Laid off!]

Assignee

Comment 340

•

9 years ago

(In reply to Nathan Froyd [:froydnj] from comment #337)
> @@ +84,5 @@
> > +// be in an atomic region at once, and the order in which atomic sections are
> > +// executed by the various threads will be the same in the replay as in the
> > +// recording.
> > +static inline void BeginOrderedAtomicAccess();
> > +static inline void EndOrderedAtomicAccess();
>
> Where are these functions (and similar |static inline| declarations)
> actually defined? Do they come in a later patch that put them in this
> header, or out-of-line? |inline| seems like not what you want for these if
> they are actually defined out-of-line...

They are defined later in this file via some macros, under the "API inline function implementation" comment.

Brian Hackett [Laid off!]

Assignee

Comment 341

•

9 years ago

(In reply to Nathan Froyd [:froydnj] from comment #339)
> ::: toolkit/recordreplay/ProcessRedirect.cpp
> @@ +678,5 @@
> > + UnprotectExecutableMemory(patch.mStart, patch.mShort ? ShortJumpBytes : JumpBytesClobberRax);
> > + }
> > + for (size_t i = 0; i < gClobberPatches.length(); i++) {
> > + const ClobberPatch& patch = gClobberPatches[i];
> > + UnprotectExecutableMemory(patch.mStart, patch.mEnd - patch.mStart);
>
> I think we want to reprotect the patched memory from these two loops...or do
> we require these regions to be unprotected while things are running?

I'll fix this so we reprotect the regions (I didn't do it after Jan's review comment earlier because there was an interaction with the mprotect redirection on OS X --- right now we ignore all mprotect calls while replaying --- but that redirection can be fixed).

Jim Blandy :jimb

Comment 342

•

9 years ago

So, here we are at comment 342! Brian, could you give us a summary of how things are going?

Flags: needinfo?(bhackett1024)

Brian Hackett [Laid off!]

Assignee

Comment 343

•

9 years ago

(In reply to Jim Blandy :jimb from comment #342)
> So, here we are at comment 342! Brian, could you give us a summary of how
> things are going?

Well, most of the work I've done lately has been on the Windows port in bug 1310271, though for the last few weeks I haven't been working on this project at all. The reviews in this bug have stalled and I no longer have an idea of how this project fits into the future of the browser, and this has been rather demotivating. I'm hoping to get things cleared up next week in Hawaii and then resume working on the Windows port.

Flags: needinfo?(bhackett1024)

Jim Blandy :jimb

Comment 344

•

9 years ago

> I no longer have an idea of how this project fits into the future of the browser, and this has been rather demotivating.

I'm sorry to hear that. At DevTools we have a pretty clear idea how this fits into the future of the browser, in that we're considering it a killer feature, albeit one at high risk due to the complexity of the implementation. Can you say more about how you'd originally seen it fitting in, and why that isn't working out?

Brian Hackett [Laid off!]

Assignee

Comment 345

•

9 years ago

(In reply to Jim Blandy :jimb from comment #344)
> > I no longer have an idea of how this project fits into the future of the browser, and this has been rather demotivating.
>
> I'm sorry to hear that. At DevTools we have a pretty clear idea how this
> fits into the future of the browser, in that we're considering it a killer
> feature, albeit one at high risk due to the complexity of the
> implementation. Can you say more about how you'd originally seen it fitting
> in, and why that isn't working out?

I'm mainly concerned about when and how to land this without interfering with Quantum or other ongoing work. I don't want to be overly negative/dramatic; this project is the coolest thing I've ever done, and I'm fully committed to it.

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 346

•

9 years ago

It would be worth discussing next week when the best time to land this would be. My initial guess is asap. One question is how to ensure all this keeps working, and how other people can modify the code which this bug is touching. The code here is, after all, rather black magic at first sight. I assume we will have to disable some Quantum (DOM) specific stuff when replay is enabled, but that is probably fine.

Bill McCloskey [inactive unless it's an emergency] (:billm)

Comment 347

•

9 years ago

Comment on attachment 8790834 [details] [diff] [review]
Part 8b - Manually record/replay mach_msg IPC calls.

Review of attachment 8790834 [details] [diff] [review]:
-----------------------------------------------------------------

::: ipc/chromium/src/chrome/common/mach_ipc_mac.mm
@@ +277,5 @@
> + timeout, // timeout in ms
> + MACH_PORT_NULL);
> + }
> +
> + mozilla::recordreplay::AutoOrderedAtomicAccess();

What is this expected to do? Did you mean to include a variable name here?

Attachment #8790834 - Flags: review?(wmccloskey) → review+

Bill McCloskey [inactive unless it's an emergency] (:billm)

Updated

•

9 years ago

Attachment #8800322 - Flags: review?(wmccloskey) → review+

Bill McCloskey [inactive unless it's an emergency] (:billm)

•

9 years ago

(In reply to Bill McCloskey (:billm) from comment #349)
> Comment on attachment 8790834 [details] [diff] [review]
> Part 8b - Manually record/replay mach_msg IPC calls.
>
> Review of attachment 8790834 [details] [diff] [review]:
> -----------------------------------------------------------------
>
> ::: ipc/chromium/src/chrome/common/mach_ipc_mac.mm
> @@ +277,5 @@
> > + timeout, // timeout in ms
> > + MACH_PORT_NULL);
> > + }
> > +
> > + mozilla::recordreplay::AutoOrderedAtomicAccess();
>
> What is this expected to do? Did you mean to include a variable name here?

Calling AutoOrderedAtomicAccess() without an RAII class instance can be used to record/replay the order in which certain events execute. In this case we are ensuring that during the replay we don't replay the receipt of messages before the sender has actually sent them. I'll add a comment here, but first I'll look more into just hooking mach_msg and removing all this instrumentation, which is what we really should be doing.
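The difference between the unnamed temporary and a named RAII instance comes down to C++ temporary lifetime rules, which the sketch below demonstrates. `AutoRegion` is a hypothetical stand-in that only appends to a log string: "B" when the region begins, "E" when it ends, and "w" for the work that follows. It does not model the real AutoOrderedAtomicAccess ordering logic.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for an RAII region marker; it only logs begin/end.
class AutoRegion {
public:
  explicit AutoRegion(std::string& aLog) : mLog(aLog) { mLog += "B"; }
  ~AutoRegion() { mLog += "E"; }

private:
  std::string& mLog;
};

// Unnamed temporary: destroyed at the end of the full expression, so the
// region is already closed before the following statement runs.
inline std::string UnnamedTemporary() {
  std::string log;
  AutoRegion{log};  // Braces: `AutoRegion(log);` would declare a variable.
  log += "w";
  return log;
}

// Named guard: the region stays open until the enclosing scope ends.
inline std::string NamedGuard() {
  std::string log;
  {
    AutoRegion guard(log);
    log += "w";
  }
  return log;
}
```

In this model the temporary form acts as a single ordering point ("BEw"), while the named form covers the work inside the scope ("BwE"). That matches the explanation above that the unnamed call records when an event happens rather than protecting a region.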

Bill McCloskey [inactive unless it's an emergency] (:billm)

Updated

•

9 years ago

Attachment #8800333 - Flags: review?(wmccloskey) → review+

Bill McCloskey [inactive unless it's an emergency] (:billm)

Comment 357

•

9 years ago

Bas Schouten (:bas.schouten)

Comment 358

•

9 years ago

Comment on attachment 8790880 [details] [diff] [review]
Part 14b - Fix backend getter for DrawTargetRecording.

Review of attachment 8790880 [details] [diff] [review]:
-----------------------------------------------------------------

::: gfx/2d/DrawTargetRecording.h
@@ -31,5 @@
>
> ~DrawTargetRecording();
>
> virtual DrawTargetType GetType() const override { return mFinalDT->GetType(); }
> - virtual BackendType GetBackendType() const override { return mFinalDT->GetBackendType(); }

This can actually cause problems, we don't want the recording DT to be visible to other users or we'll construct incorrect things inside Mozilla when using one.

Attachment #8790880 - Flags: review?(bas) → review-

Bas Schouten (:bas.schouten)

Comment 359

•

9 years ago

Comment on attachment 8790880 [details] [diff] [review]
Part 14b - Fix backend getter for DrawTargetRecording.

Review of attachment 8790880 [details] [diff] [review]:
-----------------------------------------------------------------

FWIW, the comment here is correct. There were some bugs with this, and there still might be. This is what 'IsRecording' is for; those places should check for that as well.

Bas Schouten (:bas.schouten)

Comment 360

•

9 years ago

Comment on attachment 8790882 [details] [diff] [review]
Part 14c - Allow recording translator to manage creation of similar draw targets.

Review of attachment 8790882 [details] [diff] [review]:
-----------------------------------------------------------------

I think your other suggestion is better, having a Translator::CreateSimilarDT.

Attachment #8790882 - Flags: review?(bas) → review-

Bas Schouten (:bas.schouten)

•

9 years ago

Attachment #8790873 - Flags: review?(jorendorff) → review?(wmccloskey)

J. Ryan Stinnett [:jryans] (Use needinfo, replies may be slow)

•

8 years ago

I've restarted work on Web Replay. I'll be continuing to develop the technology rather than trying to land it, so I opened bug 1422587 for this new work.

Pulsebot

Comment 369

•

7 years ago

Pushed by bhackett@mozilla.com: https://hg.mozilla.org/integration/mozilla-inbound/rev/385ff3d8137a Part 5e - Don't assume that CFBundleGetFunctionPointerForName succeeds, r=benwa.

Brian Hackett [Laid off!]

Assignee

Comment 370

•

7 years ago

Web Replay is finally ready to land! A fair number of the patches in this bug are still relevant and will be landing as well. I'll be dividing things into multiple pushes.

Whiteboard: leave-open

Bogdan Tara[:bogdan_tara | bogdant]

Comment 371

•

7 years ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/385ff3d8137a

Pulsebot

•

7 years ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/9042673fb235 https://hg.mozilla.org/mozilla-central/rev/92fc9eca3135

Pulsebot

Comment 376

•

7 years ago

Pushed by bhackett@mozilla.com:
https://hg.mozilla.org/integration/mozilla-inbound/rev/34cf09869600
Part 1e - Disable crash reporting when recording/replaying, r=billm.
https://hg.mozilla.org/integration/mozilla-inbound/rev/eec76ff04ff9
Part 5a - Disable incremental GC when recording or replaying, r=mccr8.
https://hg.mozilla.org/integration/mozilla-inbound/rev/1f2c6099f852
Part 5b - Don't keep track of times or page fault counts in GC and helper thread activity when recording or replaying, r=sfink.
https://hg.mozilla.org/integration/mozilla-inbound/rev/ab179dae314c
Part 5c - Don't dispatch runnables for GC or finalization when under the GC and recording or replaying, r=mccr8.
https://hg.mozilla.org/integration/mozilla-inbound/rev/6771fa9888f1
Part 5d - Disable compacting GC when replaying, r=jonco.
https://hg.mozilla.org/integration/mozilla-inbound/rev/2af83b92e9c8
Part 5g - Disable finalization witnesses when recording or replaying, r=froydnj.
https://hg.mozilla.org/integration/mozilla-inbound/rev/be1ad4d1217e
Part 5i - Disable lazy and off thread JS parsing when recording or replaying, r=jandem.
https://hg.mozilla.org/integration/mozilla-inbound/rev/7ff86265c080
Part 5j - Don't add GC events to timelines when recording or replaying, r=mccr8.
https://hg.mozilla.org/integration/mozilla-inbound/rev/ab53a96c3b30
Part 5k - Don't generate debugger runnables on GC events, r=fitzgen.
https://hg.mozilla.org/integration/mozilla-inbound/rev/0649e9060851
Part 5l - Don't trace refcounts while recording or replaying, r=froydnj.
https://hg.mozilla.org/integration/mozilla-inbound/rev/a926716f38c4
Part 5n - Don't perform telemetry while recording or replaying, r=gfritzsche.
https://hg.mozilla.org/integration/mozilla-inbound/rev/c0db8c1e5050
Part 6a - Disable media elements when recording or replaying, r=jesup.
https://hg.mozilla.org/integration/mozilla-inbound/rev/ff3373e34204
Part 6c - Disable accelerated canvases when recording or replaying, r=dvander.
https://hg.mozilla.org/integration/mozilla-inbound/rev/d558b836552b
Part 6d - Disable wasm signal handlers when recording or replaying, r=luke.
https://hg.mozilla.org/integration/mozilla-inbound/rev/ab75f0522bcc
Part 6e - Disable the slow script dialog when recording or replaying, r=mrbkap.
https://hg.mozilla.org/integration/mozilla-inbound/rev/67736c575b34
Part 7 - Ensure deterministic interaction of GC with CC and object references, r=smaug.
https://hg.mozilla.org/integration/mozilla-inbound/rev/d9bbebacecd6
Part 8c - Mark places in the JS engine where recording events are disallowed and where the recording should be invalidated, r=jandem.
https://hg.mozilla.org/integration/mozilla-inbound/rev/1e146aebbcc6
Part 8f - Ensure that PL and PLD hashtables have consistent iteration order when recording/replaying, r=froydnj.
https://hg.mozilla.org/integration/mozilla-inbound/rev/70c285e729d9
Part 10f - Coordinate with snapshot mechanism in JS helper threads, r=fitzgen.

Raul Gurzau (:RaulG)

Comment 377

•

7 years ago

bugherder

Pulsebot

Comment 378

•

7 years ago

Pushed by bhackett@mozilla.com: https://hg.mozilla.org/integration/mozilla-inbound/rev/fa06b0a09780 Part 16 - Server side devtools changes, r=jimb.

Sebastian Hengst [:aryx] (needinfo me if it's about an intermittent or backout)

Comment 379

•

7 years ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/fa06b0a09780

Brian Hackett [Laid off!]

Assignee

Comment 380

•

7 years ago

All parts from this bug, and all parts from bug 1422587 except tests, have landed and should be in tomorrow's nightly (Mac only for now). Features are turned on via the devtools.recordreplay.enabled pref (which defaults to false for now) and accessed via the 'Tools -> Web Developer' menu. Things might not be super stable yet.

Status: NEW → RESOLVED

Closed: 7 years ago

Resolution: --- → FIXED

Sylvestre Ledru [:Sylvestre]

Updated

•

7 years ago

Depends on: 1489449
