Closed Bug 1258072 Opened 10 years ago Closed 5 years ago

Hang in Nightly with stack sampling; happens when using a Mozilla internal Jenkins Ops tool

Tracking

()

Status:

RESOLVED DUPLICATE of bug 698882

Tracking Flags:

Tracking

Status

firefox48

---

affected

People

(Reporter: jrgm, Assigned: jrgm)

Details

(Whiteboard: [necko-backlog])

Attachments

(7 files)

NightlyHang.txt 10 years ago John Morrison [:jrgm] 144.93 KB, text/plain		Details
experienced this same type of hang again; here's the process stack sample 10 years ago John Morrison [:jrgm] 150.97 KB, text/plain		Details
I'm on a hot streak; here's another process stack sample in the hung state 10 years ago John Morrison [:jrgm] 151.12 KB, text/plain		Details
and another hang stack sample 10 years ago John Morrison [:jrgm] 155.31 KB, text/plain		Details
another one bites the dust 10 years ago John Morrison [:jrgm] 155.64 KB, text/plain		Details
And another one gone 10 years ago John Morrison [:jrgm] 155.40 KB, text/plain		Details
And another one gone 10 years ago John Morrison [:jrgm] 151.08 KB, text/plain		Details

John Morrison [:jrgm]

Assignee

Description

•

10 years ago

Attached file NightlyHang.txt — Details

Hi Bill, This is that hang that I experience when using a Jenkins internal ops tool. I'm attaching a process sample from Activity Monitor. Some notes: - E10S is not enabled in this profile. (Although, I used to have it on, and would see similar hangs). - Nightly entered into this hang by initially burning 250% CPU for a few minutes, and then dropped to ~0% CPU and "Not Responding" showing in Activity Monitor. - A Quit from Activity Monitor had no effect. I had to Force Quit.

John Morrison [:jrgm]

Assignee

Comment 1

•

10 years ago

Attached file experienced this same type of hang again; here's the process stack sample — Details

John Morrison [:jrgm]

Assignee

Comment 2

•

10 years ago

Attached file I'm on a hot streak; here's another process stack sample in the hung state — Details

John Morrison [:jrgm]

Assignee

Comment 3

•

10 years ago

Attached file and another hang stack sample — Details

Do you have enough information from these four stack samples, or shall I just keep submitting more?

John Morrison [:jrgm]

Assignee

Comment 4

•

10 years ago

Attached file another one bites the dust — Details

John Morrison [:jrgm]

Assignee

Comment 5

•

10 years ago

Attached file And another one gone — Details

John Morrison [:jrgm]

Assignee

Comment 6

•

10 years ago

Attached file And another one gone — Details

Bill McCloskey [inactive unless it's an emergency] (:billm)

Comment 7

•

10 years ago

It looks like the call to PR_SetPollableEvent is expected to be non-blocking. But in this case we're writing so much data that we block waiting for the queue to empty. This may actually be an NSPR bug, but I'll needinfo Patrick since he probably has a better idea.

Assignee: wmccloskey → nobody

Component: General → Networking

Flags: needinfo?(mcmanus)

Patrick McManus [:mcmanus]

Comment 8

•

10 years ago

your timing is pretty amazing. This is a dup of bug 698882 which has been open for years and was just merged to mozilla-central one hour ago. retest when it hits a nightly build? so yes, PR_SetPollableEvent uses a blocking queue which is a serious bug, and it can cause a deadlock when tons of events are generated on the socket thread.. which rarely happens - but apparently jenkins makes it happen somehow? in any event, the deadlock should be fixed by 698882 whenever it sticks (it has a uncovered several unrelated latent bugs and been backed out a few times).

Flags: needinfo?(mcmanus)

John Morrison [:jrgm]

Assignee

Comment 9

•

10 years ago

Cool. I'll see if I get this hang again. (Given my recent rate of these hangs, if I don't see it in a week or so, it probably means it's been fixed).

Honza Bambas (:mayhemer)

Updated

•

10 years ago

Whiteboard: [necko-active]

Patrick McManus [:mcmanus]

Updated

•

10 years ago

Assignee: nobody → jrgm

Flags: needinfo?(jrgm)

John Morrison [:jrgm]

Assignee

Comment 10

•

10 years ago

So, I believe I had the same hang in the past two weeks, but I didn't have time right then to capture a trace, as I had a more pressing problem to address. Sorry. If I trigger it again, and I have a bit of time to capture the stack, I will.

Flags: needinfo?(jrgm)

Patrick McManus [:mcmanus]

Updated

•

10 years ago

Whiteboard: [necko-active] → [necko-backlog]

Firefox Bug Husbandry Bot

Comment 11

•

8 years ago

Bulk change to priority: https://bugzilla.mozilla.org/show_bug.cgi?id=1399258

Priority: -- → P1

Firefox Bug Husbandry Bot

Comment 12

•

8 years ago

Bulk change to priority: https://bugzilla.mozilla.org/show_bug.cgi?id=1399258

Priority: P1 → P3

Andrei Purice

Updated

•

5 years ago

Status: NEW → RESOLVED

Closed: 5 years ago

Resolution: --- → DUPLICATE

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Hang in Nightly with stack sampling; happens when using a Mozilla internal Jenkins Ops tool

Categories

(Core :: Networking, defect, P3)

Tracking

()

People

(Reporter: jrgm, Assigned: jrgm)

References

Details

(Whiteboard: [necko-backlog])

Crash Data

Security

(public)

User Story

Attachments

(7 files)

Description

Comment 1

Comment 2

Comment 3

Comment 4

Comment 5

Comment 6

Comment 7

Comment 8

Comment 9

Updated

Updated

Comment 10

Updated

Comment 11

Comment 12

Updated

Attachment

General

Description

File Name

Content Type