Closed Bug 565156 Opened 14 years ago Closed 14 years ago

Add rabbitmq to munin

Tracking

(Not tracked)

Status:

RESOLVED FIXED

People

(Reporter: clouserw, Assigned: oremj)

Details

Wil Clouser [:clouserw]

Reporter

Description

•

14 years ago

Rabbitmq is running on the gearman boxes (or at least one of them).  We should send stats to munin so we can monitor it over time:

http://github.com/ask/rabbitmq-munin/

(I'm assuming nagios is already making sure it's running, but if that's not the case please do that also)

Jeremy Orem [:oremj]

Assignee

Updated

•

14 years ago

Assignee: server-ops → jeremy.orem+bugs

Jeremy Orem [:oremj]

Assignee

Comment 1

•

14 years ago

Installed munin plugins.  What about rabbitmq should I be monitoring, just a process or tcp check?

Wil Clouser [:clouserw]

Reporter

Comment 2

•

14 years ago

(In reply to comment #1)
> Installed munin plugins.  What about rabbitmq should I be monitoring, just a
> process or tcp check?

I think we should be monitoring the same stuff munin is.  Munin is just execing rabbitmqctl, the scripts it's running are almost good enough for nagios - it even has warn/crit thresholds.  They are all at http://github.com/ask/rabbitmq-munin

Jeremy Orem [:oremj]

Assignee

Comment 3

•

14 years ago

What will the action be if nagios goes off? Need to make docs for the other admins.

Wil Clouser [:clouserw]

Reporter

Comment 4

•

14 years ago

If it's below threshold for workers, start more workers and then figure out why they disappeared.  

If the queue is too high, order more hardware I guess.  Also let webdev know so we can throttle back unimportant jobs.

I think amo-developers should get these pages too.

Wil Clouser [:clouserw]

Reporter

Comment 5

•

14 years ago

Only the connections graph is working in munin, all the rest are blank.  If you run the commands manually do they execute?  If you're running them through sudo don't forget to add rabbitmqctl to what it can run.

Jeremy Orem [:oremj]

Assignee

Comment 6

•

14 years ago

I didn't have "env.vhost vhostname" set. I was hoping by default it would just graph all vhosts. Kind of lame that it will only do 1.

Jeremy Orem [:oremj]

Assignee

Comment 7

•

14 years ago

Turns out these plugins don't work with celery at all. It expects just a couple of queues to exists and celery has created over 7,000 queues.

Jeff Balogh (:jbalogh)

Comment 8

•

14 years ago

Can we try these again now that we're not creating tons of result queues? (bug 567932)

Jeremy Orem [:oremj]

Assignee

Comment 9

•

14 years ago

Graphs are up: http://munin.mozilla.org/munin/gearman/pm-gearman-amo01.mozilla.org/

Jeremy Orem [:oremj]

Assignee

Updated

•

14 years ago

Status: NEW → RESOLVED

Closed: 14 years ago

Resolution: --- → FIXED

Nobody; OK to take it and work on it

Updated

•

11 years ago

Component: Server Operations: Web Operations → WebOps: Other

Product: mozilla.org → Infrastructure & Operations

BMO Automation

Updated

•

5 years ago

Product: Infrastructure & Operations → Infrastructure & Operations Graveyard

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Quick Search

Add rabbitmq to munin

Categories

(Infrastructure & Operations Graveyard :: WebOps: Other, task)

Tracking

(Not tracked)

People

(Reporter: clouserw, Assigned: oremj)

References

Details

Crash Data

Security

(public)

User Story

Description

Updated

Comment 1

Comment 2

Comment 3

Comment 4

Comment 5

Comment 6

Comment 7

Comment 8

Comment 9

Updated

Updated

Updated