Closed Bug 1452000 Opened 7 years ago Closed 7 years ago

Figure out plan for statsum/signalfx/etc in a r14y world

Categories

(Taskcluster :: Services, enhancement)

enhancement
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: bstack, Assigned: dustin)

References

Details

What is the easiest/best thing we can provide external users for monitoring their clusters?
Assignee: nobody → dustin
I think the easiest thing, for early stages of redeployability, is "nothing". Let's find a way to disable these in taskcluster-lib-monitor, using configuration we pass to it. Alexandre had a similar issue today, where docker-worker won't start up without auth:statsum:docker-worker and auth:sentry:docker-worker. Longer-term, we could build a monitoring service to replace or supplement statsum, that also handles exception reporting, and where users can centrally implement and configure backends for various services.
Later, we'll want to be able to customize the project name. That's already possible, but services tend to hard-code it, rather than taking it from a configuration file. So I think what I'll do is this: * add an `enabled` option to tc-lib-monitor, which we'll just set to false in all deployments for the moment * update all services to get the project name as well as `enabled` from config.yml Brian, does that seem reasonable?
Flags: needinfo?(bstack)
We need to project-name just to support things like staging clusters internally right? Otherwise this is fine with me. We can't assume that our users will want signalfx and friends.
Flags: needinfo?(bstack)
https://github.com/taskcluster/taskcluster-lib-monitor/pull/68 Once that's landed, I'll shop around the services and add the config options.
Upgrading tc-lib-monitor on everything is also going to ugprade raven, which is good, but a little scary. I want to do bug 1453724 first.
This will be bringing along bug 1453725 and bug 1453724 as well. Note that this involves a major update of raven. https://public.etherpad-mozilla.org/p/dustin-1452000-updates
Commits pushed to master at https://github.com/taskcluster/taskcluster-index https://github.com/taskcluster/taskcluster-index/commit/0f20e0b4f94b231b8b09a6b26acf03243f69b4f6 Bug 1452000, 1453724, 1453725 - upgrade tc-lib-monitor for greater good https://github.com/taskcluster/taskcluster-index/commit/d9316f3f4fc2a84ad028cb16388a6f4a9c34052a Merge pull request #50 from taskcluster/bug1452000 Bug 1452000, 1453724, 1453725 - upgrade tc-lib-monitor for greater good
Commits pushed to master at https://github.com/taskcluster/taskcluster-login https://github.com/taskcluster/taskcluster-login/commit/dbd73bc5330a3f0ae138cf383f43b1c76337c336 Bug 1452000, 1453724, 1453725 - upgrade tc-lib-monitor for greater good https://github.com/taskcluster/taskcluster-login/commit/84768de5a2d974160ab5cf0b837e7b2e587ca0a3 Merge pull request #96 from taskcluster/bug1452000 Bug 1452000, 1453724, 1453725 - upgrade tc-lib-monitor for greater good
Commits pushed to master at https://github.com/taskcluster/taskcluster-auth https://github.com/taskcluster/taskcluster-auth/commit/a6f35e19737eed7cf4a2a7788b59ee83c95ca11a Bug 1452000 - update to new lib-monitor and supply project, enable from config https://github.com/taskcluster/taskcluster-auth/commit/322acf06c4d2fd1911a085cc294b2f42aa6b113e Merge pull request #150 from djmitche/bug1452000 Bug 1452000 - update to new lib-monitoring and supply project, enable…
Commits pushed to master at https://github.com/taskcluster/taskcluster-queue https://github.com/taskcluster/taskcluster-queue/commit/79ea2eb733a3fd2ad320c913781dbdd007600dfb Bug 1452000, 1453724, 1453725 - upgrade tc-lib-monitor for greater good https://github.com/taskcluster/taskcluster-queue/commit/abcef54bea5398cd38af88117ece4ad7c7531337 Merge pull request #267 from djmitche/bug1452000 Bug 1452000, 1453724, 1453725 - upgrade tc-lib-monitor for greater good
Status: NEW → RESOLVED
Closed: 7 years ago
Resolution: --- → FIXED
Component: Redeployability → Services
You need to log in before you can comment on or make changes to this bug.