Closed
Bug 592793
Opened 14 years ago
Closed 13 years ago
tools-staging-master vm can't keep up with sendchanges
Categories
(Infrastructure & Operations :: RelOps: General, task)
Infrastructure & Operations
RelOps: General
Tracking
(Not tracked)
RESOLVED
WONTFIX
People
(Reporter: anodelman, Assigned: arich)
References
Details
The tools-staging-master vm is hosted on the resource constrained ESX host in mountain view - thus it is unable to keep up with all of the sendchanges from production build machines.
We can keep things afloat here by reducing the talos masters on tools-staging-master to only test moz-central and ignore all other branches/tests. The long term solution is having the esx host upgraded.
Updated•14 years ago
|
Assignee: nobody → mrz
Comment 1•14 years ago
|
||
What luck! Hardware just arrived today. This is likely to be after the 42 builders though.
Comment 2•14 years ago
|
||
Ben - can you spin up this VM for Alice on the KVM cluster @ Castro?
Assignee: mrz → bkero
Comment 3•14 years ago
|
||
Sure, I'll do that this afternoon.
Reporter | ||
Comment 4•14 years ago
|
||
Any news here?
Comment 5•14 years ago
|
||
Sorry for the delay, I needed to finish setting up the KVM cluster for you.
The machine should be available at tools-staging-master02.build.mtv1.mozilla.com. Let me know if you have any issues with the VM, or if you needed any hardware(RAM, CPUs, drive space) added, or if you need any SSH keys put on there.
Reporter | ||
Comment 6•14 years ago
|
||
So, we have a new staging master?
Comment 7•14 years ago
|
||
Punting this back to Alice. You have a root login to that box.
Assignee: bkero → anodelman
Reporter | ||
Comment 8•14 years ago
|
||
Let's not punt so fast...
I'd like tools-staging-master02 to be a copy of tools-staging-master as I've already done a bunch of setup work on tools-staging-master, as have my coworkers.
Can you pull a copy of tools-staging-master and apply it to tools-staging-master02?
Assignee: anodelman → server-ops
Component: Talos → Server Operations
Product: Testing → mozilla.org
QA Contact: talos → mrz
Version: unspecified → other
Comment 9•14 years ago
|
||
No, not possible (or easy). One's an ESX VM and the other is a KVM VM.
Reporter | ||
Comment 10•14 years ago
|
||
What's the image on tools-staging-master02?
Comment 11•14 years ago
|
||
Pretty sure it's a fresh CentOS install but I'll let Ben comment.
Assignee: server-ops → bkero
Comment 12•14 years ago
|
||
Yep, it's a fresh centos 5.5 install.
Comment 13•14 years ago
|
||
How should we proceed on this? If I learn a bit about ESX I should be able to dump the image for tools-staging-master, and import it into KVM.
Reporter | ||
Comment 14•14 years ago
|
||
If it's possible to get a copy of the current state of tools-staging-master and put it onto tools-staging-master02 that would be great, but I'm unsure what is involved.
Setting up from scratch is going to be time consuming on the auto-tools team side (upwards of a week, I bet, to get all the ducks in a row).
Comment 15•14 years ago
|
||
Comment 16•14 years ago
|
||
I'm going to need to research ESX to accomplish this. I'll be doing this anyway for another bug. I'm first going to try to do this without any downtime, but will let you know the situation as I proceed.
Comment 17•14 years ago
|
||
(In reply to comment #13)
> How should we proceed on this? If I learn a bit about ESX I should be able to
> dump the image for tools-staging-master, and import it into KVM.
If you can take a short downtime, I can copy the VM to another location.
Updated•14 years ago
|
Assignee: bkero → server-ops-releng
Component: Server Operations → Server Operations: RelEng
QA Contact: mrz → zandr
Reporter | ||
Comment 18•14 years ago
|
||
Short downtimes are a-okay if it gets the existing, configured master somewhere where it can run more efficiently.
Updated•14 years ago
|
Assignee: server-ops-releng → phong
Updated•14 years ago
|
Assignee: phong → bkero
Comment 19•14 years ago
|
||
Phong,
did this ever get copied somewhere?
If not, I have enough ESX chops now to copy this somewhere for replication.
Assignee | ||
Comment 20•13 years ago
|
||
I'm going through the releng ops queue and just checking to see where we stand on this bug. Ben/Alice, have the two of you coordinated a downtime and gotten a new vm spun up?
Reporter | ||
Comment 21•13 years ago
|
||
I believe that a downtime was attempted but was not successful...
Comment 22•13 years ago
|
||
I can create this VM at any time, except I think this also follows the same problem as talos-addon-master1, where you'll need a new reference image.
Would you like me to deploy with the tools-staging-master reference image, or wait for a new one here too?
Reporter | ||
Comment 23•13 years ago
|
||
This is dependent on bug 659512 - having a copy of the old tools-staging-master won't be helpful as it doesn't work with the latest releng master set up.
Assignee | ||
Comment 24•13 years ago
|
||
Since we started over fresh with tools-staging-master02, where do we stand on this bug?
Reporter | ||
Comment 25•13 years ago
|
||
I think that we could mark this is a WONTFIX:
- the new vm should be better able to keep up
- we are no longer linked to the releng masters that were generating the glut of sendchanges and are instead generating sendchanges locally using scripts
Status: NEW → RESOLVED
Closed: 13 years ago
Resolution: --- → WONTFIX
Assignee | ||
Updated•13 years ago
|
Assignee: bkero → arich
Updated•11 years ago
|
Component: Server Operations: RelEng → RelOps
Product: mozilla.org → Infrastructure & Operations
You need to log in
before you can comment on or make changes to this bug.
Description
•