Closed Bug 866194 Opened 11 years ago Closed 11 years ago

Re-purpose some Fedora machines as WinXP

Categories

(Infrastructure & Operations :: DCOps, task)

x86
macOS
task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: armenzg, Unassigned)

References

Details

(Whiteboard: [Reimaging])

On bug 864741 I'm trying to determine how many Fedora machines we can re-purpose as WinXP machines (to help wait times).

On Monday/Tuesday I will have an answer of how many and which ones.
Stealing this because there will be some DNS/inventory/nagios/DS work that needs to be done before the machines can be reimaged.
Assignee: server-ops-dcops → arich
Component: Server Operations: DCOps → Server Operations: RelEng
QA Contact: dmoore → arich
Hi,
Could you please re-image the following list? I might have some more on Tuesday or Wednesday.

| talos-r3-fed-001   |
| talos-r3-fed-002   |
| talos-r3-fed-003   |
| talos-r3-fed-004   |
| talos-r3-fed-005   |
| talos-r3-fed64-031 |
| talos-r3-fed64-032 |
| talos-r3-fed64-033 |
| talos-r3-fed64-034 |
| talos-r3-fed64-036 |

Thanks a lot!
DCops, please reimage the following renamed machines by netbooting them.  Please also make sure that the dongle status matches those of the rest of the XP machines.


talos-r3-fed-001.build.scl1.mozilla.com talos-r3-xp-125.build.scl1.mozilla.com
talos-r3-fed-002.build.scl1.mozilla.com talos-r3-xp-126.build.scl1.mozilla.com
talos-r3-fed-003.build.scl1.mozilla.com talos-r3-xp-127.build.scl1.mozilla.com
talos-r3-fed-004.build.scl1.mozilla.com talos-r3-xp-128.build.scl1.mozilla.com
talos-r3-fed-005.build.scl1.mozilla.com talos-r3-xp-129.build.scl1.mozilla.com
talos-r3-fed64-031.build.scl1.mozilla.com talos-r3-xp-130.build.scl1.mozilla.com
talos-r3-fed64-032.build.scl1.mozilla.com talos-r3-xp-131.build.scl1.mozilla.com
talos-r3-fed64-033.build.scl1.mozilla.com talos-r3-xp-132.build.scl1.mozilla.com
talos-r3-fed64-034.build.scl1.mozilla.com talos-r3-xp-133.build.scl1.mozilla.com
talos-r3-fed64-036.build.scl1.mozilla.com talos-r3-xp-134.build.scl1.mozilla.com


I have modified DNS, inventory, nagios, and deploy studio.
Assignee: arich → server-ops-dcops
colo-trip: --- → scl1
Component: Server Operations: RelEng → Server Operations: DCOps
QA Contact: arich → dmoore
The following machines imaged correctly from the first batch:

talos-r3-xp-125.build.scl1.mozilla.com
talos-r3-xp-126.build.scl1.mozilla.com
talos-r3-xp-127.build.scl1.mozilla.com is working
Netbooted 128-134 after they failed 3 times. Will come back and check everything in the morning. 

Accidently hosed talos-r3-fed-031, created Bug 866992 for it.
The following machines are now working:

talos-r3-xp-129.build.scl1.mozilla.com
talos-r3-xp-131.build.scl1.mozilla.com

The following can not run tasklist correctly, and I have kicked off a second (or fourth) reimage:

talos-r3-xp-128.build.scl1.mozilla.com
talos-r3-xp-130.build.scl1.mozilla.com
talos-r3-xp-132.build.scl1.mozilla.com
talos-r3-xp-134.build.scl1.mozilla.com

I am unable to reach:

talos-r3-xp-133.build.scl1.mozilla.com
The following machine is now working:

talos-r3-xp-128.build.scl1.mozilla.com
Blocks: 864741
No longer depends on: 864741
Added the following new Windows XP machines to the production pool:
talos-r3-xp-128
talos-r3-xp-129
--
talos-r3-xp-131
The updated list of hosts in production is:
talos-r3-xp-125
talos-r3-xp-126
talos-r3-xp-127
talos-r3-xp-128
talos-r3-xp-129
--
talos-r3-xp-131
--
--
--
talos-r3-xp-133 is working now.
(In reply to Amy Rich [:arich] [:arr] from comment #11)
> talos-r3-xp-133 is working now.

Added to the pool.
The following machines are now working:

talos-r3-xp-130.build.scl1.mozilla.com
talos-r3-xp-132.build.scl1.mozilla.com
The updated list of hosts in production is:
talos-r3-xp-125
talos-r3-xp-126
talos-r3-xp-127
talos-r3-xp-128
talos-r3-xp-129
talos-r3-xp-130
talos-r3-xp-131
talos-r3-xp-132
talos-r3-xp-133
--
Can you also please re-image these machines as WinXP?
| talos-r3-fed-006   |
| talos-r3-fed-007   |
| talos-r3-fed-008   |
| talos-r3-fed-009   |
| talos-r3-fed-010   |
| talos-r3-fed64-037 |
| talos-r3-fed64-038 |
| talos-r3-fed64-039 |

I will not be requesting anymore rev3 re-purposing until:
1) bug 837017 and/or bug 850105 get resolved (move Fedora load somewhere else)
2) the Win7 on iX switch over project is completed

Those projects might be completed in May but I don't expect any changes in the next week.

Thanks for your help!
I've modified DNS, inventory, nagios, and DS for the following hosts, so they are ready to try netbooting now (note the new names):

talos-r3-fed-006 -> talos-r3-xp-135
talos-r3-fed-007 -> talos-r3-xp-136
talos-r3-fed-008 -> talos-r3-xp-137
talos-r3-fed-009 -> talos-r3-xp-138
talos-r3-fed-010 -> talos-r3-xp-139
talos-r3-fed64-037 -> talos-r3-xp-140
talos-r3-fed64-038 -> talos-r3-xp-141
talos-r3-fed64-039 -> talos-r3-xp-142


talos-r3-xp-134 is the only host left of the original 10 that is not working correctly yet.

DCOps, please therefore netboot talos-r3-xp-134 - talos-r3-xp-142.
any changes since yesterday? thanks!
Host have been labeled with new host name.
Netbooting now: 
    talos-r3-xp-133
    talos-r3-xp-135
    talos-r3-xp-136
    talos-r3-xp-137
    talos-r3-xp-138
    talos-r3-xp-139
talos-r3-xp-134 is able to run tasklist
Able to run tasklist:
talos-r3-xp-135
talos-r3-xp-136
Talos-r3-xp-134 is now working
In production:
talos-r3-xp-134
talos-r3-xp-135
talos-r3-xp-136

Not yet:
talos-r3-xp-137
talos-r3-xp-138
talos-r3-xp-139
talos-r3-xp-140
talos-r3-xp-141
talos-r3-xp-142
talos-r3-xp-137 is now able to run tasklist
In the process of reimaging:

talos-r3-xp-138
talos-r3-xp-139
talos-r3-xp-140
talos-r3-xp-141
talos-r3-xp-142
Talos-r3-xp-139 is now able to run tasklist.
Tasklist failed a total of 5 times. This is going to be the 6th reimage for:

talos-r3-xp-138
talos-r3-xp-140
talos-r3-xp-141
talos-r3-xp-142
Should we try to image them as win7? I know that we say that we are going to switch to Win7 on iX but that could take a bit more.

We can also put them on the side and wait to see.
It seems that we have enough WinXP machines to meet our current SLA of 95% wait times.
I need a day with real high load to see if this is true.
In production:
talos-r3-xp-134
talos-r3-xp-135
talos-r3-xp-136
talos-r3-xp-137
talos-r3-xp-139

Not yet:
talos-r3-xp-138
talos-r3-xp-140
talos-r3-xp-141
talos-r3-xp-142
Reimaging now:
talos-r3-xp-138
talos-r3-xp-140
talos-r3-xp-141
talos-r3-xp-142
Able to run tasklist:
talos-r3-xp-142
Status: NEW → ASSIGNED
Whiteboard: [Reimaging]
(In reply to Ashlee Chavez [:Ashlee] from comment #29)
> Able to run tasklist:
> talos-r3-xp-142

Added to the pool.
Reimaging now:
talos-r3-xp-138
talos-r3-xp-140
talos-r3-xp-141
Able to run tasklist:
talos-r3-xp-140
Able to run tasklist:
talos-r3-xp-138
Able to run tasklist:
talos-r3-xp-141
I've added the last three.
Unless I hear otherwise I believe that we're done in here.

Thank you all!

This has made a big difference on WinXP's wait times.
Status: ASSIGNED → RESOLVED
Closed: 11 years ago
Resolution: --- → FIXED
Product: mozilla.org → Infrastructure & Operations
You need to log in before you can comment on or make changes to this bug.