Closed Bug 889106 Opened 11 years ago Closed 11 years ago

RMA foopy87.p7.releng.scl1.mozilla.com

Categories

(Infrastructure & Operations :: DCOps, task, P5)

x86
Linux

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: vinh, Assigned: achavez)

References

Details

(Whiteboard: Ticket ID: YKW-368842)

Host won't power on. Needs RMA with iXSystems.
Ticket ID:  YKW-368842
Assignee: server-ops-dcops → vhua
colo-trip: --- → scl1
OS: Mac OS X → Linux
Group: infra
Blocks: foopy87
It's important that we get this machine back online asap, has iX done the RMA yet?  If not, can they just give us a new unit?
Update from iX:



This repair should be completed and Q/C'ed this week. As soon as the new unit is pristine and ready to go, we will be updating you ASAP.

Cheers,

-- 
Paul Gikas
iXsystems, Inc
408.943.4100 x107
408.943.4101 fax
www.ixsystems.com
I picked up the node from iX and racked it back. It's still continuing to NOT power on. 

I emailed and called Paul Gikas from iX letting him know how urgent this was and that the same problem was still occurring. 

He thinks it could be the chassis so he's sending Sean from iX to scl1 on Monday at 10:30am to do some trouble shooting and/or replace whatever needs to be replaced so we can get this back online asap. 

Sorry for the inconvenience!
Status: NEW → ASSIGNED
Update from iX:

Per our discussion, Shawn will be meeting you on-site Monday (7/15) morning, to help confirm the chassis as the culprit.

We do have the replacement chassis on-order, in case needed, and will be keeping you posted on it's ETA.

If there was anything else we could help address, until our next set of updates, please let us know.

Thank you,

-- 
Paul Gikas
iXsystems, Inc
408.943.4100 x107
408.943.4101 fax
www.ixsystems.com
-- 


Ticket Details
===================
Ticket ID: YKW-368842
Department: RMA
Priority: Normal
Status: Open
iX picked up node again, concluded that it wasn't the whole chassis. A replacement node will be ordered today. I'll update the bug as soon as I get an ETA on the node.
Assignee: vhua → achavez
Update from iX:

Just got off the phone with Supermicro and they will either have the parts for us late this afternoon or tomorrow morning.

------------
Shawn Cox
iXsystems, Inc.
408.943.4100
408.943.4101 fax
www.ixsystems.com

Ticket Details
===================
Ticket ID: YKW-368842
Department: RMA
Priority: Normal
Status: Open
Any update here? It's been almost a month since this bug was opened, and we still don't have the machine back in service.
Host is powered on and IPMI is pingable.  However I cannot ping/ssh foopy87.p7.releng.scl1.mozilla.com.  Network configs might have been erased?
:vinh, check the mac address. usually we only RMA the systemboard while the drives remain in the chassis so it *shouldn't* be affected.
MAC address for eth0 and mgmt updated in inventory.  Reimaging host, will check back once completed.
Host finished reimaging and is now reachable.

vhua$ ping foopy87.p7.releng.scl1.mozilla.com
PING foopy87.p7.releng.scl1.mozilla.com (10.12.134.23): 56 data bytes
64 bytes from 10.12.134.23: icmp_seq=0 ttl=58 time=9.172 ms
64 bytes from 10.12.134.23: icmp_seq=1 ttl=58 time=12.961 ms
64 bytes from 10.12.134.23: icmp_seq=2 ttl=58 time=11.783 ms
^C
--- foopy87.p7.releng.scl1.mozilla.com ping statistics ---
3 packets transmitted, 3 packets received, 0.0% packet loss
round-trip min/avg/max/stddev = 9.172/11.305/12.961/1.583 ms
vhua-07555:in-addr.arpa vhua$ ssh foopy87.p7.releng.scl1.mozilla.com
The authenticity of host 'foopy87.p7.releng.scl1.mozilla.com (10.12.134.23)' can't be established.
RSA key fingerprint is 00:ba:89:82:7d:eb:40:a8:e3:86:ae:de:be:e2:0d:42.
Are you sure you want to continue connecting (yes/no)?
Status: ASSIGNED → RESOLVED
Closed: 11 years ago
Resolution: --- → FIXED
Product: mozilla.org → Infrastructure & Operations
You need to log in before you can comment on or make changes to this bug.