Closed Bug 909790 Opened 12 years ago Closed 12 years ago

Paas connection issues

Categories

(Infrastructure & Operations :: IT-Managed Tools, task)

task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: andy+bugzilla, Assigned: cturra)

Details

My Stackato instance at https://metaplace.paas.allizom.org/ intermittently returns:

An error has occurred: {"code":"ECONNREFUSED","errno":"ECONNREFUSED","syscall":"connect"}

The Stackato console at http://api.paas.allizom.org/console/users/amckay%40mozilla.com/apps/metaplace/ says it's started, but also says it's down. This is an instance that I've had running for a while and just ignored.
Assignee: server-ops → server-ops-webops
Component: Server Operations → WebOps: IT-Managed Tools
Product: mozilla.org → Infrastructure & Operations
QA Contact: shyam → nmaul
http://newrelic-irc-notify.paas.allizom.org/ is doing the same thing. A side note is that it shouldn't be responding at all as the console reports it as down. I've tried shutting it down again from the CLI to no avail. I can start a new instance, but then 2 are running.
http://json-ip.paas.allizom.org/ is doing the same thing. Additionally, here's the output from the command line tools:

$ stackato stats json-ip
json-ip
+----------+-------------+----------------+--------------+---------------+
| Instance | CPU (Cores) | Memory (limit) | Disk (limit) | Uptime        |
+----------+-------------+----------------+--------------+---------------+
| 0        | 0.0% (2)    | 110.0M (512M)  | 33.3M (2G)   | 2d:1h:27m:24s |
+----------+-------------+----------------+--------------+---------------+

$ stackato instances json-ip
+-------+-------+--------------------+
| Index | State | Start Time         |
+-------+-------+--------------------+
| 0     | DOWN  | 08/27/2013 10:58AM |
+-------+-------+--------------------+

$ stackato instances json-ip
+-------+-------+--------------------+
| Index | State | Start Time         |
+-------+-------+--------------------+
| 0     | DOWN  | 08/27/2013 10:58AM |
+-------+-------+--------------------+

$ stackato instances json-ip
+-------+---------+--------------------+
| Index | State   | Start Time         |
+-------+---------+--------------------+
| 0     | RUNNING | 08/25/2013 09:31AM |
+-------+---------+--------------------+

$ stackato stats json-ip
json-ip
+----------+-------------+----------------+--------------+---------------+
| Instance | CPU (Cores) | Memory (limit) | Disk (limit) | Uptime        |
+----------+-------------+----------------+--------------+---------------+
| 0        | 0.0% (2)    | 110.0M (512M)  | 33.3M (2G)   | 2d:1h:27m:54s |
+----------+-------------+----------------+--------------+---------------+
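The disagreement between successive `stackato instances` calls above (DOWN, DOWN, then RUNNING for the same index) can be checked mechanically. The following is a minimal Python sketch and not part of the original report. It assumes the CLI prints its table in the layout shown above, and `parse_instances` and `is_flapping` are hypothetical helper names introduced only for illustration.

```python
import re

# One data row of the table, e.g. "| 0 | DOWN | 08/27/2013 10:58AM |".
# The header row is skipped because its first cell is not a number,
# and border rows are skipped because they start with "+".
ROW = re.compile(r"^\|\s*(\d+)\s*\|\s*(\w+)\s*\|\s*(.+?)\s*\|$")


def parse_instances(table_text):
    """Return (index, state, start_time) tuples from one captured table."""
    rows = []
    for line in table_text.splitlines():
        match = ROW.match(line.strip())
        if match:
            rows.append((int(match.group(1)), match.group(2), match.group(3)))
    return rows


def is_flapping(snapshots):
    """True if any instance index shows more than one state across snapshots."""
    states_by_index = {}
    for snapshot in snapshots:
        for index, state, _start in parse_instances(snapshot):
            states_by_index.setdefault(index, set()).add(state)
    return any(len(states) > 1 for states in states_by_index.values())
```

To use it, capture the text of several consecutive `stackato instances <app>` runs and pass them as a list of strings to `is_flapping`. A `True` result means the reported state changed between runs, as it did in the output above.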
last week i made a change to the stackato router to support websockets (see bug 907407). i am suspicious about these being related to that work.
i have restarted the `router2g` service, which appears to have resolved these issues. we are also working with activestate to review any of the logs to see if we can track down the root cause. can i have each of you test that your applications are functioning correctly again?
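Because the failure is intermittent, a single page load is a weak test. A minimal Python sketch of a repeated probe follows. It is not from the original thread, and it assumes the failure shows up as the string `ECONNREFUSED` in the response body, as in the error quoted in the report. The names `classify`, `fetch`, and `probe` and the label strings are hypothetical.

```python
import time
import urllib.error
import urllib.request
from collections import Counter


def classify(status, body):
    """Label one response; the ECONNREFUSED body check is an assumption."""
    if "ECONNREFUSED" in body:
        return "router-econnrefused"
    return f"http-{status}"


def fetch(url, timeout=5.0):
    """Return (status, body) for one GET, including for HTTP error statuses."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status, response.read().decode("utf-8", "replace")
    except urllib.error.HTTPError as exc:
        return exc.code, exc.read().decode("utf-8", "replace")


def probe(url, attempts=20, delay=1.0, fetcher=fetch):
    """Request url repeatedly and tally the outcome labels."""
    tally = Counter()
    for attempt in range(attempts):
        try:
            status, body = fetcher(url)
            tally[classify(status, body)] += 1
        except OSError as exc:
            # Covers URLError, refused connections and timeouts.
            tally[type(exc).__name__] += 1
        if attempt < attempts - 1:
            time.sleep(delay)
    return tally
```

For example, `probe("https://metaplace.paas.allizom.org/")` returns a `Counter` of outcomes. Any `router-econnrefused` entries would suggest the problem is still present.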
Flags: needinfo?
Working for my instances.
This is occurring again today.
Flags: needinfo?
(In reply to Andy McKay [:andym] from comment #6)
> This is occurring again today.

i have restarted the router2g process again on the stackato cloud controller and it looks to have sorted itself. i am going to follow up with activestate on this to see if we can track down the root cause.
A slightly-different symptom that may warrant a new bug: http://flue.paas.allizom.org/ is now 404ing on all requests.
:chuck - the flue application doesn't appear to be running. this would be a result of the work i did last night on the dea nodes. see the following email thread on that:

https://mail.mozilla.org/pipermail/paas-users/2013-August/000073.html

you can solve this by restarting your application.
after seeing a number of bugs like this one get opened after i made the router2g changes last week, i have reverted back to the default router in stackato 2.6. it appears to have cleared up these issues for others and i suspect addresses this bug also. you can see more details in my comments on bug 910931. marking this bug as r/fixed. please do reopen however if these connection issues persist.
Assignee: server-ops-webops → cturra
Status: NEW → RESOLVED
Closed: 12 years ago
Resolution: --- → FIXED