Closed Bug 869213 Opened 11 years ago Closed 10 years ago

Enable NewRelic for Reps

Categories

(Infrastructure & Operations Graveyard :: WebOps: Engagement, task)

task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: hoosteeno, Assigned: bburton)

References

Details

Attachments

(1 file)

Reps dev (reps-dev.allizom.org) has code to support NewRelic, and it will be moving to stage/prod soon.

Please go ahead with the needed server changes and add them to NewRelic panel.

Related bug:
https://bugzilla.mozilla.org/show_bug.cgi?id=860266

Thanks!
Completed! Should be live in 30min or less... if not, let me know... we may need to manually kick Apache, I'm not entirely sure. :)
Assignee: server-ops-webops → nmaul
Status: NEW → RESOLVED
Closed: 11 years ago
Resolution: --- → FIXED
Had to revert this, it broke reps.mozilla.org. Details to come.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Okay so... traceback fun attached.

One working theory is that New Relic, while not in itself dependent on unidecode or django-uuslug, being loaded early in the wsgi file is somehow causing that to break.

:pmac and I have guessed that moving New Relic further down in the wsgi file may alleviate the problem. Perhaps after line 24: https://github.com/mozilla/remo/blob/master/wsgi/playdoh.wsgi#L25. This is a guess, not a guarantee of success.

I can say that, at the system level, neither bedrock nor the generic cluster (basket, mozillians) has a system-level package installed to provide this library. New Relic does not seem to require it (upstream does not indicate it's a dependency, either). I haven't examined vendor-local for this projects, but I suspect we'll find they don't include django-uuslug, and hence don't run into this issue... whatever it is.
CC'ing other folks that helped out with this troubleshooting, in case they have additional insight.
Flagging NEEDINFO, either for additional troubleshooting that can definitively pin down the cause and solution, or confirmation that the theoretical fix is in place and we can install the package and restart apache to test it.

Sadly we cannot test on dev or stage. The Engagement cluster uses 32-bit Seamicro Atom nodes for dev and stage, and thus far I have not been able to package New Relic up to work on them. It's not out of the realm of possibility, I just haven't gotten it to work yet. Bug 788675 is on our plate to replace those nodes.
Flags: needinfo?
Moving newrelic import after 'import manage' as suggested in comment 3, seems to fix the problem. :nemo tested that on a similar to production environment with success.

New code in prod, stage and dev servers.

:jakem can we try enabling NewRelic again?
Flags: needinfo?
Seems to be working!
Status: REOPENED → RESOLVED
Closed: 11 years ago11 years ago
Resolution: --- → FIXED
Can we also get NewRelic for reps-dev.a.o and reps.a.o?

Thanks!
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Can't do this until engagement dev and stage are moved off of the Seamicro Atom platform and onto something 64-bit. Blocking on the appropriate bug and triaging to the right component.
Assignee: nmaul → server-ops-webops
Component: Server Operations: Web Operations → WebOps: Engagement
Depends on: 788675
Product: mozilla.org → Infrastructure & Operations
Depends on: 964320
No longer depends on: 788675
Now that we moved to generic cluster I see reps.a.o and reps-dev.a.o in NR but they are not reporting any data.

Jake can you please take a look?
Flags: needinfo?(nmaul)
I'll look into this later today
Assignee: server-ops-webops → bburton
Flags: needinfo?(nmaul)
It's getting data now that it's on generic, it just gets so few requests there isn't much data.

By hammering it with curl I am seeing a spike in request rate in https://rpm.newrelic.com/accounts/263620/applications/3650909 , you can see the spike at 14:40 PST in http://note.io/1fFowAo

Same thing on staging, https://rpm.newrelic.com/accounts/263620/applications/3689774 you can see me hammering it at 14:50 PST, http://note.io/1fFpSLt
Status: REOPENED → RESOLVED
Closed: 11 years ago10 years ago
Resolution: --- → FIXED
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: