Closed Bug 444749 Opened 16 years ago Closed 15 years ago

Complex or long searches are timing out

Categories

(Socorro :: General, task, P1)

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: smichaud, Assigned: morgamic)

Details

For the last day or so, every search I've performed at
http://crash-stats.mozilla.com/ has returned "Error 500. Internal
Server Error".

This includes even the simplest searches -- like for topcrashers.
OS: Linux → All
Hardware: PC → All
The database is pretty heavily loaded and we have a few ideas on reducing the load.  However, it will probably be next week by the time we put them into production.
Assignee: server-ops → aravind
The socorro servers have been badly overloaded for months.  But it's
only in the last couple of days that (as far as I can tell) they've
become completely non-functional.

Is something down?

Has the server setup been changed recently?
I wouldn't say they are completely non-functional. It's the aggregate reporting that's broken currently. Viewing individual reports works just fine.

We have been revamping the back-end recently and some of the database tables have gotten out of control - that's the reason for the recent problems with some of the reporting.
(In reply to comment #3)

OK, I stand corrected.  I just redid a search by bug ID that worked a
couple of days ago ... and it still works.

Probably the largest volume of searches is by bug ID (from
about:crashes), so it's good they're working.  But it'd still be nice
to get the other searches working again, even if only sub-optimally.

Good luck!  I'm glad I'm not the one who has to fix this :-)
Summary: Socorro servers returning Error 500 on every search → Socorro servers returning Error 500 on every aggregate search
No longer blocks: 445244
No longer blocks: 445246
Waiting on code fixes from morgamic.
Assignee: aravind → morgamic
QA Contact: justin → mrz
Assignee: morgamic → aravind
(In reply to comment #6)
> Waiting on code fixes from morgamic.
> 
This comment was almost a month ago, any updates/other bugs to watch?
We have code updates, aggregate tables and such.  Waiting on me to push this stuff out.
hi, aravind.  any ETA on your push?

thanks!
This bug is very odd and has now existed for one and a half months. There are a lot of crash reports and no way to search the database. If we really want to decrease the number of crashes on trunk we should fix it ASAP. There is no crash analysis possible at the moment. Nearly everything is returning error 500, even the search for top crashers.

Any chance to push all the stuff out within the next days?
We are working on it and the code is currently in our staging environment. Pushing this to production will involve some downtime and stuff. We will probably push this out later this week, or next week if it doesn't make it this week.

I understand the frustration that there are no aggregate reports available, but there are other changes to the app (like crash report throttling, etc) that take priority.
The Pylons version was replaced with a PHP version written by Les Orchard (webdev).  Most of the aggregate queries work properly now.  There are some known bugs with the new version, and we're working on those as well -- but please file a bug if you see something.  Resolving for now.

Known:
1) missing bonsai links
2) topcrashers pages returning strange results (issue with topcrasher cron)
3) larger queries still take a long time (need to move the query page over to a topcrashers summary table; see the sketch below)
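
For context on item 3: a summary table trades per-report rows for pre-aggregated counts, so the query page can read a handful of rollup rows instead of scanning millions of raw reports. A minimal sketch of the idea, using hypothetical names (this is not Socorro's actual schema):

    -- Hypothetical rollup: one row per (signature, product, version, day)
    -- instead of one row per crash report.
    CREATE TABLE topcrashers_summary (
        signature   text    NOT NULL,
        product     text    NOT NULL,
        version     text    NOT NULL,
        report_date date    NOT NULL,
        crash_count integer NOT NULL DEFAULT 0,
        PRIMARY KEY (signature, product, version, report_date)
    );

    -- The topcrasher cron mentioned above would periodically refresh it
    -- from the raw per-report table, e.g. for yesterday's reports:
    INSERT INTO topcrashers_summary
    SELECT signature, product, version,
           date_processed::date, count(*)
      FROM reports  -- the hypothetical raw crash-report table
     WHERE date_processed >= current_date - 1
       AND date_processed <  current_date
     GROUP BY signature, product, version, date_processed::date;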
Status: NEW → RESOLVED
Closed: 16 years ago
Resolution: --- → FIXED
cool....

looks like source line info is in the raw dump tab, for those looking for a workaround for problem #1 above
morgamic: I'm still seeing 500 errors whenever a report selects a sufficient amount of data. Should I file a new bug? This one is in mozilla.org:server operations, but should I file the new one in webtools:socorro?
I also am unable to view any crash reports. From the topcrashers dashboard, clicking on the link to any crash report will just spin and time out. Should this bug be reopened?
Here's a more specific example:

1) go to http://crash-stats.mozilla.com/?do_query=1&product=Firefox&version=Firefox%3A3.0.2pre&query_search=signature&query_type=contains&query=&date=&range_value=2&range_unit=weeks
2) click on a report (e.g. MultiByteToWideChar)
3) Site times out, and page shows: Connection Interrupted

"The connection to the server was reset while the page was loading.

The network link was interrupted while negotiating a connection. Please try again."

4) open error console: 

Error: parsers is undefined
Source File: http://crash-stats.mozilla.com/js/jquery/plugins/ui/jquery.tablesorter.min.js
Line: 2
Assignee: aravind → nobody
Component: Server Operations → Socorro
Product: mozilla.org → Webtools
QA Contact: mrz → socorro
Summary: Socorro servers returning Error 500 on every aggregate search → Socorro queries are timing out
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
The report list is of concern - that shouldn't be timing out.  Reopening this to address that issue.  Will file another bug for the topcrasher byversion and bybranch pages separately.
Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.1b1pre) Gecko/20080908024232 SeaMonkey/2.0a1pre

With the query link in comment #17 (just clicking the link above) I get:

/!\  Connection Interrupted

     The document contains no data.

     The network link was interrupted while negotiating a connection. Please try again.

     [Try again]

Clicking "Try again" brings the same error page up again (after the timeout). It isn't a modem disconnection since I can post to Bugzilla. Clearing the error console then trying again leaves the error console empty.

OTOH I could reach crash-stats.mozilla.org tonight for an _individual_ crash report, bp-ae5e6623-7df7-11dd-abe9-001a4bd43e5c .

Clicking the "Find a report" link at top and then querying for SeaMonkey crashes filed within one day (there's at least mine) at http://crash-stats.mozilla.com/?do_query=1&product=SeaMonkey&query_search=signature&query_type=contains&query=&date=&range_value=1&range_unit=days brings up 5 signatures: one (23 reports) on Windows, three (1 report each) on Mac, and mine on Linux.

However, "Top Crashers" for SeaMonkey 2.0a1pre brings up 3 signatures, all with the same build ID, and all with zero crashes in all 4 columns (Total, Win, Lin, Mac). This looks suspect to me but I suppose it would be a different bug.

After all that, I see a number of irrelevant warnings and errors in the Error Console, and 4 warnings (no errors) pertaining to crash-stats.mozilla.org:
/!\ Warning: Error in parsing value for 'filter'.  Declaration dropped.
    Source File: http://crash-stats.mozilla.com/css/flora/flora.slider.css
    Line: 4
/!\ Warning: Error in parsing value for 'filter'.  Declaration dropped.
    Source File: http://crash-stats.mozilla.com/css/flora/flora.tabs.css
    Line: 82
/!\ Warning: Expected ',' or '{' but found 'html'.  Ruleset ignored due to bad selector.
    Source File: http://crash-stats.mozilla.com/css/flora/flora.datepicker.css
    Line: 37
/!\ Warning: Error in parsing value for 'filter'.  Declaration dropped.
    Source File: http://crash-stats.mozilla.com/css/flora/flora.datepicker.css
    Line: 174
Target Milestone: --- → 0.6
morgamic: what is the bug number for the top crasher reports issues?
Getting that filed shortly, I'll update this bug.
Blocks: 454640
(In reply to comment #22)
> Getting that filed shortly, I'll update this bug.


Bug 454640 filed
These are not only searches for top crasher reports but also for normal stack frames. I never get a result when searching for a specific stack frame.
Status: REOPENED → NEW
Summary: Socorro queries are timing out → Socorro queries are timing out (Connection Interrupted)
Searching for a stack frame is still not working.
Severity: major → blocker
Blocks: 454872
PHP + Memcache + ProxyCache + Cluster should mean we're in good shape.  Reopen with URLs if you are still experiencing timeouts.
Status: NEW → RESOLVED
Closed: 16 years ago
Resolution: --- → FIXED
I'd narrow it down to a shorter time frame or just Firefox release versions. Link worked for me.
Don't know what to tell you -- I'm going to make fixing search response times for queries like this a high priority.
It probably worked for you because I had already warmed up the caches, right?
Yea, probably. We'll keep this open, but it's going to be low priority because queries for an entire branch over an entire month for a string match are fairly uncommon. We're looking at Q1 or Q2 next year unless it blocks something big.
Summary: Socorro queries are timing out (Connection Interrupted) → Complex or long searches are timing out
As morphed, this probably no longer blocks bug 454640 or bug 454872.
It does block me from working on bug 454872.
It still times out when searching for stack frames.
Depends on: 465360
Mats - we'll spend time on bug 465360 which should fix the stack frame search perf issues.
No longer blocks: 454872
Depends on: 432450
Assignee: nobody → morgamic
Status: REOPENED → ASSIGNED
Also note that the partitioning bug 432450 will increase performance and prevent costly scans.
Depends on: 465657
Priority: -- → P1
Depends on: 466103
Blocks: 468727
morgamic: It seems that now even non-complex queries are timing out. There's a couple other bugs on file about this now... bug 468405 and bug 474037. I think these are all the same, but you'd know better... We really need Socorro queries working. :-/
I hear you. First thing I'd like to do is speed up deploying the partitioning work. That is going to happen in the next week or so, and if we can get it done sooner rather than later it'd alleviate most of these issues.

We have other options -- we will have to discuss these first, though. We will have an update by Wednesday morning so we can get unstuck.
Can we consider both a short- and a long-term solution?
The short-term ugly solution is to just copy the whole setup over to a mirror so we can query history (January 20 and earlier). Basically I don't usually need the very latest up-to-the-minute crashes, and I'm willing to go to a different server for those queries.

Long term we can circle back to fixing the whole system so we don't need that bifurcation. But it seems to me that enabling crash-fixing work is more important than making everyone wait for the elegant solution.

Of course, I probably am spewing rubbish since I don't know how this is set up. Just saying an ugly solution would work fine for most urgent work right now.
Blocks: 432397
rather bad again today
For some time (weeks? months?) I've only been able to get results when restricting searches to the last hour.
This is blocking bug 468727, which itself is blocking the release. It's a very common crash for any HP laptop user with a fingerprint reader. We can't get help from the vendor, so we must find out what the regression range is.

I haven't yet heard why the entire database can't just be copied to a mirror that doesn't take in new crashes. We need to be able to do some crash archeology.
This should be fixed as part of database partitioning that will start tomorrow night and go (potentially) through Friday. Try on Monday next week and see if you still have issues.

See also: http://blog.mozilla.com/webdev/2009/01/20/socorro-database-partitioning-is-coming/
(In reply to comment #42)
> I haven't yet heard why the entire database can't just be copied to a mirror
> that doesn't take in new crashes. We need to be able to do some crash
> archeology.

The database is huge, and the people that would be needed to make something like that happen are actively working on the partitioning fix mentioned above, which should be the real fix here.
Also, it's not clear to me that making a readonly copy of the db would make queries any better. The problem is simply that the database grows too large, and postgres can't cope. The new partitioning scheme should ensure that no partition gets too large, such that the query planner can execute efficient queries no matter how many total crash reports we accumulate.
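
To make that concrete: in the PostgreSQL 8.x era, partitioning like this was typically implemented with table inheritance plus CHECK constraints, letting the planner prove a child table cannot match a date-bounded query and skip it entirely. A minimal sketch of the technique, using illustrative table and column names rather than Socorro's actual schema:

    -- Parent table; individual crash reports live in date-ranged children.
    CREATE TABLE reports (
        id             serial,
        signature      text,
        product        text,
        version        text,
        date_processed timestamp NOT NULL
    );

    -- One child per week; the CHECK constraint is what lets the planner
    -- exclude partitions outside the queried date range.
    CREATE TABLE reports_20090119 (
        CHECK (date_processed >= '2009-01-19' AND date_processed < '2009-01-26')
    ) INHERITS (reports);
    CREATE INDEX reports_20090119_date ON reports_20090119 (date_processed);

    -- With constraint exclusion enabled, a date-bounded query scans only
    -- the matching children, no matter how much total history exists.
    SET constraint_exclusion = on;
    SELECT signature, count(*)
      FROM reports
     WHERE date_processed >= '2009-01-19'
       AND date_processed <  '2009-01-21'
     GROUP BY signature;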
All good news.

Question: given "we’re going to chunk only the most recent four weeks of data and leave the rest as a single oversize partition." (which I think implies 5 partitions), is it believed that a reasonably scoped query will be able to search as much as 60 days, or even the 120 days of history mentioned in http://blog.mozilla.com/webdev/2009/01/20/socorro-database-partitioning-is-coming/ ?

I was able to get a range for Bug 468727 using "occurring before". The search required a range of more than 60 days back from today.
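
Continuing the illustrative sketch above: under a "four weekly partitions plus one oversize catch-all" layout, constraint exclusion answers the 60-day question mechanically. A query confined to the chunked weeks is pruned to a few small children, while anything reaching further back must also scan the entire legacy partition, which is presumably why long ranges stay slow until the old data is rechunked:

    -- Catch-all child for everything older than the chunked weeks.
    CREATE TABLE reports_old (
        CHECK (date_processed < '2009-01-05')
    ) INHERITS (reports);

    -- Pruned to two weekly children: cheap.
    SELECT count(*) FROM reports
     WHERE date_processed >= '2009-01-12' AND date_processed < '2009-01-26';

    -- Reaches past the chunked weeks, so the planner must also scan the
    -- whole oversize reports_old partition: still expensive.
    SELECT count(*) FROM reports
     WHERE date_processed >= '2008-12-01' AND date_processed < '2009-01-26';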
Something happened towards the end of last week (on Thursday and
Friday), and things are now marginally better.  But they're still
pretty bad.

Over the weekend (when the Socorro servers are presumably less busy),
I got a fairly complex search (from bug 476292 comment #0) to work
over a range of 96 hours.  But now I seem to max out at 8 hours:

http://crash-stats.mozilla.com/?do_query=1&product=Firefox&query_search=stack&query_type=exact&query=nsCocoaWindow%3A%3AShow%28int%29&date=&range_value=8&range_unit=hours

By the way, neither of these searches works (or worked) as-is.  In
both cases I searched over 1 hour, then 2 hours, then 4 hours (and so
forth).  This is cumbersome, but better than nothing.

There's still room for improvement, though :-)
(In reply to comment #47)
> Something happened towards the end of last week (on Thursday and
> Friday), and things are now marginally better.  

partitioning happened, Thurs-Sat iirc, or at least one part of it.
http://blog.mozilla.com/webdev/ -- bug 432450


> Over the weekend (when the Socorro servers are presumably less busy),
> I got a fairly complex search (from bug 476292 comment #0) to work
> over a range of 96 hours.  But now I seem to max out at 8 hours:
> 
> http://crash-stats.mozilla.com/?do_query=1&product=Firefox&query_search=stack&query_type=exact&query=nsCocoaWindow%3A%3AShow%28int%29&date=&range_value=8&range_unit=hours

8 hours worked straight off for me. Impressive. 12hr too, but 18hr fails. Still, that's a pretty hefty improvement.

However, a simple top-of-stack, 3-week query fails: http://crash-stats.mozilla.com/?do_query=1&product=Thunderbird&query_search=signature&query_type=contains&query=nsMsgLocalMailFolder%3A%3AAddMessage&date=&range_value=3&range_unit=weeks
The irony is that partitioning didn't actually happen.  The same trouble that's causing these queries to take so long and/or fail also thwarted the queries that repartitioning required.  The database was restored from backup on Friday night as it became apparent that repartitioning could not happen in a reasonable amount of time.

We will reattempt partitioning after we resolve some of the database maintenance difficulties that stymied us.
(In reply to comment #48)

> 8 hours worked straight off for me.

This is probably because my search was still cached.

> 12hr too. but 18hr fails. still that's a pretty hefty improvement.

This I can't explain, though.  I tried 12 hours a couple of times
before I gave up.  Maybe it's just random variations in the degree to
which the servers are busy.

(In reply to comment #49)

Oh, well ... a bit of the placebo effect, I suppose :-)

Thinking that repartitioning was done (at least partly done) made me
try harder to get searches to work.
Update here regarding partitioning:
http://blog.mozilla.com/webdev/2009/02/02/socorro-partitioning-rolled-back/

Next attempt will be Thursday. After that push, new data will be in separate partitions, which will allow us to repartition older data offline and restore it to the operational database. That should help us avoid extended downtimes in the future.
(Carrying over duplicate nomination from bug 422908)
Flags: blocking1.9.1?
Flags: blocking1.9.1? → blocking1.9.1-
Not aware of any current timeouts.  Marking as FIXED.
Status: ASSIGNED → RESOLVED
Closed: 16 years ago → 15 years ago
Resolution: --- → FIXED
Component: Socorro → General
Product: Webtools → Socorro