Closed Bug 665251 Opened 14 years ago Closed 14 years ago

500 / Internal Server Errors on Elastic-Search-running staging

Categories

(Socorro :: General, task)

task
Not set
critical

Tracking

(Not tracked)

VERIFIED FIXED

People

(Reporter: stephend, Assigned: adrian)

References

Details

Attachments

(8 files)

Attached file Logfile
The new Elastic Search integration is throwing a ton of 500 Internal Server Errors on staging: https://crash-stats.allizom.org/products/Firefox Will include output from PowerFuzzer.
Assignee: nobody → adrian
This patch fixes the first programming error, and also fixes some security issues by encoding the data that goes into the query_string. This was tested on khan with some of the URIs given by stephend, it seems to be ok now. Btw http://agaudebert.khan.mozilla.org/query is supposed to be the same as crash-stats.allizom.org so you can test here before the patch is applied.
Attachment #540203 - Flags: review?(rhelmer)
Comment on attachment 540203 [details] [diff] [review] Fixing bug 665251 + securing data Committed revision 3228.
Attachment #540203 - Flags: review?(rhelmer) → review+
@Stephen: did you test that again? Is this fixed?
adrian, if the patch is ready to be qa'd can you bump the bug to resolved/fixed
Yep! :)
Status: NEW → RESOLVED
Closed: 14 years ago
Resolution: --- → FIXED
(In reply to comment #3) > @Stephen: did you test that again? Is this fixed? I tested again on both https://crash-stats-dev.allizom.org/query and https://crash-stats.allizom.org/query, and there are tons of 500s, still. Reopening; on which should I be looking?
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
@Stephen: https://crash-stats.allizom.org should be OK now, can you run your tests against it? I have some problems with the Postgres part, queries time out, but the 500 problem should be fixed too... I'm working on it with jberkus, I'll let you know when it's ok. Hopefully tomorrow.
(In reply to comment #7) > @Stephen: https://crash-stats.allizom.org should be OK now, can you run your > tests against it? > > I have some problems with the Postgres part, queries time out, but the 500 > problem should be fixed too... I'm working on it with jberkus, I'll let you > know when it's ok. Hopefully tomorrow. http://pastebin.mozilla.org/1259479 is what I'm still seeing.
Adrian, status update? Stephend, should this block ship? We'd be shipping the pg version.
(In reply to comment #9) > Adrian, status update? > > Stephend, should this block ship? We'd be shipping the pg version. I don't fully understand the implications of shipping the Postgres version, so can't make that call. Do you mean we wouldn't be using elasticsearch at all, this time?
(In reply to comment #10) > (In reply to comment #9) > > Adrian, status update? > > > > Stephend, should this block ship? We'd be shipping the pg version. > > I don't fully understand the implications of shipping the Postgres version, > so can't make that call. Do you mean we wouldn't be using elasticsearch at > all, this time? We won't be using Elastic Search until the production boxes for it are ready, which will not be at least until tomorrow, so we have been planning to ship the new code with the configuration pointing everything to PG. After discussing with Adrian in irc, at least one of the problems here is a severe performance regression (using an unindexed column on the very large reports table in where/order by) and at least one other minor bug, so we're holding the release on this. He's working on it now.
Fix multiple things in the Postgres implementation: * Fix 500 errors due to programming errors * Optimize queries so it doesn't time out more than before Tested on khan ( http://agaudebert.khan.mozilla.org/query ) with Powerfuzzer, no more 500 Errors reported.
Attachment #542553 - Flags: review?(rhelmer)
Attachment #542553 - Flags: review?(rhelmer) → review+
Per conversation with adrian... this is ready to be resolved/fixed. Ready for QA
Status: REOPENED → RESOLVED
Closed: 14 years ago14 years ago
Resolution: --- → FIXED
Re-openned... power fuzzer is still discovering 500s. Adrian has the latest output.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
'build_id' and 'reason' were assumed to be strings when they actually could be lists. This patch fixes this problem, and even extends the functionalities by allowing to specify several build ids or crash reasons.
Attachment #542665 - Flags: review?(rhelmer)
Comment on attachment 542665 [details] [diff] [review] Fixing 500 Errors with Postgres (securing user inputs) Landed on trunk and branch: 3251, 3252
Attachment #542665 - Flags: review?(rhelmer) → review+
Bumping to resolved/fixed per a conversation on irc. The patch has landed on stage and is ready for QA.
Status: REOPENED → RESOLVED
Closed: 14 years ago14 years ago
Resolution: --- → FIXED
Reopening. I'm working with adrian to test this. The two behaviors we're seeing: 1. The fuzzer is still finding 500's, ex: https://crash-stats.allizom.org/query/query?query=%2Fe%00&do_query=1&query_type=simple 2. The query results table appears to be missing results for the platform counts.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Blocks: 667819
This patch fixes: * A problem with the platforms config leading to not counting for Windows * A regression that did not show the type of crash (browser, plugin, hang... ) * An issue with Apache refusing encoded slashes (%2F) in URLS
Attachment #543001 - Flags: review?(rhelmer)
Attachment #543001 - Flags: review?(rhelmer) → review+
Comment on attachment 543001 [details] [diff] [review] Fixing config + regression + 500 errors on stage Landed on trunk: 3255 And branch: 3256
Not really related to this bug, but I found 2 new bugs while testing my code: * crash_reason was called reason when calling the API, resulting in not filtering by crash reason ; * the API did not return the types of crashes (browser, hang, plugin... ) This patch fixes those issues.
Attachment #543033 - Flags: review?(rhelmer)
Attachment #543033 - Flags: review?(rhelmer) → review+
The mode "starts_with" for searching in plugins doesn't work, this patch fixes it.
Attachment #543209 - Flags: review?(lars)
Attachment #543209 - Flags: review?(lars) → review+
Comment on attachment 543209 [details] [diff] [review] Fix search in plugins committed 3260/3261
bumping to resolved/fixed per discussion in irc with adrian.
Status: REOPENED → RESOLVED
Closed: 14 years ago14 years ago
Resolution: --- → FIXED
FYI accidentally used this bug number when committing rev 3263/3264 for the related bug 668621, note to any future archaeologists :)
adrian and rhelmer, thanks for the incredible amount of work done over the last couple of days to get this bug squashed. QA verified, ran the powerfuzzer and 0 500 errors. UI looks great.
Status: RESOLVED → VERIFIED
Component: Socorro → General
Product: Webtools → Socorro
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: