Bug 665251
Opened 14 years ago
Closed 14 years ago
500 / Internal Server Errors on Elastic-Search-running staging
Categories
(Socorro :: General, task)
Tracking
(Not tracked)
VERIFIED
FIXED
2.0
People
(Reporter: stephend, Assigned: adrian)
Attachments (8 files)
55.67 KB, text/plain
3.96 KB, patch (rhelmer: review+)
6.65 KB, patch (rhelmer: review+)
1.89 KB, patch (rhelmer: review+)
107.66 KB, image/png
6.80 KB, patch (rhelmer: review+)
2.37 KB, patch (rhelmer: review+)
2.62 KB, patch (lars: review+)
The new Elastic Search integration is throwing a ton of 500 Internal Server Errors on staging:
https://crash-stats.allizom.org/products/Firefox
Will include output from PowerFuzzer.
Updated (Assignee) • 14 years ago
Assignee: nobody → adrian
Comment 1 (Assignee) • 14 years ago
This patch fixes the first programming error, and also fixes some security issues by encoding the data that goes into the query_string.
This was tested on khan with some of the URIs given by stephend, and it seems to be OK now. By the way, http://agaudebert.khan.mozilla.org/query is supposed to be the same as crash-stats.allizom.org, so you can test there before the patch is applied.
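For readers unfamiliar with what "encoding the data that goes into the query_string" involves, here is a minimal sketch using Python 3's standard library. It is not the attached patch; the function name `build_query_string` and the sample parameters are assumptions made for illustration only.

```python
from urllib.parse import quote, urlencode


def build_query_string(params):
    """Percent-encode every key and value, then join them with '&'.

    Hypothetical helper, not code from the patch. Using quote_via=quote
    with safe="" means that even "/" is encoded (as %2F) and spaces
    become %20 rather than "+".
    """
    return urlencode(params, quote_via=quote, safe="")


# Characters that would otherwise alter the structure of the query
# string ("&", "=", spaces) are escaped.
print(build_query_string({"query": "a b&c=d", "do_query": "1"}))
```

The point of the sketch is only that user input never reaches the URL unescaped, so a value such as `a b&c=d` cannot inject an extra parameter.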
Attachment #540203 -
Flags: review?(rhelmer)
Comment 2 • 14 years ago
Comment on attachment 540203
Fixing bug 665251 + securing data
Committed revision 3228.
Attachment #540203 -
Flags: review?(rhelmer) → review+
Comment 3 (Assignee) • 14 years ago
@Stephen: did you test that again? Is this fixed?
Comment 4 • 14 years ago
adrian, if the patch is ready to be QA'd, can you bump the bug to resolved/fixed?
Comment 5 (Assignee) • 14 years ago
Yep! :)
Status: NEW → RESOLVED
Closed: 14 years ago
Resolution: --- → FIXED
Comment 6 (Reporter) • 14 years ago
(In reply to comment #3)
> @Stephen: did you test that again? Is this fixed?
I tested again on both https://crash-stats-dev.allizom.org/query and https://crash-stats.allizom.org/query, and there are tons of 500s, still.
Reopening; on which should I be looking?
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Comment 7 (Assignee) • 14 years ago
@Stephen: https://crash-stats.allizom.org should be OK now, can you run your tests against it?
I have some problems with the Postgres part, queries time out, but the 500 problem should be fixed too... I'm working on it with jberkus, I'll let you know when it's ok. Hopefully tomorrow.
Comment 8 (Reporter) • 14 years ago
(In reply to comment #7)
> @Stephen: https://crash-stats.allizom.org should be OK now, can you run your
> tests against it?
>
> I have some problems with the Postgres part, queries time out, but the 500
> problem should be fixed too... I'm working on it with jberkus, I'll let you
> know when it's ok. Hopefully tomorrow.
http://pastebin.mozilla.org/1259479 is what I'm still seeing.
Comment 9 • 14 years ago
Adrian, status update?
Stephend, should this block ship? We'd be shipping the pg version.
Comment 10 (Reporter) • 14 years ago
(In reply to comment #9)
> Adrian, status update?
>
> Stephend, should this block ship? We'd be shipping the pg version.
I don't fully understand the implications of shipping the Postgres version, so can't make that call. Do you mean we wouldn't be using elasticsearch at all, this time?
Comment 11 • 14 years ago
(In reply to comment #10)
> (In reply to comment #9)
> > Adrian, status update?
> >
> > Stephend, should this block ship? We'd be shipping the pg version.
>
> I don't fully understand the implications of shipping the Postgres version,
> so can't make that call. Do you mean we wouldn't be using elasticsearch at
> all, this time?
We won't be using Elastic Search until the production boxes for it are ready, which will not be until tomorrow at the earliest, so we have been planning to ship the new code with the configuration pointing everything to PG.
After discussing with Adrian in IRC: at least one of the problems here is a severe performance regression (using an unindexed column on the very large reports table in WHERE/ORDER BY), and there is at least one other minor bug, so we're holding the release on this. He's working on it now.
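To illustrate why filtering and sorting on an unindexed column hurts on a very large table, here is a small sketch. It uses SQLite from Python's standard library as a stand-in for Postgres, and the column names (`date_processed`, `signature`) and index name are assumptions; the thread does not say which column was involved.

```python
import sqlite3


def query_plan(conn, sql, params=()):
    """Return the 'detail' text of SQLite's query plan for a statement."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    # The human-readable detail is the last column of every plan row.
    return " | ".join(row[-1] for row in rows)


conn = sqlite3.connect(":memory:")
# Hypothetical, heavily simplified stand-in for the real reports table.
conn.execute(
    "CREATE TABLE reports ("
    "id INTEGER PRIMARY KEY, date_processed TEXT, signature TEXT)"
)

sql = (
    "SELECT id, signature FROM reports "
    "WHERE date_processed > ? ORDER BY date_processed LIMIT 10"
)

# Without an index the planner has to scan the whole table and sort.
plan_before = query_plan(conn, sql, ("2011-06-01",))

# With an index on the filtered/sorted column it can walk the index instead.
conn.execute(
    "CREATE INDEX reports_date_processed_idx ON reports (date_processed)"
)
plan_after = query_plan(conn, sql, ("2011-06-01",))

print("before:", plan_before)
print("after: ", plan_after)
```

On Postgres the equivalent check would be done with `EXPLAIN`, and the actual fix may have been to rewrite the query to use an already-indexed column rather than to add an index.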
Comment 12 (Assignee) • 14 years ago
Fix multiple things in the Postgres implementation:
* Fix 500 errors due to programming errors
* Optimize queries so they don't time out more often than before
Tested on khan ( http://agaudebert.khan.mozilla.org/query ) with Powerfuzzer; no more 500 errors were reported.
Attachment #542553 -
Flags: review?(rhelmer)
Updated • 14 years ago
Attachment #542553 -
Flags: review?(rhelmer) → review+
Comment 13 • 14 years ago
Per conversation with adrian... this is ready to be resolved/fixed.
Ready for QA
Status: REOPENED → RESOLVED
Closed: 14 years ago → 14 years ago
Resolution: --- → FIXED
Comment 14 • 14 years ago
Reopened... Powerfuzzer is still discovering 500s. Adrian has the latest output.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Comment 15 (Assignee) • 14 years ago
'build_id' and 'reason' were assumed to be strings when they could actually be lists. This patch fixes the problem and also extends the functionality by allowing several build IDs or crash reasons to be specified.
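A minimal sketch of the kind of normalization described here, assuming a parameter may arrive either as a single string or as a list. The helper names `as_list` and `build_in_clause` are hypothetical and are not taken from the patch; the `%s` placeholders follow the psycopg2 parameter style.

```python
def as_list(value):
    """Normalize a parameter that may be None, a scalar, or a list."""
    if value is None:
        return []
    if isinstance(value, (list, tuple)):
        return [str(v) for v in value]
    return [str(value)]


def build_in_clause(column, value):
    """Return (sql_fragment, params) with one placeholder per value.

    `column` must be a trusted, hard-coded column name; only the values
    are passed as bound parameters.
    """
    values = as_list(value)
    if not values:
        return "", []
    placeholders = ", ".join(["%s"] * len(values))
    return f"{column} IN ({placeholders})", values


# A single build id and a list of build ids go through the same path.
print(build_in_clause("build_id", "20110615"))
print(build_in_clause("build_id", ["20110615", "20110616"]))
```

Treating the scalar case as a one-element list removes the string-only assumption and gives multi-value filtering without extra code.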
Attachment #542665 -
Flags: review?(rhelmer)
Comment 16 • 14 years ago
Comment on attachment 542665
Fixing 500 Errors with Postgres (securing user inputs)
Landed on trunk and branch:
3251, 3252
Attachment #542665 -
Flags: review?(rhelmer) → review+
Comment 17 • 14 years ago
Bumping to resolved/fixed per a conversation on irc. The patch has landed on stage and is ready for QA.
Status: REOPENED → RESOLVED
Closed: 14 years ago → 14 years ago
Resolution: --- → FIXED
Comment 18 • 14 years ago
Reopening. I'm working with adrian to test this. The two behaviors we're seeing:
1. The fuzzer is still finding 500s, e.g.: https://crash-stats.allizom.org/query/query?query=%2Fe%00&do_query=1&query_type=simple
2. The query results table appears to be missing results for the platform counts.
Updated • 14 years ago
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Comment 19 (Assignee) • 14 years ago
This patch fixes:
* A problem with the platforms config that led to Windows not being counted
* A regression that did not show the type of crash (browser, plugin, hang...)
* An issue with Apache refusing encoded slashes (%2F) in URLs
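On the encoded-slash item: one common workaround, sketched below, is to percent-encode a user-supplied value twice before placing it in a URL path, so that the server sees %252F rather than %2F, and to decode it twice on the application side. This is an assumption about a possible approach, not a description of what the attached patch does; Apache's `AllowEncodedSlashes` directive is another way to address the same symptom. The function names are hypothetical.

```python
from urllib.parse import quote, unquote


def encode_path_value(value):
    """Double percent-encode a value so it is safe in a URL path segment.

    After the first pass "/" becomes "%2F"; the second pass turns the
    "%" itself into "%25", giving "%252F".
    """
    return quote(quote(value, safe=""), safe="")


def decode_path_value(segment):
    """Reverse encode_path_value by decoding twice."""
    return unquote(unquote(segment))


encoded = encode_path_value("a/b")
print(encoded)                     # the slash is now double-encoded
print(decode_path_value(encoded))  # round-trips to the original value
```

Because both passes encode every reserved character, the round trip is lossless even for input that already contains a literal percent sign.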
Attachment #543001 -
Flags: review?(rhelmer)
Updated • 14 years ago
Attachment #543001 -
Flags: review?(rhelmer) → review+
Comment 20 • 14 years ago
Comment on attachment 543001
Fixing config + regression + 500 errors on stage
Landed on trunk: 3255
And branch: 3256
Comment 21 (Assignee) • 14 years ago
Not really related to this bug, but I found 2 new bugs while testing my code:
* crash_reason was called reason when calling the API, so results were not filtered by crash reason;
* the API did not return the types of crashes (browser, hang, plugin...)
This patch fixes those issues.
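A mismatch like crash_reason versus reason is easy to miss when unknown parameters are silently ignored. The sketch below shows one defensive pattern: reject filter names the API does not recognize. The set of filter names and the function `check_filters` are hypothetical, chosen only to illustrate the idea, and are not taken from the Socorro API.

```python
# Hypothetical list of filter names the API understands.
KNOWN_FILTERS = {"crash_reason", "build_id", "report_type"}


def check_filters(params):
    """Raise if the caller passes a filter name the API would ignore."""
    unknown = sorted(set(params) - KNOWN_FILTERS)
    if unknown:
        raise ValueError("unknown filter(s): " + ", ".join(unknown))
    return params


# The correct name passes through unchanged.
print(check_filters({"crash_reason": "EXCEPTION_ACCESS_VIOLATION"}))

# The misspelled name fails loudly instead of returning unfiltered results.
try:
    check_filters({"reason": "EXCEPTION_ACCESS_VIOLATION"})
except ValueError as exc:
    print(exc)
```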
Attachment #543033 -
Flags: review?(rhelmer)
Updated • 14 years ago
Attachment #543033 -
Flags: review?(rhelmer) → review+
Comment 22 (Assignee) • 14 years ago
The "starts_with" mode for searching in plugins doesn't work; this patch fixes it.
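As a rough illustration of what a "starts_with" search mode usually translates to in SQL, here is a sketch that turns a search term and a mode into a LIKE pattern. Only the "starts_with" mode name comes from this bug; the other mode names, the function name, and the escaping convention (backslash as the LIKE escape character) are assumptions.

```python
def like_pattern(term, mode):
    """Build a LIKE pattern for a search term and a search mode.

    Wildcard characters inside the term are escaped so that they match
    literally; the pattern is meant to be passed as a bound parameter.
    """
    escaped = (
        term.replace("\\", "\\\\")
        .replace("%", "\\%")
        .replace("_", "\\_")
    )
    if mode == "starts_with":
        return escaped + "%"
    if mode == "contains":  # hypothetical mode name
        return "%" + escaped + "%"
    if mode == "is_exactly":  # hypothetical mode name
        return escaped
    raise ValueError("unsupported search mode: " + mode)


print(like_pattern("Flash", "starts_with"))
print(like_pattern("100%_sure", "contains"))
```

An unsupported mode raises an error rather than falling back to a default, which makes a broken mode visible immediately.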
Attachment #543209 -
Flags: review?(lars)
Updated • 14 years ago
Attachment #543209 -
Flags: review?(lars) → review+
Comment 23 • 14 years ago
Comment on attachment 543209
Fix search in plugins
committed 3260/3261
Comment 24 • 14 years ago
bumping to resolved/fixed per discussion in irc with adrian.
Status: REOPENED → RESOLVED
Closed: 14 years ago → 14 years ago
Resolution: --- → FIXED
Comment 25 • 14 years ago
FYI accidentally used this bug number when committing rev 3263/3264 for the related bug 668621, note to any future archaeologists :)
Comment 26 • 14 years ago
adrian and rhelmer, thanks for the incredible amount of work done over the last couple of days to get this bug squashed. QA verified: ran Powerfuzzer and found zero 500 errors. UI looks great.
Status: RESOLVED → VERIFIED
Updated • 14 years ago
Component: Socorro → General
Product: Webtools → Socorro