559965 - [amo] Database access denied sporadically

Reporter

Description

•

14 years ago

This is from python: OperationalError: (1045, "Access denied for user 'remora'@'10.8.70.201' (using password: YES)")

But I saw it in a php cron job too.  I saw it twice this morning and a lot at 3:29pm and 3:50pm.

Jeremy Orem [:oremj]

Comment 1

•

14 years ago

15:37 < oremj> it's because the queries are going through the load balancers
15:37 < oremj> and if a different load balancer picks up the ip
15:37 < oremj> it start originating from that host
15:38 < oremj> so need to do db grants for all of them


Sorry, I told Wil about this and forgot to mention it in #webdev.

Assignee: server-ops → jeremy.orem+bugs

Status: NEW → RESOLVED

Closed: 14 years ago

Resolution: --- → FIXED

Jeff Balogh (:jbalogh)

Reporter

Comment 2

•

14 years ago

I'm seeing this again for 10.8.70.201.

Jeff Balogh (:jbalogh)

Reporter

Updated

•

14 years ago

Status: RESOLVED → REOPENED

Resolution: FIXED → ---

Jeremy Orem [:oremj]

Comment 3

•

14 years ago

Fixed that now too. There shouldn't be anymore.

Status: REOPENED → RESOLVED

Closed: 14 years ago → 14 years ago

Resolution: --- → FIXED

Wil Clouser [:clouserw]

Comment 4

•

14 years ago

Got this at 1:28am:
Cron <root@ip-admin02> cd	/data/amo/www/addons.mozilla.org-remora/bin;	/usr/bin/python26 import-personas.py
"Can't connect to MySQL server on '10.2.70.147' (110)"

Got this at 8:33am:
Cron <root@ip-admin02> cd	/data/amo/www/addons.mozilla.org-remora/bin;	/usr/bin/python26 maintenance.py personas_adu
"Can't connect to MySQL server on '10.2.70.147' (110)"

Got this from zprod at 9:25am:
Error (EXTERNAL IP): /fr/thunderbird/api/1.2/list/featured/all/10/WINNT/3.0
OperationalError: (2003, "Can't connect to MySQL server on '10.8.70.19' (111)")

Jeremy Orem [:oremj]

Comment 5

•

14 years ago

The personas crons should be fixed now.  I think the last one was probably a fluke.

Wil Clouser [:clouserw]

Comment 6

•

14 years ago

We've gotten "Access denied for user 'personas'@'10.8.70.200' (using password: YES)" twice since yesterday.  It can connect not but doesn't have permissions.  Can we just clone all the access rules from the original boxes?

php -f maintenance.php l10n_rss
php -f maintenance.php l10n_stats
both failed at 3:01am with "Can't connect to MySQL server on '10.8.70.10'"

and we've had production zamboni errors complaining about the same thing all through the night, a total of 19 errors.

Are we bumping up against max_clients again?

Wil Clouser [:clouserw]

Updated

•

14 years ago

Status: RESOLVED → REOPENED

Resolution: FIXED → ---

Jeremy Orem [:oremj]

Comment 7

•

14 years ago

The personas permissions are fixed for real this time. Just ran both of those scripts to double check.  Not sure about the unable to connect errors.

Dave or Tim, do you have any graphs on what was going on with 10.8.70.200 @ 3:00am?

Wil Clouser [:clouserw]

Comment 8

•

14 years ago

bug 555880 says max_connections is 1500.  Was that carried over to the new servers?

Wil Clouser [:clouserw]

Comment 9

•

14 years ago

We've been experiencing this all weekend.  Can someone reply to comment 8?

Dave Miller [:justdave]

Assignee

Comment 10

•

14 years ago

Has absolutely nothing to do with max connections, this is a permission issue.  Apparently nobody set up the ACLs in MySQL when the app moved to Phoenix.  I'll poke.

Assignee: jeremy.orem+bugs → justdave

Dave Miller [:justdave]

Assignee

Comment 11

•

14 years ago

OK, looks like someone did attempt to set them up, but granted them individually for each load balancer relay IP rather than using the wildcard that gets all of them at once and allows for future expansion.  It should still work though.  This probably is the max clients thing then, which you could probably figure out better if you had your script actually report the error it gets instead of just saying it can't connect.

The max connections thing isn't going to get solved without the app fixing the queries it's running.  We decided that after the last discussion on this.  No matter how high we set max_connections, the app will always hit it when it does the queries that back everything up.

Wil Clouser [:clouserw]

Comment 12

•

14 years ago

This has started happening far more frequently since the move to phoenix.  We're working on fixing the scripts, but can you make sure the limit is still 1500 on these servers?

Dave Miller [:justdave]

Assignee

Comment 13

•

14 years ago

it's still talking to the same server.  The master database server didn't change.

Dave Miller [:justdave]

Assignee

Comment 14

•

14 years ago

mysql> show global variables like 'max_connections';
+-----------------+-------+
| Variable_name   | Value |
+-----------------+-------+
| max_connections | 1500  | 
+-----------------+-------+
1 row in set (0.00 sec)

Wil Clouser [:clouserw]

Comment 15

•

14 years ago

fwiw, Dave bumped up the value on the slaves from 1200 this morning, I haven't seen a problem since.

Dave Miller [:justdave]

Assignee

Comment 16

•

14 years ago

We had a per-ip connection limit on the zeus VIP that was actually interfering.  I bumped that from 1200 to 20000 just to remove it from the equation.  Seems to have fixed the problem as far as I can tell.  AMO has moved back to SJC since, which makes it moot anyhow (but good to have it fixed when we eventually move back to phx again).

Status: REOPENED → RESOLVED

Closed: 14 years ago → 14 years ago

Resolution: --- → FIXED

Nobody; OK to take it and work on it

Updated

•

11 years ago

Component: Server Operations: Web Operations → WebOps: Other

Product: mozilla.org → Infrastructure & Operations

BMO Automation

Updated

•

5 years ago

Product: Infrastructure & Operations → Infrastructure & Operations Graveyard

Bugzilla

Quick Search

[amo] Database access denied sporadically

Categories

(Infrastructure & Operations Graveyard :: WebOps: Other, task)

Tracking

(Not tracked)

People

(Reporter: jbalogh, Assigned: justdave)

References

Details

Crash Data

Security

(public)

User Story

Description

Comment 1

Comment 2

Updated

Comment 3

Comment 4

Comment 5

Comment 6

Updated

Comment 7

Comment 8

Comment 9

Comment 10

Comment 11

Comment 12

Comment 13

Comment 14

Comment 15

Comment 16

Updated

Updated