Closed Bug 691315 Opened 14 years ago Closed 14 years ago

IntegrityError: Duplicate entry ___ for key PRIMARY during INSERT INTO wbo

Tracking

(Not tracked)

Status:

VERIFIED FIXED

People

(Reporter: Atoll, Assigned: rfkelly)

References

Details

(Whiteboard: [qa+][python regression])

Attachments

(6 files, 2 obsolete files)

Sync log 14 years ago Ladar Levison 11.44 KB, text/plain		Details
Screenshot showing request methods... 14 years ago Ladar Levison 23.27 KB, image/png		Details
White washed log showing a single sample error from sync attempt (there were several). 14 years ago Ladar Levison 386 bytes, text/plain		Details
White washed tcp dump of http request/response showing headers and failure response. 14 years ago Ladar Levison 875 bytes, text/plain		Details
patch to return 409-conflict on duplicate BSO insertion 14 years ago Ryan Kelly [:rfkelly] 5.40 KB, patch	telliott : review+	Details \| Diff \| Splinter Review
updated patch to return 409-conflict on duplicate BSO insertion 14 years ago Ryan Kelly [:rfkelly] 5.88 KB, patch	telliott : review- Atoll : feedback-	Details \| Diff \| Splinter Review
patch to allow overwriting of items with expired ttls 14 years ago Ryan Kelly [:rfkelly] 3.17 KB, patch	telliott : review+	Details \| Diff \| Splinter Review
updated patch to return 409-conflict on duplicate BSO insertion 14 years ago Ryan Kelly [:rfkelly] 6.29 KB, patch	telliott : review+	Details \| Diff \| Splinter Review

:Atoll

Reporter

Description

•

14 years ago

I think this is a missing "on duplicate key" exception handler in the sync server. Found in the logs, no further information available. 2011-10-03 06:10:12,312 ERROR [syncserver] acbaf5e4d13e65b77f9a24c9404cca81 2011-10-03 06:10:12,312 ERROR [syncserver] Traceback (most recent call last): File "/usr/lib/python2.6/site-packages/services/util.py", line 495, in __call__ return self.app(environ, start_response) File "/usr/lib/python2.6/site-packages/paste/translogger.py", line 68, in __call__ return self.application(environ, replacement_start_response) File "/usr/lib/python2.6/site-packages/webob/dec.py", line 147, in __call__ resp = self.call_func(req, *args, **self.kwargs) File "/usr/lib/python2.6/site-packages/webob/dec.py", line 208, in call_func return self.func(req, *args, **kwargs) File "/usr/lib/python2.6/site-packages/services/baseapp.py", line 191, in __notified response = func(self, request) File "/usr/lib/python2.6/site-packages/services/baseapp.py", line 270, in __call__ result = function(request, **params) File "/usr/lib/python2.6/site-packages/syncstorage/controller.py", line 289, in set_item res = storage.set_item(user_id, collection_name, item_id, **wbo) File "/usr/lib/python2.6/site-packages/syncstorage/storage/memcachedsql.py", line 232, in set_item storage_time=storage_time, **values) File "/usr/lib/python2.6/site-packages/syncstorage/storage/sql.py", line 609, in set_item return self._set_item(user_id, collection_name, item_id, **values) File "/usr/lib/python2.6/site-packages/syncstorage/storage/sql.py", line 593, in _set_item safe_execute(self._engine, query) File "/usr/lib/python2.6/site-packages/services/util.py", line 622, in safe_execute return engine.execute(*args, **kwargs) File "/usr/lib/python2.6/site-packages/sqlalchemy/engine/base.py", line 1788, in execute return connection.execute(statement, *multiparams, **params) File "/usr/lib/python2.6/site-packages/sqlalchemy/engine/base.py", line 1191, in execute params) File "/usr/lib/python2.6/site-packages/sqlalchemy/engine/base.py", line 1271, in _execute_clauseelement return self.__execute_context(context) File "/usr/lib/python2.6/site-packages/sqlalchemy/engine/base.py", line 1302, in __execute_context context.parameters[0], context=context) File "/usr/lib/python2.6/site-packages/sqlalchemy/engine/base.py", line 1401, in _cursor_execute context) File "/usr/lib/python2.6/site-packages/sqlalchemy/engine/base.py", line 1394, in _cursor_execute context) File "/usr/lib/python2.6/site-packages/sqlalchemy/engine/default.py", line 299, in do_execute cursor.execute(statement, parameters) File "/usr/lib/python2.6/site-packages/pymysql/cursors.py", line 108, in execute self.errorhandler(self, exc, value) File "/usr/lib/python2.6/site-packages/pymysql/connections.py", line 185, in defaulterrorhandler raise errorclass, errorvalue IntegrityError: (IntegrityError) (1062, u"Duplicate entry '1898290-11-a1267300151a5f92af84f73ea5b5907aef73de4e' for key 'PRIMARY'") 'INSERT INTO wbo90 (id, username, collection, modified, payload, payload_size, ttl) VALUES (%s, %s, %s, %s, %s, %s, %s)' (u'a1267300151a5f92af84f73ea5b5907aef73de4e', '1898290', 11, 131764741227, '{"ciphertext":"REDACTED==","hmac":"REDACTED"}', 251, 2100000000)

Tarek Ziadé (:tarek)

Comment 1

•

14 years ago

Yeah the on duplicate is present only when we POST batches, so the error occurs on a PUT of a single item in a collection. I am not quite sure to understand yet how we get to this status. Are you able to get the initial call log ?

:Atoll

Reporter

Comment 2

•

14 years ago

I think Zeus replicated the request when the sync server took too long to reply the first time. So it shouldn't be a common issue. We should probably still handle this particular error ('Object already exists') with Something other than 500 Server Error. Http1.1 spec suggest 409 Conflict would be the appropriate response, but I don't know what Sync API and Firefox expect.

Toby Elliott [:telliott]

Comment 3

•

14 years ago

409 would probably confuse the heck out of the frontend. The 'traditional' way we've handled this in reg-server is 400 (response 4). However, I'm not convinced that a 503 isn't OK here due to the obscurity, and we should consider restoring on-duplicate-key-updates to the PUT query.

Tarek Ziadé (:tarek)

Comment 4

•

14 years ago

(In reply to Toby Elliott [:telliott] from comment #3) ... > and we should consider restoring on-duplicate-key-updates to the PUT query. yes, filled bug 691409

Richard Newman [:rnewman]

Updated

•

14 years ago

OS: Mac OS X → All

Hardware: x86 → All

James Bonacci [:jbonacci]

Updated

•

14 years ago

Whiteboard: [qa+]

Toby Elliott [:telliott]

Updated

•

14 years ago

Group: services-infra

Toby Elliott [:telliott]

Comment 6

•

14 years ago

Ryan - let's backport this one as a bugfix to 1.1 as well, since we're seeing reports of the problem.

Assignee: nobody → rkelly

Whiteboard: [qa+] → [qa+][python regression]

Joachim Breuer

Comment 7

•

14 years ago

I'm seeing a number of these resulting in "Sync encountered an error while syncing: Unknown error. Sync will automatically retry this action." A retry (manual or automatic) does not clear the problem, the server log shows the same IntegrityErrors for multiple tries. This is with FF 10.0.1 (gentoo) against my own sync server; due to the problem I've just upgraded it to today's default tip - no change. The problem MAY have started with me entering saved passwords on two different synced browsers while one of them was disconnected from the sync - but this is after-the-fact conjecture. I can provide further information (logs etc.), just tell me what to do.

:Atoll

Reporter

Comment 8

•

14 years ago

(In reply to Joachim Breuer from comment #7) > I'm seeing a number of these resulting in "Sync encountered an error while > syncing: Unknown error. Sync will automatically retry this action." Hi Joachim. You'll need to file a completely new bug with information and logs attached - we won't be able to do anything about your problem report in this bug otherwise. Please file it under Mozilla Services / General. Thanks!

Ladar Levison

Comment 9

•

14 years ago

I'm seeing this error too. I suspect its because the other computers in the group have identical URLs in their history. The records probably just have different timestamps. Either way, the system should either 'replace', or preserve the most recent entry. Attaching log. I tried to only remove the user data, but if I grabbed something critical let me know.

Ladar Levison

Comment 10

•

14 years ago

Attached file Sync log — Details

Toby Elliott [:telliott]

Comment 11

•

14 years ago

Identical URLs should replace. The fact they're not is what's the bug here. What would actually be hugely useful here would be for someone to look at their access logs and see if the requests that are erroring out are coming in from PUTs or POSTs. If it's posts, then this is something new and unknown.

:Atoll

Reporter

Comment 12

•

14 years ago

My understanding is that some future version of the Sync storage server will log the request method and URI in backtraces, so that we have some hope of correlating (which we do not otherwise).

Joachim Breuer

Comment 13

•

14 years ago

This is causing at least some portion of the data not to be synced, see Bug #731515.

Joachim Breuer

Comment 14

•

14 years ago

@telliot: It appears to me the errors are happening out of POSTs. I only see GET and POST against mozsync, never a PUT.

Ladar Levison

Comment 15

•

14 years ago

Attached image Screenshot showing request methods... — Details

Ladar Levison

Comment 16

•

14 years ago

Attached file White washed log showing a single sample error from sync attempt (there were several). — Details

Ladar Levison

Comment 17

•

14 years ago

Attached file White washed tcp dump of http request/response showing headers and failure response. — Details

Ladar Levison

Comment 18

•

14 years ago

I captured a TCP dump showing the actual HTTP requests, and stored the associated sync error logs. It appears all of the requests are GET, POST and DELETES. I didn't see any PUTS. I tried to upload a representative sample as an attachment after removing the encrypted data fields. I will hold onto the raw files for a few days, so let me know if I should go looking for something in particular...