Closed Bug 554206 Opened 10 years ago Closed 10 years ago

[k] Paginator throws UnicodeEncodeError on non-ascii (?) characters

Categories

(support.mozilla.org :: General, defect, P1, blocker)

Tracking

(Not tracked)

VERIFIED FIXED

People

(Reporter: paulc, Assigned: jsocol)

Details

The paginator doesn't seem to handle non-ASCII characters, e.g. URLs like:
/ru/search?w=3&q=%D0%A3%D1%81%D1%82%D0%B0%D0%BD%D0%BE%D0%B2%D0%BA%D0%B0
/fr/search?w=3&q=Gloss%C3%A1rio
We should be using flatqs to build the URL anyway, in which case this problem can be fixed as part of flatqs.
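
For illustration, here is a minimal sketch of building a pagination URL whose query string contains non-ASCII text. It uses only the Python 3 standard library and is not the flatqs or kitsune code; the helper name `page_url` and the `page` parameter are assumptions made up for this example.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def page_url(path, params, page):
    """Hypothetical helper: return `path` plus a percent-encoded query string.

    `params` holds the current query parameters; `page` is the page to link to.
    """
    query = dict(params, page=page)
    # urlencode() encodes str values as UTF-8 and percent-escapes them,
    # so the resulting URL contains only ASCII characters.
    return path + "?" + urlencode(query)


url = page_url("/ru/search", {"w": 3, "q": "Установка"}, page=2)

# The URL is plain ASCII, and the original query text survives a round trip.
url.encode("ascii")
decoded = parse_qs(urlsplit(url).query)
```

The point of the sketch is that the text is encoded to UTF-8 bytes before it is percent-escaped, rather than being coerced to ASCII, which is the step where a UnicodeEncodeError would typically surface.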

I've found these resources that should help:
http://stackoverflow.com/questions/804336/best-way-to-convert-a-unicode-url-to-ascii-utf-8-percent-escaped-in-python
http://stackoverflow.com/questions/1077564/python-unicode-unicodeencodeerror
Assignee: nobody → james
Severity: normal → major
Priority: -- → P1
Duplicate of this bug: 557453
I've been working on this, but every accented query that is guaranteed to return more than one page of results runs into bug 554210.
Depends on: 554210
I did push a new version of flatqs that should handle UTF-8 a little better:

http://github.com/jsocol/flatqs/commit/fc0b5d89cc78b894eaf73c48dde81d0f197c7a15
Severity: major → blocker
Wooot!

http://github.com/jsocol/kitsune/commit/240c9469ee4f5928b2ca1b85e7b15f36229ed778

Build #28.
Status: NEW → RESOLVED
Closed: 10 years ago
Resolution: --- → FIXED