Closed Bug 525537 Opened 15 years ago Closed 14 years ago

Gloda search tokenizer should perform case-folding and accent-folding to be case-insensitive and accent-insensitive, also handle half-width katakana

Categories

(Thunderbird :: Search, defect)

defect
Not set
normal

Tracking

(blocking-thunderbird3.1 beta2+, thunderbird3.1 beta2-fixed)

RESOLVED FIXED
Thunderbird 3.1b2
Tracking Status
blocking-thunderbird3.1 --- beta2+
thunderbird3.1 --- beta2-fixed

People

(Reporter: shopik, Assigned: m_kato)

References

Details

(Whiteboard: [gloda key])

Attachments

(3 files, 5 obsolete files)

Doing the same search using Cyrillic letters but with different case, you get different results, while doing the same thing with English-only letters, gloda produces the same result.
I can't test this right now using trunk, because new tabs are broken, but this definitely exists on 3.0.
Yes, the only special processing we do for non-CJK characters is case-folding of ASCII characters (which the SQLite tokenizer already knew how to do).  If someone can take a crack at augmenting the tokenizer right away using libintl we might be able to fix this for 3.0, otherwise not.
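For reference, a minimal sketch of what locale-driven folding through the standard wide-character API looks like (illustrative only, not a proposed patch; behaviour depends on the active locale, and it assumes wint_t can hold the code point):

#include <stdint.h>
#include <wctype.h>

/* Fold one code point using the process's current LC_CTYPE locale.
 * Which non-ASCII characters actually get lowered varies per locale,
 * which is the main limitation of this approach. */
static uint32_t locale_fold(uint32_t c)
{
  if (c <= 0x7F)
    return (c >= 'A' && c <= 'Z') ? c + 0x20 : c;  /* ASCII fast path */
  return (uint32_t)towlower((wint_t)c);            /* locale-dependent */
}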
If that does not make it into the first 3.0 release, this should be noted in the Release Notes or in some other way, so users don't expect it to work correctly with languages other than English.
Keywords: relnote
What is not working? Case-insensitive search in locales other than en, or case-insensitive search with non-ASCII letters?
Searching is case-insensitive in Dutch, so it would seem that the problem is case-insensitive search with non-ASCII letters. The title of this bug should be changed to reflect this.

Tested with Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.6; en-US; rv:1.9.1.5) Gecko/20091121 Thunderbird/3.0.
Summary: Gloda search is case sensitive for languages other than English → Gloda search is case sensitive for non-ASCII letters
OS: Windows XP → All
Hardware: x86 → All
Whiteboard: [gloda key]
Could anyone attach a sample of mail text for testing, along with the expected result?

I am considering updating the fts3 module to treat full-width numbers and alphabetic characters as half-width and to handle case folding, so I need test cases for this, too.
Attached file sample of mail
Searching by the keyword "новое*" does not work.
Attached patch WIP v0.1 (obsolete) — Splinter Review
Work in progress.  This has only been tested on CentOS 5.4.
Assignee: nobody → m_kato
Attached patch patch v1 (obsolete) — Splinter Review
Attachment #423952 - Attachment is obsolete: true
Attachment #424942 - Flags: review?
Attachment #424942 - Flags: review? → review?(bugmail)
This fix includes the following (a small illustrative sketch follows the list):
- case-insensitive support for non-ASCII and full-width characters
- normalization of full-width alphabetic, numeric, and symbol characters (indexed as ASCII)
- normalization of half-width Katakana (indexed as full-width Katakana)
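To make those bullets concrete, here is a tiny illustrative sketch; the actual patch uses generated mapping tables rather than a hand-written switch, and the function name is made up:

#include <stdint.h>

/* Illustrative examples only -- the real fix is table-driven. */
static uint32_t normalize_char(uint32_t c)
{
  switch (c) {
    case 0x0410: return 0x0430;  /* Cyrillic 'А' case-folded to 'а'          */
    case 0xFF21: return 0x0061;  /* full-width 'Ａ' indexed as ASCII 'a'      */
    case 0xFF76: return 0x30AB;  /* half-width 'ｶ' indexed as full-width 'カ' */
    default:     return c;       /* everything else unchanged                */
  }
}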
Thank you for the patch!  Your continued efforts on creating/improving our tokenizer are awesome and greatly appreciated.

It looks like the case insensitive support for non-ASCII is hanging on the locale system.  Can you elaborate on the benefits/limitations of this?  There are basically no comments about the use of it in the patch except for the one before the scary "#define _GNU_SOURCE".  Based on my quick research, it would appear your code is trying to use the user's default locale which suggests that what we are able to lower-case will vary based on the user and their active locale.  If that is the case, it's obviously better than nothing, but definitely needs to be documented both in the code and elsewhere as it would likely limit the test cases we use and needs to be documented in a place the user can find it too.

I would be interested in alternatives you might have considered/investigated or if there are inherent limitations that require us to always use a locale.  For example, if there are characters for which lowercasing has different results in different locales, that would be very interesting/useful to know.  Is there something that prevents us from assembling a static lower-case mapping table by just sucking up all the locale definitions from a suitably licensed set of locale definitions?  etc.

For half-width Katakana and voiced sound mark, if you could either provide brief blurbs that explain more about their encodings and why they would be used, or provide a link to a page that does so (even just on the unicode website or wikipedia if you think they do a good enough job), it would be very useful.  As the tokenizer grows in size, I think it's very important for maintainability (and my ability to review the code :) for someone reading the code to be able to understand what is happening and why.  Once people become afraid of the code we're in for trouble because then no one wants to touch it.

Also, is there a reason there is no explicit test case for voiced sound mark?
(In reply to comment #11)
> Thank you for the patch!  Your continued efforts on creating/improving our
> tokenizer are awesome and greatly appreciated.
> 
> It looks like the case insensitive support for non-ASCII is hanging on the
> locale system.  Can you elaborate on the benefits/limitations of this?  There
> are basically no comments about the use of it in the patch except for the one
> before the scary "#define _GNU_SOURCE".  Based on my quick research, it would
> appear your code is trying to use the user's default locale which suggests that
> what we are able to lower-case will vary based on the user and their active
> locale.  If that is the case, it's obviously better than nothing, but
> definitely needs to be documented both in the code and elsewhere as it would
> likely limit the test cases we use and needs to be documented in a place the
> user can find it too.
> 
> I would be interested in alternatives you might have considered/investigated or
> if there are inherent limitations that require us to always use a locale.  For
> example, if there are characters for which lowercasing has different results in
> different locales, that would be very interesting/useful to know.  Is there
> something that prevents us from assembling a static lower-case mapping table by
> just sucking up all the locale definitions from a suitably licensed set of
> locale definitions?  etc.

Although I tested on many locales, such as English, German, Japanese, and Chinese, on Windows, Mac OS X, and Linux (CentOS 5), it works fine except in the C locale (LANG=C).

Also, I can create a table for non-ASCII characters (below U+2000) to support case insensitivity.  Do you think it would be better to create it?  The table may become big.

> For half-width Katakana and voiced sound mark, if you could either provide
> brief blurbs that explain more about their encodings and why they would be
> used, or provide a link to a page that does so (even just on the unicode
> website or wikipedia if you think they do a good enough job), it would be very
> useful.  As the tokenizer grows in size, I think it's very important to
> maintenance (and my ability to review the code :) for someone reading the code
> to be able to understand what is happening and why.  Once people become afraid
> of the code we're in for trouble because then no one wants to touch it.

The voiced sound mark exists only in Japanese.  It is like an accent: it combines with the previous character.  For example, "サ" (Katakana) with "゛" (voiced sound mark) becomes "ザ".  Although the full-width character mapping has precomposed characters like "ザ", there are no precomposed characters in the half-width Katakana character mapping.

I should ignore it to optimize the index for Katakana.

> Also, is there a reason there is no explicit test case for voiced sound mark?

It is included in the half-width tests.  The last test case is the half-width form of "サンダーバード", which is the Japanese Katakana for "Thunderbird".  "ダ", "バ", and "ド" carry a voiced sound mark.
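As a rough illustration of how a half-width base character plus a trailing voiced sound mark collapses into one precomposed full-width character (hypothetical function, not the patch's actual code; only a few mappings shown):

#include <stdint.h>

/* Combine a half-width Katakana base with a following half-width voiced
 * sound mark (U+FF9E) into the precomposed full-width character. */
static uint32_t
fold_halfwidth(uint32_t base, uint32_t next, int *consumed_next)
{
  *consumed_next = 0;
  if (next == 0xFF9E) {                                /* voiced sound mark */
    switch (base) {
      case 0xFF80: *consumed_next = 1; return 0x30C0;  /* ﾀ + ﾞ -> ダ */
      case 0xFF8A: *consumed_next = 1; return 0x30D0;  /* ﾊ + ﾞ -> バ */
      case 0xFF84: *consumed_next = 1; return 0x30C9;  /* ﾄ + ﾞ -> ド */
    }
  }
  if (base == 0xFF7B)                                  /* ｻ without a mark */
    return 0x30B5;                                     /* サ */
  return base;
}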
(In reply to comment #12)
> Although I tests on many locales such as English, Germany, Japanese, Chinese
> and etc on Windows, Mac OS X and Linux (CentOS 5), It works fine except to C
> locale (LANG=C).
> 
> Also, I can create the table for non-ascii character (less than U+2000) to
> support case insensitive.  Do you think the better to create it?  Table may
> become big.

I just looked into the locale stuff more and it looks like you picked exactly the right way to do it.  On my system there's basically a single i18n "tolower" map under LC_CTYPE that everything else uses.  Apart from POSIX, very few locales make minor changes to it (tr_TR and km_KH are what I see).  This sounds ideal and fine for testing.

Thank you for the explanations, they are very helpful!

I should be able to get to this by the end of the weekend at the latest.
Status: NEW → ASSIGNED
btw, I looked into how our JS tolower works.  It appears they have a compact table representation based on the java.lang.Character implementation.  The code is in jsstr.h and jsstr.cpp.
(In reply to comment #14)
> btw, I looked into how our JS tolower works.  It appears they have a compact
> table representation based on the java.lang.Character implementation.  The code
> is in jsstr.h and jsstr.cpp.

Java's table is based on a character type table.

To lowercase characters, I may only need the ranges U+0080 - U+05FF and U+1E00 - U+1FFF.
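As an illustration of that range-limited idea, the lookup could be restricted to those two blocks; the table contents and names below are hypothetical, with only a couple of entries filled in:

#include <stdint.h>

/* Sketch of a range-limited lowercase table.  A real table would be
 * generated from the Unicode Character Database; 0 means "no mapping". */
static const uint16_t kLowerTable[0x0600 - 0x0080] = {
  [0x00C0 - 0x0080] = 0x00E0,  /* 'À' -> 'à' */
  [0x0410 - 0x0080] = 0x0430,  /* Cyrillic 'А' -> 'а' */
};

static uint32_t gloda_tolower(uint32_t c)
{
  if (c >= 0x0080 && c < 0x0600) {
    uint16_t lower = kLowerTable[c - 0x0080];
    return lower ? lower : c;
  }
  /* A second table would cover U+1E00 - U+1FFF (Latin Extended Additional
   * and Greek Extended); everything else passes through unchanged. */
  return c;
}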
After consultation on when it's appropriate to blow away the gloda database again, it's been suggested we should not do that until we also have accent folding in the tokenizer.

Since the locale subsystem does not support accent folding, I think we will need to turn to a non-locale table/mapping after all.

It looks like most of the code that tries to do things like these consumes the Unicode Character Database to do so:
http://www.unicode.org/Public/5.1.0/ucd/UCD.html

It looks like mozilla's implementation of such a thing can be found in mozilla/intl/unicharutil.  It also looks less than comprehensive, not particularly documented, and rather out of date.  Unfortunately I don't have time at the moment to investigate it much further.

If you have any thoughts on accent folding and how to accomplish it, I would be very grateful.  Right now I think that will block landing/enabling the changes in this patch.
So I didn't completely give up.  There's an interesting comment on a similar problem at:
https://issues.apache.org/jira/browse/LUCENE-1343?focusedCommentId=12622432&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#action_12622432

nsIUnicodeNormalizer provides NormalizeUnicodeNFKD, and nsIUGenDetailCategory can tell us whether a character is a spacing/combining mark (enum value kUGDC_Mc).  The cost of calling the former might be okay; the cost of the latter could add up.  If we can short-cut the XPCOM interface or cache the results, that could be useful.  nsICaseConversion can do case conversion too.
The fts3 tokenizer shouldn't use XPCOM because it is called from SQLite.  I believe that using the table from jsstr.h is better than nsICaseConversion.
Comment on attachment 424942 [details] [diff] [review]
patch v1

Cancelling review.  I am considering the jsstr.h approach.
Attachment #424942 - Flags: review?(bugmail)
Attached patch patch v2 (obsolete) — Splinter Review
Work in progress...  removes the locale code and uses the table from mozilla/intl.
Attachment #424942 - Attachment is obsolete: true
Attached patch patch v2.1 (obsolete) — Splinter Review
Attachment #426954 - Attachment is obsolete: true
Attachment #427268 - Flags: review?(bugmail)
Comment on attachment 427268 [details] [diff] [review]
patch v2.1

No longer depends on the locale system.  I use the conversion table from mozilla/intl.
Makoto, what are your thoughts on accent folding?  As I noted in comment 16 we cannot land any changes to the tokenizer until we also tackle accent folding.

As I mentioned in comment 17, it sounds like if we perform NFKD, kill the spacing/combining marks, and then lowercase, we might have a good solution for accent folding.  Your patch seems to port the case-conversion logic from mozilla/intl because of an XPCOM concern, but it does not seem to indicate a plan to exclusively use table-driven conversion for accent-folding; nor, if the goal is to avoid XPCOM calls while doing the NFKD step, does it avoid requiring XPCOM calls.

Also, if you could elaborate on why you think we shouldn't make XPCOM calls from inside the tokenizer, that would be useful.  While I agree it would be preferable to avoid the XPCOM calls, there is no real danger as long as we know the XPCOM calls are not going to try and spin an event loop or otherwise trigger execution of code we do not expect.

Thank you for your continued work on the patch.  I'm sorry if I did not sufficiently express before that accent folding was my main concern; that it mooted use of locales was merely a side-effect.
(In reply to comment #23)
> Makoto, what are your thoughts on accent folding?  As I noted in comment 16 we
> cannot land any changes to the tokenizer until we also tackle accent folding.

Although I don't know much about accents in Latin-script languages, how do people search for a word that has an accent or no accent?  But as far as I can tell from Bug 506064, it is better to ignore accents for indexing.

Andrew, what do you think?

 
> As I mentioned in comment 17, it sounds like if we perform NFKD and kill the
> spacing/combining marks and then lowercase we might have a good solution for
> accent folding.  Your patch seems to port the case-conversion logic from
> mozilla/intl over an XPCOM concern but does not seem to indicate a plan to
> exclusively use table-driven conversion for accent-folding, nor if the goal is
> to avoid XPCOM calls and do the NFKD thing does it avoid requiring XPCOM calls.
> 
> Also, if you could elaborate on why you think we shouldn't make XPCOM calls
> from inside the tokenizer, that would be useful.  While I agree it would be
> preferable to avoid the XPCOM calls, there is no real danger as long as we know
> the XPCOM calls are not going to try and spin an event loop or otherwise
> trigger execution of code we do not expect.

The fts3 module is called from SQLite, not Gecko, so I think it should not depend on XPCOM, for safety.
(In reply to comment #24)
> Although I don't know accent much for Latin language, how do they search word
> that has accent or non-accent?  But, as long as I check Bug 506064, it is
> better to be ignore accent for indexing.
> 
> Andrew, how do you think?

I think bug 506064 and I are saying the same thing.  We want the tokenizer to ignore the accent marks; when tokenizing "parís" as per that bug's example, we want the tokenizer to emit "paris".  (And with case-folding also happening, "París" should become "paris".)

My understanding of the NFKD mechanism is that it decomposes the accented "í" into an "i" with a combining mark for the accent and does this for all accented characters.  By performing this transform and then stripping the combining mark, we get the desired result.  There are likely character-centric ways to accomplish this using the unicode database as well.
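A minimal sketch of that pipeline, with hypothetical helper names standing in for whichever normalizer and category/case lookups end up being used (these are placeholders, not actual Mozilla or SQLite APIs):

#include <stddef.h>
#include <stdint.h>

/* Hypothetical helpers (placeholders). */
size_t normalize_nfkd(uint32_t c, uint32_t *out, size_t out_cap);
int is_combining_mark(uint32_t c);
uint32_t fold_lower(uint32_t c);

/* NFKD-decompose each character, drop combining marks, case-fold the rest.
 * With this, U+00ED ('í') decomposes to U+0069 U+0301, the U+0301 is
 * dropped, and the token ends up containing a plain 'i'. */
static size_t fold_token(const uint32_t *in, size_t in_len, uint32_t *out)
{
  size_t out_len = 0;
  for (size_t i = 0; i < in_len; i++) {
    uint32_t decomposed[8];
    size_t n = normalize_nfkd(in[i], decomposed, 8);
    for (size_t j = 0; j < n; j++) {
      if (is_combining_mark(decomposed[j]))
        continue;
      out[out_len++] = fold_lower(decomposed[j]);
    }
  }
  return out_len;
}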

If you do not feel comfortable or do not have the time to try and implement this, that is fine, but we can't land changes to the tokenizer that would affect its output until we have a patch that also handles accents.  (Because of the need to blow away the gloda database.)
 
> fts3 module is called from sqlite not gecko, so I think that it should not
> depend on XPCOM for safety.

The mozStorage component's support for custom functions already results in SQLite calling into XPCOM code.  I am familiar with both the mozStorage code and SQLite code and it is fine as long as we 1) only affect reference counting on XPCOM instances with threadsafe addref/release and 2) do not spin or call anything that might spin an event loop.
Now I am testing a new patch with a new normalization table (for removing accents, case insensitivity for non-ASCII, full-width ASCII, and half-width katakana).
The normalization function of the new patch is generated from nfkc.txt and nfkc_cf.txt in libicu...

The new fix will be included in the Bug 506064 fix.
(In reply to comment #26)
> Now I am testing new patch with new normalization table (for removing accent,
> case insensitive for non-ASCII, full-width ASCII and half-width katakana).
> Normalization function of new patch generates by nfkc.txt and nfkc_cf.txt into
> libicu...

This is great news!  But please put the patch on this bug since this bug has all of the relevant technical discussion thus far and it's not even clear if that bug is about SQLite FTS3-driven search or Thunderbird's search subsystem.
Summary: Gloda search is case sensitive for non-ASCII letters → Gloda search tokenizer should perform case-folding and accent-folding to be case-insensitive and accent-insensitive, also handle half-width katakana
Attachment #427268 - Flags: review?(bugmail) → review-
Attached patch WIP v3 (obsolete) — Splinter Review
Attachment #427268 - Attachment is obsolete: true
Awesome!

I'll wait for you to request review (and turn back on your unit tests :), but here's one request:

In copy_stemmer if you could either change variables "i" and "j" to something more explanatory or add comments about what they are used for, that would be helpful.  (I realize they were already named that way, but you are increasing the size and complexity of the function so it's not just simple byte-copying anymore.  Also I would have complained at the original author if I could :).
blocking-thunderbird3.1: --- → ?
I don't think we'd block 3.1 for this if it were the last bug standing, but it would be really really great to get it.

This is a big enough change that I think if we're going to take it for 3.1, it needs to land by the feature freeze date so that it gets sufficient baking time before we ship.  Feature freeze is currently scheduled to be two weeks from today.
blocking-thunderbird3.1: ? → -
Flags: wanted-thunderbird+
I understand that this patch fixes a set of real, significant issues, but I'd like to get a better feel for the quality and quantity of pain being felt by our users.

Roland, sipaq: have we gotten much feedback about the problems this bug fixes in 3.0?  If so, can you summarize what the relative quantity and tone of that feedback has been like?
No feedback regarding this issue has reached me.
In this case I don't think we can judge the pain by the feedback we are getting - which doesn't help you much I'm afraid, but it's a datapoint. Indeed I think you'd be hard pressed, via a serious search, to find people posting about such issues.  Perhaps polling folks in moz-japan and the like on issues like this might be beneficial.

(posted before but I tend to forget this myself - participation in bugzilla and our various fora + gsfn is not reflective of our user population. IOW, we don't hear from users about their international issues as much as we need to.)  I think this is also reflected in the general health of the bugs tagged intl. We get some wins via what gets fixed in core, but otherwise our intl bugs tend to linger.
Search was one of the major pieces of reworked functionality introduced in TB3. MoCo advertises this at http://www.mozillamessaging.com/en-US/thunderbird/, so people downloading the program think they can actually use the new functionality, but they can't if their language differs from English. So people will download TB to try the new functionality, find out this new super-uber search isn't doing its thing, and of course feel cheated. Such international issues shouldn't be treated as second class when releasing a major version.
This is good input, thanks.  I have a bunch of thoughts, but before I spend time enumerating them in detail, I suspect it would be most productive if m_kato could give an idea of how much work there is left to do, and what kind of time he's likely to have for this in the near future....
Attachment #432107 - Flags: review?(bugmail)
This looks great!  Thank you!

I probably will not be able to do a full review pass where I try out the scripts and such until tomorrow, but I expect this to land with only minor changes.  (I want to paste in some of m_kato's comments from the thread, mainly.)

I may defer the landing by a few days in order to coordinate making a simultaneous gloda schema change.  If it looks like that is going to take longer than that, we may just have to bump the schema once for this patch and once later for gloda changes.
I'm marking this as blocking and myself as the responsible party to make sure this is fully on the drivers/my radar and gets into beta 2.
Assignee: m_kato → bugmail
blocking-thunderbird3.1: - → beta2+
Whiteboard: [gloda key] → [gloda key][asuth needs to review]
Comment on attachment 432107 [details] [diff] [review]
patch v4 http://hg.mozilla.org/comm-central/rev/d67ca271fb2d

r=asuth with the changes I am attaching in a patch for your review...
Attachment #432107 - Flags: review?(bugmail) → review+
This does the following:

- Makes generate_table support "0000>" syntax where we 'nuke' the source character.  For our purposes, nuking turns it into a space character to force it to delimit.  This may be the wrong thing to do since I believe one of the examples is a soft-hyphen which should probably just disappear into the ether, but it is definitely better than leaving the soft-hyphen in as part of the token.

- Makes generate_table support "0000..0000>0000" syntax where multiple things map to the same thing.  This is largely used for the nuking purpose above.

- Makes generate_table do the full transitive closure on the contents of the array.  Just doing array[array[i]] was not sufficient since there can be multiple chained decompositions followed by a case conversion.  This eliminates the need for the hacky manual attempt to perform some last-ditch case conversion.  (A sketch of the idea is at the end of this comment.)

- Adds two new tests of the normalization logic.  The new 'yesterday' test passes with my changes but fails without them (needs the transitive closure).  The 'awesome' test is a less extensive variant of the test, but already passed without my changes.

- Uses a regenerated Normalize.c with the modified table generation script

- Avoids the potential for buffer overrun because normalization can result in a longer utf-8 encoding for the normalized character than the original character.  (example cited in comments)

- Changes the copy_stemmer logic to operate on whole utf-8 encoded unicode characters.

- Changes the copy_stemmer logic to no longer treat tokens with numbers in them differently.

- Adds various comments.

The patch *does not* rev the gloda DB schema right now.  I'll add that in at the last minute when I push the patches; there may be other schema changes at the same time...
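A rough sketch of what the transitive-closure pass amounts to (illustrative C rather than the actual table generation script; it assumes identity entries for unmapped characters and no cycles in the table):

#include <stddef.h>
#include <stdint.h>

/* Follow each mapping to its fixed point so that a chained decomposition
 * followed by a case conversion (e.g. a precomposed character mapping to
 * an uppercase base letter, which in turn maps to lowercase) collapses
 * into a single direct entry. */
static void close_mapping(uint32_t *map, size_t len)
{
  for (size_t i = 0; i < len; i++) {
    uint32_t target = map[i];
    while (target < len && map[target] != target)
      target = map[target];
    map[i] = target;
  }
}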
Attachment #433005 - Flags: review?(m_kato)
Er, just to be clear, you need to apply m_kato's v4 patch then apply mine on top of that.
Assignee: bugmail → m_kato
Comment on attachment 433005 [details] [diff] [review]
review changes patch v1 on top of m_kato's v4 patch http://hg.mozilla.org/comm-central/rev/5d97bb8bf0cc

Great!
Attachment #433005 - Flags: review?(m_kato) → review+
pushed to comm-central, including a db schema bump:
http://hg.mozilla.org/comm-central/rev/d67ca271fb2d
http://hg.mozilla.org/comm-central/rev/5d97bb8bf0cc

victory Ǡccȇnts for everybody!
Status: ASSIGNED → RESOLVED
Closed: 14 years ago
Resolution: --- → FIXED
Attachment #432107 - Attachment description: patch v4 → patch v4 http://hg.mozilla.org/comm-central/rev/d67ca271fb2d
Attachment #433005 - Attachment description: review changes patch v1 on top of m_kato's v4 patch → review changes patch v1 on top of m_kato's v4 patch http://hg.mozilla.org/comm-central/rev/5d97bb8bf0cc
Whiteboard: [gloda key][asuth needs to review] → [gloda key]
Target Milestone: --- → Thunderbird 3.1b2
Depends on: 579870
Keywords: relnote