Closed Bug 596862 Opened 14 years ago Closed 14 years ago

Allow including of characters with diacritics in search results

Categories

(Core :: Find Backend, defect)

defect
Not set
normal

Tracking

()

RESOLVED DUPLICATE of bug 202251

People

(Reporter: pander, Unassigned)

Details

User-Agent:       Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.2.9) Gecko/20100825 Ubuntu/9.10 (karmic) Firefox/3.6.9
Build Identifier: 

When searching text in a web page with Firefox, in e-mails with Thunderbird, or in any other Mozilla product, please allow a search term typed without diacritics to also match the same characters with diacritics.

Many texts are written without diacritics, or use them inconsistently. In those cases, finding every occurrence of a word is laborious. For example, a French text might contain both "cafe" and "café", and a Japanese text might contain both "omote" and "ōmote".

This occurs a lot, especially in pages with many authors, such as blogs, wikis and threads. Texts that mix languages are also prone to it, especially when the author writes in one language about another. Even transliterations come in different versions that use diacritics differently.

An extensive listing of diacritics can be found here:
  http://en.wikipedia.org/wiki/Diacritics
Note that diacritics can occur on both vowels and consonants. Open source filters for this purpose probably already exist.
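As an illustration of the kind of filter meant here, the following is a minimal sketch in Python. It is not how any Mozilla product implements searching. It assumes that "removing diacritics" can be approximated by Unicode canonical decomposition (NFD) followed by dropping combining marks. The function names `strip_diacritics` and `matches_ignoring_diacritics` are hypothetical and chosen only for this example.

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    """Return text with combining marks removed (hypothetical helper).

    Assumption: a diacritic is any character that NFD decomposition
    splits off as a combining mark. Letters without a canonical
    decomposition, such as "ø" or "ł", are left unchanged.
    """
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(c for c in decomposed if not unicodedata.combining(c))


def matches_ignoring_diacritics(query: str, text: str) -> bool:
    """True if query occurs in text once both are stripped of diacritics."""
    return strip_diacritics(query) in strip_diacritics(text)
```

With this sketch, a query of "cafe" would match a text containing "café", and "omote" would match "ōmote". Consonants with marks, such as "č" or "ñ", are folded the same way as vowels.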

This option could be called "Match diacritics" and, like "Match case", be set to FALSE by default. Note that the "Match case" option should work well together with this option. In the Find toolbar, this option can be toggled with a check box. The quick search/filter in Thunderbird and Sunbird has no room for any options. "Match case" is disabled in those cases, so it would follow convention to also disable "Match diacritics" there.
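How the two options could combine can be sketched as follows. This is only an illustration in Python under stated assumptions, not the Find toolbar's actual code. The names `fold`, `find_all`, `match_case` and `match_diacritics` are hypothetical. Diacritics are assumed to be the combining marks produced by Unicode NFD decomposition, and case is ignored with `str.casefold()`.

```python
import unicodedata


def fold(text, match_case, match_diacritics):
    """Fold text for comparison and map folded positions back to the original.

    Returns (folded_text, index_map), where index_map[i] is the index in
    the original text of the character that produced folded_text[i].
    """
    folded = []
    index_map = []
    for i, ch in enumerate(text):
        piece = ch
        if not match_diacritics:
            # Drop combining marks so that "é" compares equal to "e".
            piece = "".join(
                c for c in unicodedata.normalize("NFD", piece)
                if not unicodedata.combining(c)
            )
        if not match_case:
            piece = piece.casefold()
        for c in piece:
            folded.append(c)
            index_map.append(i)
    return "".join(folded), index_map


def find_all(text, query, match_case=False, match_diacritics=False):
    """Return the start indices in text of every match of query.

    Both options default to False, mirroring the proposed defaults.
    """
    folded_text, index_map = fold(text, match_case, match_diacritics)
    folded_query, _ = fold(query, match_case, match_diacritics)
    if not folded_query:
        return []
    hits = []
    start = folded_text.find(folded_query)
    while start != -1:
        hits.append(index_map[start])
        start = folded_text.find(folded_query, start + 1)
    return hits
```

With both options off, searching "Cafe, café, CAFÉ" for "cafe" would find all three words. Turning on only `match_case` would leave "café", and turning on only `match_diacritics` would leave "Cafe". The sketch does not normalize text when `match_diacritics` is on, so a precomposed "é" and a decomposed "e" plus combining accent would not match each other in that mode.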

Search engines like Google, Yahoo, etc. and many other applications also provide this functionality (by default or not).

Implementing this functionality will make it more efficient to find all occurrences of terms that have or can have diacritics and are written in different spellings (correct or not).

Reproducible: Always
Status: UNCONFIRMED → RESOLVED
Closed: 14 years ago
Resolution: --- → DUPLICATE