Closed Bug 506064 Opened 15 years ago Closed 1 year ago

Thunderbird filters should ignore the accents

Categories

(MailNews Core :: Search, defect)

defect

Tracking

(Not tracked)

RESOLVED DUPLICATE of bug 284856

People

(Reporter: striptm, Unassigned)

References

Details

(Keywords: intl)

User-Agent:       Mozilla/5.0 (Windows; U; Windows NT 5.1; es-ES; rv:1.9.1.1) Gecko/20090715 Firefox/3.5.1
Build Identifier: Mozilla/5.0 (Windows; U; Windows NT 5.1; es-ES; rv:1.9.1.1) Gecko/20090715 Thunderbird/3.0b3

Thunderbird should ignore whether a word has or not an accent when performing a search, in the same way that it is not "case sensitive".

For example, if there are two messages with the following subjects: "Vacaciones en París" and "hoteles paris", a search for "París" should return both results.

In Spanish language, unfortunately, there are too many people who writes without the accents, then it is important that á é í ó ú ü are equivalent to a a e i o u u.

Reproducible: Always

Steps to Reproduce:
1. Inbox with a message that contains an accented word in the subjet or receiver, ej. 'parís'
2. In search box write 'paris'
Actual Results:  
The message is not found, because it thinks that parís != paris

Expected Results:  
The message with is founded 'paris'
This is the kind of search customization that is well suited to the new custom search terms functionality from bug 495519.
Component: General → Search
Product: Thunderbird → MailNews Core
QA Contact: general → search
Well, this sounds like a good idea in general not just customization, where I'm wondering though if there are clear transcription rules for all character sets.
Confirmed also in SeaMonkey 2.0b1 (Mozilla/5.0 (X11; U; Linux i686; es-ES; rv:1.9.1.1pre) Gecko/20090717 SeaMonkey/2.0b1).
Keywords: intl
OS: Windows XP → All
Hardware: x86 → All
Version: unspecified → Trunk
I was unable to find a dup - marking this RFE as valid.
Status: UNCONFIRMED → NEW
Ever confirmed: true
Assignee: nobody → m_kato
Fernando, you issue is about the old "Search Messages", or the new "Search all messages"
Wayne, Is that a question?

My issue is about the search box on the top-right corner of Thunderbird. For example when selecting "By subject..." and then typing "guia" a message with the subject "Guía para principiantes" does not match the search.
(In reply to comment #6)
> Wayne, Is that a question?
> 
> My issue is about the search box on the top-right corner of Thunderbird. For
> example when selecting "By subject..." and then typing "guia" a message with
> the subject "Guía para principiantes" does not match the search.

I thought I saw another bug about this, that wasn't bug 284856 or bug 525537. Aureliano, do you recall seeing one?
[getting harder to finish a thought]

(In reply to comment #6)
> Wayne, Is that a question?

yes :(
 
> My issue is about the search box on the top-right corner of Thunderbird. For
> example when selecting "By subject..." and then typing "guia" a message with
> the subject "Guía para principiantes" does not match the search.

so filters
Summary: Thunderbird search engine should ignore the accents → Thunderbird filters should ignore the accents
This should apply to  
1 filters 
2 search all messages, and 
3 search within a message

and (as an aside) search within a page in firefox

If you actually want to search for an accented character you can, and it all operates quite correctly
but normally you want to search for paris or parís however it was typed.

In French at least it's particularly relevant, because it is very common to omit the accent on upper case characters. Filters/searches are case independent by default, it would be better if they were also accent independent by default. 

Google seems to do it right:
if you click on http://www.google.fr/search?q=site:usefulshortcuts.com+e&sourceid=navclient-ff&rlz=1B3GGLL_en-GBFR382FR382&ie=UTF-8&hl
you will see that all versions of letter e (with or without accent) are bolded ie marked as hits

whereupon http://www.google.fr/search?q=site:usefulshortcuts.com+e&sourceid=navclient-ff&rlz=1B3GGLL_en-GBFR382FR382&ie=UTF-8&hl=#sclient=psy&hl=fr&rlz=1B3GGLL_en-GBFR382FR382&source=hp&q=site:usefulshortcuts.com+%C3%AB&aq=f&aqi=&aql=&oq=&gs_rfai=&pbx=1&fp=32bebe5f540a3320
only finds the accented character 

I've just tested the behaviour of IE and Opera and they both seem to do as TB/FF do, which is a shame.

Steve
Don't forget The cédille ¸ (cedilla) is found only on the letter C. It changes a hard C sound (like K) into a soft C sound (like S), e.g., garçon. The cedilla is never placed in front of E or I, because C always sounds like an S in front of these vowels.

I also recently received an e-mail from Norway whether the O had a forward slash through it.
I don't know if this is relevant to this discussion of accents, nor have I noticed any problems.

The Norwegian alphabet consists of 29 letters.

The 'O with Slash' "Ø" and "ø"  (transcribed as OE oe)
The 'A with Ring' "Å" and "å"  (transcribed as AA aa)
And 'Latin AE' "Æ" and "æ"  (transcribe as AE ae)

These are Latin letters and NOT accents.

The sorting order is 'ABC...XYZÆØÅ' (ie: AE OE AA after Z)

To complicate matters they is sometimes written with the Swedish characters
the Å 'A with Dieresis' "Ä" and "ä"
the Ö 'O with Dieresis' "Ö" and "ö"
again, these are NOT accents, but normal letters of the alphabet.
See Also: → 1042681
Removing myslef on all the bugs I'm cced on. Please NI me if you need something on MailNews Core bugs from me.
Assignee: m_kato → nobody
Severity: normal → S3
See Also: → 284856
Duplicate of this bug: 1766868
Duplicate of this bug: 1761717
Duplicate of this bug: 1235444
Status: NEW → RESOLVED
Closed: 1 year ago
Duplicate of bug: 284856
Resolution: --- → DUPLICATE
See Also: 284856
You need to log in before you can comment on or make changes to this bug.