Closed Bug 389488 Opened 17 years ago Closed 14 years ago

"Entire message" search fails if charset not known - should use default charset.

Categories

(Thunderbird :: Mail Window Front End, defect)

defect
Not set
major

Tracking

(Not tracked)

RESOLVED DUPLICATE of bug 404255

People

(Reporter: bugzilla, Unassigned)

References

(Blocks 1 open bug)

Details

(Keywords: intl)

Attachments

(1 file)

User-Agent:       Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8.1.5) Gecko/20070713 Firefox/2.0.0.5
Build Identifier: Thunderbird 2.0.0.5

Doing a "Entire message" search of a word in a message which charset is unknown causes a search failure.

After adding the alias to res/charsetalias.properties, search works.

Note that search in subject/header works fine.

The default charset (or utf-8 or...) should be used when charset is not recognised for searching "entire message".


Reproducible: Always

Steps to Reproduce:
1. Get a message with an unknown charset like "ASCII" (or change the message manually) ; set subject to "hello world" and message body to "hello world"
2. Search "world" with search drop-down list set to "Subject or Sender" => OK
3. Same search in "Entire message" => nothing
Actual Results:  
Search with "Entire message" fails.

Expected Results:  
Any default charset would find the word "world" and provides a search result including the message "hello world".

I set the bug as Major: it made me miss an important message!
This can cause a real problem in some case where you think you didn't receive a message, while you did.
will this also be an issue in the new TB3 search paradigm?
very close to mine bug 404255
Just hit the same problem with the current Thunderbird 2.0.0.19. Searched an hour for a mail that I knew was there but Thunderbird was not able to find. Turned out the problem was it had charset=ascii. This may not be a correct charset, but Thunderbird should deal with unknown charsets anyway - in this case the mail was delivered by Amazon.

For reproducing the problem, find attached a zip archive with 5 test mails which are identical except for the charset. If you search for a word in the message body, like "ragtime", you will only get 4 hits - the one with charset=ascii will not be found!
To bugzilla@oui.jp(bug opener) and Christoph Zwerschke:

Local mail folder(POP3 or "Local Folders"? IMAP?
If IMAP, online search? Local(or offline) search?
If IMAP and online search, I think DUPing to Bug 404255 is appropriate action, because Bug 404255 clearly explains issue.
I experienced this with *Local Folders*. You can simply import the 5 eml files in the attached zip file to any local folder (e.g. using ImportExportTool) and then search inside that folder for "ragtime" in the message text. All emails will be found, except the one with charset=ascii, although it also contains "ragtime" in the message text. Note that no non-ascii characters exist or are searched in these emails (different to bug 404255), it's all plain ascii.
Keywords: intl
Have look into Bug 404255 which is propose to use UTF-8 for online search. Probably we should mark this as dupe
Status: UNCONFIRMED → RESOLVED
Closed: 14 years ago
Resolution: --- → DUPLICATE
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: