Closed Bug 354445 Opened 18 years ago Closed 18 years ago

[enh] language filter feature

Categories

(Thunderbird :: Preferences, enhancement)

enhancement
Not set
normal

Tracking

(Not tracked)

VERIFIED DUPLICATE of bug 62598

People

(Reporter: DXpublica, Assigned: mscott)

Details

User-Agent:       Mozilla/5.0 (X11; U; Linux i686; ca; rv:1.8.0.7) Gecko/20060921 Ubuntu/dapper-security Firefox/1.5.0.7
Build Identifier: Mozilla/5.0 (X11; U; Linux i686; ca; rv:1.8.0.7) Gecko/20060921 Ubuntu/dapper-security Firefox/1.5.0.7

Hi,

Normally one user could receive mails in languages that he/she does not understand (for example english users receive japanese mails, or spanish users receive portuguese language mails).

Clearly these mails clould be marked as spam, but now, there is no rule/filter in thunderbird for mark all mails written in some (user defined) language

I think this feature could be useful for thunderbird users.

Perhaps it's scope is a little bit out thunderbird because perhaps it depends on external mail filter (like spamassassin). I don't know how true is it


Thanks in advance,
Xan.

Reproducible: Always

Actual Results:  
No filters for specific language discrimination
And how do you plan to recognize the different languages (to see what's inside the mails) ? The charset-header ? That's for the encoding (ISO-8859-1, UTF-8, Shift-JIS, ...), not for the languages itself. Spanish and Portuguese will use the same encoding (just like English or Dutch), so there's no way to recognize them. And if everybody is using UTF-8 (which everybody should), then there's no way to differentiate them.
Don't get me wrong - filtering out unrecognizable encodings (I don't read Japanese or Chinese) would be useful, but I just wanted to point out that you can't filter out languages. Your Spanish/Portuguese example will never work this way.

On the other hand, companies like Google can already guess in what language a webpage is, so there's a way. But I don't think it will be easy for Thunderbird.

*** This bug has been marked as a duplicate of 62598 ***
Status: UNCONFIRMED → RESOLVED
Closed: 18 years ago
Resolution: --- → DUPLICATE
> *** This bug has been marked as a duplicate of 62598 ***

As noted at bug 62598 comment 31, Search/Filter on the charset of the message is possible now simply by creating a filter to search the Content-Type field (which you need to add to the list of Custom Headers).

Search/Filter on the MIME-encoding of the headers is bug 320584.
Status: RESOLVED → VERIFIED
what if you could teach it by example? like, when you get spam not detected as such with words that don't occur in your non-spam messages, you could mark it as spam and tbird would learn about those words being spammy? oh, wait...

:)
You need to log in before you can comment on or make changes to this bug.