Scam detector should allow a user to train it so duplicate/similiar emails are not marked as a scam.
Categories
(Thunderbird :: Security, defect)
Tracking
(Not tracked)
People
(Reporter: mikel, Unassigned)
References
(Blocks 2 open bugs, )
Details
(Whiteboard: [gs])
Comment 1•19 years ago
|
||
Comment 2•19 years ago
|
||
Comment 3•19 years ago
|
||
Comment 5•19 years ago
|
||
Comment 7•19 years ago
|
||
Comment 8•19 years ago
|
||
Comment 9•19 years ago
|
||
Comment 10•18 years ago
|
||
Comment 11•18 years ago
|
||
Comment 12•18 years ago
|
||
Comment 14•18 years ago
|
||
Comment 15•17 years ago
|
||
Comment 16•17 years ago
|
||
Updated•17 years ago
|
Comment 17•16 years ago
|
||
Comment 19•16 years ago
|
||
Comment 20•16 years ago
|
||
Comment 22•15 years ago
|
||
Comment 24•15 years ago
|
||
Comment 25•15 years ago
|
||
Comment 26•15 years ago
|
||
Comment 27•15 years ago
|
||
Updated•14 years ago
|
Comment 29•14 years ago
|
||
Comment 30•14 years ago
|
||
Comment 31•14 years ago
|
||
![]() |
||
Comment 32•14 years ago
|
||
Comment 33•14 years ago
|
||
![]() |
||
Comment 34•14 years ago
|
||
Comment 35•14 years ago
|
||
![]() |
||
Comment 36•14 years ago
|
||
![]() |
||
Comment 38•14 years ago
|
||
Comment 39•14 years ago
|
||
Comment 40•14 years ago
|
||
Comment 41•14 years ago
|
||
Comment 42•12 years ago
|
||
Updated•12 years ago
|
Comment 43•12 years ago
|
||
Comment 44•12 years ago
|
||
Comment 45•12 years ago
|
||
![]() |
||
Comment 46•12 years ago
|
||
Comment 47•12 years ago
|
||
![]() |
||
Comment 48•12 years ago
|
||
Comment 49•12 years ago
|
||
Comment 50•12 years ago
|
||
Comment 51•12 years ago
|
||
Comment 52•10 years ago
|
||
Comment 53•10 years ago
|
||
Comment 54•10 years ago
|
||
Comment 55•10 years ago
|
||
Comment 56•9 years ago
|
||
Comment 57•9 years ago
|
||
Comment 58•9 years ago
|
||
Comment 59•9 years ago
|
||
Comment 60•9 years ago
|
||
Comment 61•9 years ago
|
||
Comment 62•9 years ago
|
||
Comment 63•9 years ago
|
||
Comment 64•9 years ago
|
||
Comment 65•9 years ago
|
||
Comment 66•9 years ago
|
||
Comment 67•9 years ago
|
||
Comment 70•9 years ago
|
||
Comment 71•9 years ago
|
||
Comment 72•9 years ago
|
||
Comment 73•9 years ago
|
||
Comment 74•9 years ago
|
||
Comment 75•9 years ago
|
||
![]() |
||
Comment 76•9 years ago
|
||
Comment 77•8 years ago
|
||
Comment 80•8 years ago
|
||
Comment 82•8 years ago
|
||
Comment 83•8 years ago
|
||
Comment 84•8 years ago
|
||
Updated•7 years ago
|
Comment 85•5 years ago
|
||
With bug 1476428 now fixed, I think there is very little to do here. That should basically cover the initial problem.
The premise of this bug is false though. You can't "train" what is a scam or not.
Comment 86•5 years ago
|
||
(In reply to Magnus Melin [:mkmelin] from comment #85)
With bug 1476428 now fixed, I think there is very little to do here. That should basically cover the initial problem.
The premise of this bug is false though. You can't "train" what is a scam or not.
Perhaps you can not train for it, but I think the reality is the users want to flag what they consider is not a scam. Almost all instances of users wanting the product to learn is they want a white list and don't now how to ask for it. Almost all the reports in support start with "Thunderbird thinks mail from XXXXX is a scam. How do I tell it it is not.This is especially the case with US political email. Folks get a little testy when their party of choice is considered a spam because of the tracking URL's in the email
Comment 87•5 years ago
|
||
Spot on @Matt, of course it could be trained. Deciding something is a scam is a heuristic or algorithm , and its almost always flawed thinking by developers to think their algorithm is always right. In this case the algorithm decided that links via common tracking sites were scams, when in the experience of many users, most of the time they weren't scams - for example many if not most company newsletters went back through certain tracking sites, which could - but is very rarely - a scam. It would be perfectly possible for TB to learn from user feedback that a message from domain a.a.a. with links to b.b.b. wasn't a scam. In the same way I can tell it allow remote content from certain email addresses.
So ... if you mean that you don't have time to fix this very common problem then fine - we know the TB is small and over-stretched to fix multi-year bugs, but please don't try and convince us that you "can't train what is a scam or not".
Comment 88•5 years ago
|
||
Magnus, Firefox has recently introduced a tracking cookie blocker where they have blocked some thousands of domains from dropping tracking cookies. What do you think of the idea of using that same database in the scam detection of determine known trackers and automatically white list them. That would reduce the number of false positives I am sure to quite low levels.
The json source is hosted here. https://github.com/mozilla-services/shavar-prod-lists
If you think it might be worth pursuing, I will file an appropriate bug.
Comment 89•5 years ago
|
||
I think there's a misunderstanding about the scam detection here.
First, scam is not the same as spam (though scam can be spam too). A scam is when you get an email trying to trick you into something, like following the link in the mail to update your bank details... If it's to be effective (for the scammer) it will look very similar to a real mail, but maybe the site name can be slightly different, but still similar enough for you to think it's legit. Therefore, you can't "train" such detection, or set whitelists - it would make the detection attempt completely pointless - you're not going to give details to the prince of Nigeria, but you're likely to give them to your mom (who you'd whitelist).
Second, the problem with wrong detections due to using of a tracker link instead is solved with bug 1476428 (this would also cover the complaints about political bias, of which there of course were none).
Let's not also mix in tracking (on the web) into the mix. We're not using any such database, and tracking is not scam. Either way, tracking requires remote content, for which we have the other warning bar. And, please try in Thunderbird 68, bug 1476428 will kick in and give you a choice if you get such a link.
Comment 90•5 years ago
|
||
Isn't one of the benefits of using the computer that it is very good at spotting differences in once char.
Actually a single char difference should be a big clue that it is spam or scam :)
Also, do not say it is impossible, we have already seen computers do things that was considered impossible just a few years ago.
It might be very difficult and result in an unacceptable number of false positive or negative but thats not the same as impossible.
As for how to determine a white list, could you use the full received list in the headers. If the sender, all receiver headers and any server and other origin pattern headers are the same you can be pretty sure the email was from the same sender.
Another option is, could thunderbird use the HEAD tag to check if it gets a redirect response matching the visible domain? That could also be a way to white list a link?
Comment 91•5 years ago
|
||
Detecting the one-char differences is already working.
Checking headers wouldn't do much good. You'd get many false positives.
As for checking HEAD - no need, since the code in 68 already handles it. Would also violate privacy.
Please before commenting further, please try out 68, and if you have any specific improvement requests after that, file specific bugs.
Comment 92•5 years ago
|
||
How do we update to 68 when it's not an update yet?
I don't want to have huge problems with TB using a BETA version.
Thanks
Comment 93•5 years ago
|
||
68 is already officially out. Auto-upgrades from 60.x are coming soon. You should be able to do a manual update directly:Just go to Help | About Thunderbird and it will download and install for you.
Comment 94•5 years ago
|
||
(In reply to Magnus Melin [:mkmelin] from comment #89)
First, scam is not the same as spam (though scam can be spam too). A scam is when you get an email trying to trick you into something, like following the link in the mail to update your bank details... If it's to be effective (for the scammer) it will look very similar to a real mail, but maybe the site name can be slightly different, but still similar enough for you to think it's legit. Therefore, you can't "train" such detection, or set whitelists - it would make the detection attempt completely pointless - you're not going to give details to the prince of Nigeria, but you're likely to give them to your mom (who you'd whitelist).
Yes, scam is not the same as spam, and that’s why failures in scam detection are even more problematic. DIsclaimer: I haven’t used Thunderbird since shortly after I submitted my original bug (so unfortunately I can no longer give more details) because Thunderbird’s false positive rate was way beyond my threshold of tolerance. When you get virtually no actual (or only easily spottable) scams but important emails keep getting flagged as “scam” the user is going to feel Thunderbird is a piece of garbage.
Comment 95•5 years ago
|
||
Right - and that's why it makes sense to have something that learns, Getting a single "scam" detect in a message is no big deal, getting them repeatedly on something you know is fine is the problem. Being able to do similar to the "Allow remote content" that then understood that this particular combination is not actually a scam, is exactly what was needed.
Comment 96•5 years ago
|
||
Once again, use 68. You'll find that is not a problem.
Comment 97•5 years ago
|
||
People on LInux LTS editions have only 60 or slightly newer.
It's pretty ridiculous that emails from Google calendar are marked as scam and warning is shown for google.com links (!)
Comment 98•5 years ago
|
||
(In reply to Daniel from comment #97)
People on LInux LTS editions have only 60 or slightly newer.
It's pretty ridiculous that emails from Google calendar are marked as scam and warning is shown for google.com links (!)
What version of Thunderbird is offered by your distribution is an issue you might want to address with the folks that manager the distribution, it has nothing to do with the Thunderbird release process. Or you could install the version offered on the Thunderbird.net web site and get the latest version.
Description
•