have global search support exact matching (disabling stemming). ex: Searching for 'wedding' finds 'weds'
Categories
(Thunderbird :: Search, enhancement)
Tracking
(Not tracked)
People
(Reporter: chris, Unassigned)
References
(Blocks 1 open bug)
Details
Comment 1•13 years ago (important)
Comment 5•11 years ago
Comment 6•11 years ago (important)
Comment 7•11 years ago
Comment 8•11 years ago
Comment 10•11 years ago
Comment 11•11 years ago
Comment 12•11 years ago
Comment 13•11 years ago
Comment 15•10 years ago
Comment 16•10 years ago
Comment 17•10 years ago (workaround)
Comment 18•9 years ago
Comment hidden (metoo)
Comment hidden (metoo)
Comment 21•8 years ago (workaround)
Comment 22•8 years ago
Comment 23•8 years ago
Comment hidden (metoo)
Comment 25•8 years ago
Comment 26•8 years ago
Comment 27•7 years ago
Comment 28•7 years ago
Updated•7 years ago
Comment hidden (metoo)
Comment hidden (metoo)
Comment 33•5 years ago
(In reply to Caro Cogitatus from comment #31)
Please fix this. Fuzzy searches should be an option. This "feature" makes
the search function essentially useless.
I could not agree more. I have finally managed to introduce thunderbird in my company and being able to find email messages reliably is basic functionality.
This issue was opened 7 years ago and it appears there aren't any plans for changing it any time soon.
One last thought, IMHO this should not be considered an enhancement but a serious bug, as I said before, email search is basic functionality for any email client.
Comment 34•5 years ago
Agreed, this is infuriatingly broken! 99% of the time I do NOT want the stemmed result.
Today I am trying to find emails referencing products of the brand "Sensative", and it comes up with hits for "sense", "senses", etc., completely drowning out the few emails with 'sensative' in them...
This has been bugging people for 7 years, and still no-one fixed it.
Comment 35•5 years ago
This has been bugging people for 7 years, and still no-one fixed it.
"No-one" includes everyone who adds comments here instead of contributing a patch. Age of a ticket is entirely irrelevant: There is no unlimited (wo)manpower. If you want to see this fixed, write a patch to fix it. Please avoid more "me too" comments that don't help anyone. Thanks.
Comment 36•5 years ago
IMHO 7 years, 16 votes, 33 watchers, 1 blocked bug, 5 duplicates and a problem which is objectively important and a huge limitation in an e-mail client should not be considered "entirely irrelevant" when the devs decide the roadmap of the next Thunderbird release...
After all, we're talking about the ability to perform an "exact match" search, something that should be one of the very first things that one needs in an e-mail client...
Of course you can always say "step up and fix it yourself", but I don't think that's quite fair. I don't think there are many people here who have the knowledge to fix this, but this doesn't mean that they are not legitimate Thunderbird users who would like to see it improved. And if the "me too" is annoying and "irrelevant", how could the community express its opinion on the importance of this problem vs another one? What is driving the devs' decisions?
Just IMHO, as I already said.
Comment 37•5 years ago
@Mauro: Some egos just don't like to be told what other people feel is important when it doesn't match the shiny new feature they want to add.
Adding new stuff is always more fun for a developer (I know, I've been one) than fixing old code!
I work 60-ish hours every week, have a family and a house, I don't even have time to play the games I paid cash for, let alone debug a mess of code like Thunderbird...
It is possible to fix it, because someone wrote a plugin that provides better search functionality than TB does itself, but it's ridiculous that something this basic is still bothering people 7 years and several duplicates later!
Just a little quote to close this comment:
"Constant and intense critique is one of the reasons we build great products. It's harder to fall into group-think if there is always a healthy amount of dissent. We want to encourage vibrant debate inside of the Mozilla community, we want you to disagree with us, and we want you to effectively argue your case. However, we require that in the process, you criticize things, not people"
Comment 38•5 years ago
I have been a volunteer free-software contributor[*] and I'm also a long-time Thunderbird user frustrated by this bug like you. I can tell you that the tone of your comments is actually detracting from getting this bug fixed. Metoo comments (including this one) actually make developers mute and remove bugs from their lists to reduce the signal to noise ratio. Moreover, no volunteer dev likes to be told that their work is bad or a mess, that they must fix something in their limited free time that may not bother them personally, and they may not have enough free time to have a family or enough money to buy a house... Nobody wants to reward bad behaviour by fixing bugs under pressure.
There are many positive and constructive ways for a user to express their interest for a particular bug:
- Politely explain why you think the bug deserves more attention by adding arguments that have not been mentioned before (the minimum effort one can do before commenting is to actually read previous comments).
- Add constructive information to the bug report: a deep analysis of the issue, high-level suggestions for potential solutions designs, a high-level sketch of solution implementation, references to fixes implemented in add-ons or other email readers, etc.
- Volunteer in thunderbird in some other way that gives you a voice in setting the roadmap.
- Organise users affected to set up a substantial bounty.
- Reach out to companies using/contributing to Thunderbird to invest resources in this particular bug.
- etc.
[*] I recommend trying to be one, even for a short period; you'll learn a lot and your perspective will change on many things.
(I believe this comment should be marked as off-topic and hidden by default)
Comment 39•5 years ago
I wrote a "me too" comment 6 years ago.
Just FYI - there is a workaround for finding exact matches. Don't use global search; just right-click on a folder (even a parent/top folder) and click Search Messages.
It's not an indexed search, it takes time, and you can search only one mailbox at a time, but it works.
Comment 40•5 years ago
(In reply to M Lopez-Ibanez from comment #38)
I have been a volunteer free-software contributor[*] and I'm also a long-time Thunderbird user frustrated by this bug like you. I can tell you that the tone of your comments is actually detracting from getting this bug fixed. Metoo comments (including this one) actually make developers mute and remove bugs from their lists to reduce the signal to noise ratio. Moreover, no volunteer dev likes to be told that their work is bad or a mess, that they must fix something in their limited free time that may not bother them personally, and they may not have enough free time to have a family or enough money to buy a house... Nobody wants to reward bad behaviour by fixing bugs under pressure.
There are many positive and constructive ways for a user to express their interest for a particular bug:
- Politely explain why you think the bug deserves more attention by adding arguments that have not been mentioned before (the minimum effort one can do before commenting is to actually read previous comments).
- Add constructive information to the bug report: a deep analysis of the issue, high-level suggestions for potential solutions designs, a high-level sketch of solution implementation, references to fixes implemented in add-ons or other email readers, etc.
- Volunteer in thunderbird in some other way that gives you a voice in setting the roadmap.
- Organise users affected to set up a substantial bounty.
- Reach out to companies using/contributing to Thunderbird to invest resources in this particular bug.
- etc.
[*] I recommend trying to be one, even for a short period; you'll learn a lot and your perspective will change on many things.
(I believe this comment should be marked as off-topic and hidden by default)
I respect you taking time to present a balanced, friendly response. It's a refreshing change from the high-and-mighty and arrogant responses.
It's the latter that triggered responses 36-38... It's not just users requesting bug fixes that should be aware of their tone!
I've been a developer and also did open source, so I know what you mean. And yes, IMHO comments # 33, 34 should be marked #MeToo, and 35, 37, 38 and 40 as #OffTopic.
But automatically considering MeToo's as bad and negative is NOT productive either! It's other people confirming that the bug is a problem that helps to identify issues that might need more attention. Just discarding MeToo's is disregarding that important indicator, and there seems to be no other way to help identify what users find important.
I never said that anyone's work is bad, or a mess, I referred to the WHOLE TB codebase as a Mess of Code... (Which it is! Nobody can say it is structured, uniform coding... At least not without lying)
And yes, this is a clear example why free/open-source software will never be able to fully get to the level commercial code does: Nobody wants to work on fixing old code that isn't a flashy new feature, whereas an employer can go tell developers "go fix this"... Having said that, I - most of the time - still prefer the free/open-source software!
Now let's close this discussion, as it contributes nothing to this bug, or the fixing of this broken behavior.
So finally, to actually add a constructive element to this comment to further indicate why this should be fixed:
I know of several people who prefer to use any of Microsoft's mail products over TB simply because they cannot find what they're looking for in TB when they need to search for things, despite me having it set up for them! So this is definitely costing TB users, and a bigger user base means a better representation, and a bigger 'voice' for the TB team and the Mozilla Foundation.
Now if I know several people, there's going to be more people that know several people like that...
Comment 41•5 years ago
Is this something money could fix?
I ask, because time & talent aren't things I can contribute, but money is.
I pledge $200 to this getting fixed. Is that realistic? Will others join my pledge?
Comment 42•5 years ago
I thought it was just me going crazy - no wonder I found TB search (filter) to be nearly useless if this is the default (!)
I want to add that this is very counter-intuitive.
When I type a search (filter) word (especially as I use 2-3 languages in my emails) I need it to use the exact word - especially as it is not clear how to "force" TB to use the search term verbatim.
Ah - found right click "search" - but searching the Body of the text seems to be broken for me - no results when I know there should be...
Search Subject does work...
Comment 43•5 years ago
I'll happily match the $200 pledge for getting this fixed. Is there a way to do that officially? (I didn't find one in a quick search.) I don't want to renege on it by say missing an email.
Post processing seems perfectly fine to me - if there is an internal limit hit, the result is not worse. The limit would already get hit in the current code, right? Just more to wade through manually. It might be nice to warn though.
Alternatively, a 2x doubling of index storage sounds like a small cost to get such an important feature working as expected. Even smaller now, relatively, given the passage of time (hardware specs continue to evolve, other software bloats much faster).
Comment 44•5 years ago
if there is an internal limit hit, the result is not worse
Oh, except that adding more search terms doesn't shrink the original result set but expands it, so there's no easy user workaround(?) Hmm....
Comment 45•5 years ago
A larger index would seem like a very agreeable 'sacrifice' for a better (non-stemmed) search... Stemming might make sense in some cases, but in most cases (both for myself and for many people around me) a literal search is what we're looking for!
Comment 46•5 years ago
Anything is good - I last contributed to this thread * 6 * years ago, yes folks that's SIX years ago.
I'll happily test and do interface review - very experienced dev for web and UX here - and wrangler of database search - it would be trivially easy if they hadn't gone for stemming.
It's lunacy as it stands.
And add an up-front date filter - half the time there's no need to search 10 years of data, usually all I want is the last year.
Comment 47•5 years ago
This is a terrible issue. I'm trying to find some emails that mention a specific company name, and it is showing all the emails that contain "Here".
I've quickly reviewed this issue discussion, and I understand there is no fix for this because we don't want to expand the index.
Fine: why not just do a two-step search? E.g.:
Step 1) Find in the index all messages that are stemmed hits for our query, as happens now.
I understand that this gives us a list of hits and the word on which each hit occurred (since Thunderbird highlights the words).
Step 2) Filter the word matches and keep on the list only those that match the word exactly (this could be activated by a checkmark or a specific query string).
Looks so simple to implement. Am I missing something?
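The two-step idea above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `toy_stem` is a crude suffix stripper invented for the example (it is not the stemmer Thunderbird uses), the index is a plain in-memory dictionary rather than Thunderbird's global database, and all function names are hypothetical.

```python
import re


def toy_stem(word):
    """Crude suffix stripper standing in for a real stemmer (hypothetical)."""
    word = word.lower()
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def tokenize(text):
    """Split text into lowercase word tokens."""
    return re.findall(r"\w+", text.lower())


def build_stemmed_index(messages):
    """Map each stem to the set of message ids containing a word with that stem."""
    index = {}
    for msg_id, body in messages.items():
        for token in tokenize(body):
            index.setdefault(toy_stem(token), set()).add(msg_id)
    return index


def search(index, messages, term, exact=False):
    """Step 1: stemmed index lookup. Step 2 (optional): keep exact word hits only."""
    candidates = index.get(toy_stem(term), set())
    if not exact:
        return sorted(candidates)
    term = term.lower()
    return sorted(m for m in candidates if term in tokenize(messages[m]))


MESSAGES = {
    1: "We are converting the account from POP to IMAP.",
    2: "The conversion is done.",
    3: "He converts files daily.",
}
INDEX = build_stemmed_index(MESSAGES)

print(search(INDEX, MESSAGES, "converting"))              # stemmed hits
print(search(INDEX, MESSAGES, "converting", exact=True))  # exact hits only
```

Because step 2 only re-reads the messages that step 1 already returned, the index itself does not have to grow. The cost is that any internal cap on the number of stemmed hits would still apply before the exact filter runs.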
Comment 48•5 years ago
(In reply to cristian from comment #47)
...
I've quickly reviewed this issue discussion, and I understand there is no fix for this because we don't want to expand the index.
No, index size and speed were the reasons for stemming in the original design 10+ years ago. Hardware has improved, so those should no longer be blocking factors.
The reason you haven't seen this yet is that no volunteer has offered to fix it, and amongst paid staff it is far from the top of the heap of thousands of things to fix. I say this as someone who knows the process and would also like to see this fixed. I expect it will be higher when search in general gets a revamp, but I don't see that happening in the current development cycle for version 78, which is to come out this spring.
Comment 49•5 years ago
(In reply to Wayne Mery (:wsmwk) from comment #48)
The reason you haven't seen this yet is that no volunteer has offered to fix it, and amongst paid staff it is far from the top of the heap of thousands of things to fix. I say this as someone who knows the process and would also like to see this fixed. I expect it will be higher when search in general gets a revamp, but I don't see that happening in the current development cycle for version 78, which is to come out this spring.
Thanks for your reply. I'm mentioning it because maybe some volunteer missed the point here and is not willing to work on a full revamp of search, but could work on just a quick post-filtering of search results.
I'm willing to do it, but I'll need a longer onboarding time on the project and I have little spare time. I mean, I would take some time, but not 8 years, to do it :). Maybe someone with more experience just needs the idea of how to proceed, and is thinking of too complex a change for what could be a quick fix?
Comment 50•5 years ago
I think the easiest way to partially fix this is to replace stemming with lemmatization.
Comment 51•4 years ago
Is there any chance this will be prioritized in the near future?
Search in TB is a mess and it is driving me crazy not being able to find mail that I KNOW is somewhere in my mail storage.
This is the single biggest reason I loathe TB as my mail client.
Please fix this!
Put a bounty on it and let users chip in.
Comment 52•4 years ago
I no longer use the global search, and just do CTRL+SHIFT+F (CMD +SHIFT+F on MacOS) now... It’s considerably slower in populating results, but at least it shows me what I am looking for, instead of the useless crap the stemmed option in global search regurgitates!
(IMHO they should just drop it altogether, and just give us back the storage and cpu cycles that creating the stemmed database cost...)
Comment 53•4 years ago
(In reply to Fear na Boinne from comment #52)
I no longer use the global search, and just do CTRL+SHIFT+F (CMD +SHIFT+F on MacOS) now... It’s considerably slower in populating results, but at least it shows me what I am looking for, instead of the useless crap the stemmed option in global search regurgitates!
(IMHO they should just drop it altogether, and just give us back the storage and cpu cycles that creating the stemmed database cost...)
I completely agree. I also deleted the search index recently and turned off the search in advanced settings as it uses over 10GB of data and I have to delete valuable data to keep my computer working properly. Search is (one of) the most used feature for me and I hate it that there is no proper alternative. I have over 20 e-mail accounts and I now search them one by one every time I need to find an email message with the per mailbox search. Unfortunately it is even more cumbersome as I filter many messages into several folders. This stemming makes using Thunderbird very time consuming.
Comment 57•3 years ago
I understand the reasons the global search returns unwanted results. However, why does this seem not to be the case for the message filter that searches messages in a particular folder (e.g. Inbox)?
For example, if I want to filter messages with a subject containing "direct debit," the filter is able to parse out and return messages only containing "direct debit" in the subject. It does not return messages containing either direct or debit, which the global search would return.
Is there any way to utilize the algorithm from the message filter in the global search?
Comment 58•3 years ago
Because quick filter doesn't use the global search (index), but searches on demand.
Comment 59•3 years ago
Magnus,
I appreciate your answer, but your answer feels like it is equivalent to "because I said so."
Also, your answer does not address the questions that I asked. The point I tried to make (perhaps poorly) is: if Mozilla can do the search on "demand" for the filter, then why can't employ the same search method for the global search. It's analogous to building a car that can go forward and reverse, and building a truck that can only go forward.
Thanks anyway.
Comment 60•3 years ago
It may be practical to search a few messages, but impractical to search too many messages without using an index. And stemming makes it easier to create an index.
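The trade-off between searching on demand and searching through an index can be illustrated with a small Python sketch. It is a toy model, not Thunderbird code: the message store is a dictionary, the function names are hypothetical, and the index shown here is an exact-word index (no stemming), purely to show the difference in how much work each query does.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Split text into lowercase word tokens."""
    return re.findall(r"\w+", text.lower())


def scan(messages, term):
    """On-demand search: every query reads every message."""
    term = term.lower()
    return sorted(mid for mid, body in messages.items() if term in tokenize(body))


def build_exact_index(messages):
    """Built once up front: map each exact word to the ids of messages containing it."""
    index = defaultdict(set)
    for mid, body in messages.items():
        for token in tokenize(body):
            index[token].add(mid)
    return index


def lookup(index, term):
    """Indexed search: a single dictionary lookup per query."""
    return sorted(index.get(term.lower(), set()))


MESSAGES = {
    1: "Direct debit set up",
    2: "direct flight",
    3: "Debit card",
}
INDEX = build_exact_index(MESSAGES)

print(scan(MESSAGES, "debit"))
print(lookup(INDEX, "debit"))
```

Both functions return the same messages. The scan's cost grows with the total amount of mail on every query, while the index pays that cost once at build time and then needs storage for every distinct word it keeps.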
Comment 61•3 years ago
Easier to create an index at the cost of being unable to get accurate results for some searches. Personally, I'm happy to have to wait longer for exact match results on the relatively few occasions when it's required. As it stands the only way to do this is to use an extension and search folder-by-folder, which is about a 1 on a 1-10 UX scale.
Comment 62•3 years ago
(In reply to mozilla from comment #61)
Easier to create an index at the cost of being unable to get accurate results for some searches. Personally, I'm happy to have to wait longer for exact match results on the relatively few occasions when it's required. As it stands the only way to do this is to use an extension and search folder-by-folder, which is about a 1 on a 1-10 UX scale.
You can use the built-in Global Search ([CTRL|CMD]+SHIFT+F) to search one or more folders without stemming. I think it would be an easier ask to make Global Search a widget for the toolbar(s) than to change their mind on the usefulness of the stemming search (for me the usefulness is a -3 on a 1-10 scale, because I NEVER use it, but occasionally forget I should use the Global Search instead!) that is the default now…
Comment 63•3 years ago
Frankie suggests trying ("why can't employ the same search method for the global search"), but the architecture of global search doesn't fit quick filter. So Magnus is absolutely correct and directly answered the question of comment 57 - a solution is never going to come from plumbing global search capabilities into Quick Filter Bar.
The possible architectural solutions are comment 1, comment 6, and a complete replacement of gloda - none of which are in scope for the next version, 103.
For now, and for the foreseeable future, possible workarounds are Edit > Find > Search Messages, mentioned in comment 17 (repeated in comment 23), and ...
(In reply to github from comment #21)
As a workaround with this plugin search seems to be better:
https://addons.mozilla.org/en-US/thunderbird/addon/gmailui/
The updated version is https://addons.thunderbird.net/en-US/thunderbird/addon/expression-search-ng/ If you want to support his work, suggest helping with the code, or donating. Open issues are listed at https://github.com/opto/Expression-Search-NG/issues (this bug report is not a place to discuss this addon).
Comment 65•3 years ago
Sorry, I am not getting this. This is not a bug but a feature enhancement request, and thus not a duplicate of my bug report. The fact that search often returns 98% false positives means that search doesn't work, so it's not a feature request; it's a bug and should be marked accordingly. As far as I can see, this behavior has been around for a very long time, but for some reason I only bumped into it today. Polite requests, rude requests, it doesn't matter; we go the Microsoft way: it's not a bug, it's a feature.
I don't care about any index; just do a full-text search. I need results, and if that means it eats a thousand times more resources than 20 years ago, OK, do it. After that we can see how to make it as fast as in version 2, where it worked well with about the same amount of mail as I have today - around ten thousand messages.
The main point is that the amount of text data in emails has not risen much in the meantime, and has probably even dropped, as many mails are just a title and a link somewhere instead of the whole message. The number of emails held by active users has not risen much either, and even if it has, the search could run in the background, going deeper and deeper into history. So if it was possible to do an exact-match search in reasonable time 20 years ago, it should be instant today. I understand that as every developer adds their 1% negligible slowdown, we are doomed to wait as we always were, since the number of developers behind every button rises every year. So it may not be instant, but it should at worst be as slow as it was back then.
Maybe the stemming algorithm is language-related, so maybe using the Swahili version of Thunderbird could solve this. Am I right or not? And if so, is there a way of using the Swahili stemming algorithm in an English- or Czech-language Thunderbird?
Updated•2 years ago
Comment hidden (metoo)
Comment hidden (metoo)
Comment 71•11 months ago
Just use 'control shift F' and the poorly designed but workable slow search.
(Poorly designed because the defaults are very very dumb)
Comment 72•11 months ago
(In reply to Mark C from comment #71)
Just use 'control shift F' and the poorly designed but workable slow search.
(Poorly designed because the defaults are very very dumb)
That's a folder search, not a global search.
Comment 73•11 months ago
You can specify the top most wrapper folder e.g. 'Inbox' - I'm just making a suggestion to make it workable. But I have all my accounts combined into a unified inbox - it works for me.
Comment hidden (metoo)
Comment 79•2 months ago
I had not realised this incredibly annoying 'feature' was so long lived.
The fact that search often returns 98% false positives means that search doesn't work, so it's not a feature request; it's a bug and should be marked accordingly.
YES!!
It makes global search much less useful than it could be because it returns far too many results.
It also contradicts the online help: https://support.mozilla.org/en-US/kb/global-search says nothing about trying to be clever:
For example, if you enter:
converting “imap pop”
… Thunderbird will find emails that contain the term “converting” and the phrase “imap pop”.
Yes, it will. It will also return emails that have things like "conversion" but not "converting".
Looking at the results from TB 128.2.2esr this morning, looking for "continuity" gave me ones with "continues" and "continued".
I had another phrase, "group.io", in the search that wasn't being respected: anything with "group" (and presumably the letters 'io' somewhere) was being returned. So I strongly suspect that, in the official example, it doesn't care about returning only emails that contain "imap pop" either.
When running grep on the mailbox files produces much better results than the program's inbuilt search, that's a bug.
I have turned global search off. Are there now unused index files that can be deleted?
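For anyone who wants something like the grep approach mentioned above without leaving Python, here is a minimal sketch using the standard-library `mailbox` module. It assumes the mail is stored in mbox format and that you pass it the path to one such file; the function name `exact_word_search` and the choice to return subject lines are made up for this example, and it only looks at text parts of each message.

```python
import mailbox
import re


def exact_word_search(mbox_path, word):
    """Return the subjects of messages whose text parts contain `word` as a whole word."""
    pattern = re.compile(r"\b" + re.escape(word) + r"\b", re.IGNORECASE)
    hits = []
    box = mailbox.mbox(mbox_path, create=False)
    try:
        for msg in box:
            for part in msg.walk():
                if part.get_content_maintype() != "text":
                    continue  # skip multipart containers and attachments
                payload = part.get_payload(decode=True)
                if payload is None:
                    continue
                charset = part.get_content_charset() or "utf-8"
                try:
                    text = payload.decode(charset, errors="replace")
                except LookupError:
                    # Unknown charset name in the message: fall back to UTF-8.
                    text = payload.decode("utf-8", errors="replace")
                if pattern.search(text):
                    hits.append(msg["Subject"])
                    break  # one hit per message is enough
    finally:
        box.close()
    return hits
```

Unlike the indexed global search, this reads every message on every call, so it is slow on large mailboxes, but a search for "continuity" will not return messages that only contain "continues" or "continued".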
Comment 80•29 days ago
(In reply to Ian from comment #79)
I have turned global search off. Are there now unused index files that can be deleted?
This may help: https://support.mozilla.org/en-US/kb/rebuilding-global-database