Add custom wordforms for common multi-word search terms

VERIFIED FIXED in 1.5

Status

support.mozilla.org
Search
P3
normal
VERIFIED FIXED
9 years ago
9 years ago

People

(Reporter: cilias, Assigned: jsocol)

Tracking

unspecified

Firefox Tracking Flags

(Not tracked)

Details

(Whiteboard: sumo_only)

Attachments

(1 attachment, 1 obsolete attachment)

(Reporter)

Description

9 years ago
I've come across a case where someone looking for issues with ZoneAlarm used the search term 'zone alarm'. The instances of "ZoneAlarm" did not appear in search results. This would also apply to "book marks", "fire wall", "home page", "web site", etc.

SUMO 1.5 worthy?
(Assignee)

Comment 1

9 years ago
Adding infix indexing (aka: substring searches) is extremely expensive for the indexer. However, an alternative method is to add a wordforms list with specific overrides, like

Zone Alarm > ZoneAlarm
book marks > bookmark (singular for stem matching)
fire wall > firewall

etc.
Component: General → Search
Priority: -- → P3
QA Contact: general → kb-software
Summary: Search should ignore spaces → Add custom wordforms for common multi-word search terms
Target Milestone: --- → 1.5
(Assignee)

Updated

9 years ago
Assignee: nobody → james

Comment 2

9 years ago
plug-in -> plugin
addons -> add-ons
(Assignee)

Comment 3

9 years ago
Created attachment 414112 [details] [diff] [review]
add wordforms.txt

NB: This patch needs to be applied directly in the scripts/sphinx directory, not from /trunk/.

This adds a wordforms.txt and references it in sphinx.conf (also correcting references to stopwords.txt). These .txt files are found in ROOT_PATH from localsettings.py--you may need to move things around.
Attachment #414112 - Flags: review?(morgamic)
(Assignee)

Comment 4

9 years ago
Oh, and a good search for testing is "face book". Before adding wordforms, I found 8 results in my database from Nov 18. After adding wordforms it was over 250.
(Assignee)

Updated

9 years ago
Attachment #414112 - Flags: review?(morgamic) → review?(paulc)
(Assignee)

Updated

9 years ago
Attachment #414112 - Flags: review?(paulc) → review?(morgamic)
Comment on attachment 414112 [details] [diff] [review]
add wordforms.txt

Looks reasonable to me.  Could you fix the justification for wordforms?  It's not aligned with the other stuff.
Attachment #414112 - Flags: review?(morgamic) → review+
(Assignee)

Comment 6

9 years ago
r56764.

Justified both the wordforms.txt and the instructions in sphinx.conf (will attach).
Status: NEW → RESOLVED
Last Resolved: 9 years ago
Resolution: --- → FIXED
(Assignee)

Comment 7

9 years ago
Created attachment 414294 [details] [diff] [review]
justification
Attachment #414112 - Attachment is obsolete: true
Verified, FIXED.
Status: RESOLVED → VERIFIED
(Assignee)

Updated

9 years ago
Blocks: 532156
(Assignee)

Updated

9 years ago
Whiteboard: sumo_only
You need to log in before you can comment on or make changes to this bug.