755843 - Enable proper search weights in production

Reporter

Description

•

12 years ago

Currently search is treating all matches with equal weight. This is impacting our search results. 

We'd like to do a quick sanity check with waffling so I can get a general sense of search accuracy. Once the sanity check is done, please waffle 50% of users on the new search weights and 50% on the current equal weights. We will monitor the CTR for a few days before making the change for all users. 

We will need the urls for the waffling to pull the CTR. Let me know if you have any questions.

Will Kahn-Greene [:willkg] ET needinfo? me

Assignee

Comment 1

•

12 years ago

So... couple of questions:

1. Next sprint I'll finish up the work for the unified search results view. I was under the impression we were going to leave the existing bucketed search results view as is and only fiddle with weights and other similar things with the unified search results view. Is this supposed to apply to the unified search results view, the bucketed search results view, or both?

2. This doesn't specify the weights you want to use. Do we have weights we want to use? We could use the weights we were using with Sphinx as a start.

Matt Grimes [:Matt_G]

Reporter

Comment 2

•

12 years ago

Hey Will. I spoke to Cheng and it looks like CTR reporting is broken right now anyway. We can discuss in more detail when we meet on Tues as I'll need to fully understand the unified search question. I was under the impression that these values were the "Out of the box" values that we were going to use:

http://kitsune.readthedocs.org/en/latest/searchchapter.html#search-scoring

We can tweak them as needed after we see how well they work.

Will Kahn-Greene [:willkg] ET needinfo? me

Assignee

Comment 3

•

12 years ago

Those are the values that Sphinx search uses and that I thought our Elasticsearch code was using, but discovered last week that wasn't the case because the weights weren't being applied.

I'm game for using those weights. Other than "that's what we used for Sphinx", I don't know what they're based on.

It's less "unified search" and more "unified search results". Right now the site shows a "bucketed results view"--it shows 10 kb articles first, then shows support forum results. It's bucketed in that the results come from one group, then the next regardless of the scoring.

We're moving to a "unified search results" view where the results are interweaved and ordered only by score. i.e. One big result set rather than kb results first, then support forum results, ...

The rest of that work will hopefully get done in the next sprint. Then you'll have a unified search results view to work with.

However, the plan is that we're going to tinker with the scoring to get it to "good enough" first before letting regular users see the unified search results view. Given that, I'm not sure the plan you've outlined in the description of this bug will work.

We can discuss more on Tuesday, but that's the gist of where things are at.

Will Kahn-Greene [:willkg] ET needinfo? me

Assignee

Comment 4

•

12 years ago

Putting this in the 2012.12 sprint and making it a 1 pointer. All we're going to do for this bug is add query-time weights to the ES unified search results view using the numbers we use for Sphinx search.

Any additional work will be shunted to other bugs.

Assignee: nobody → willkg

OS: Mac OS X → All

Hardware: x86 → All

Whiteboard: u=user c=search p=1

Target Milestone: --- → 2012.12

Will Kahn-Greene [:willkg] ET needinfo? me

Assignee

Comment 5

•

12 years ago

Also, making this a P1. This needs to get done for the next sprint.

Priority: -- → P1

Matt Grimes [:Matt_G]

Reporter

Comment 6

•

12 years ago

This sounds great. Can we waffle a couple of days in Prod to see current vs weighted search? I'd also like to grab Verdi and few others to do some manual testing before we push to prod 100%. Sound good?

Will Kahn-Greene [:willkg] ET needinfo? me

Assignee

Comment 7

•

12 years ago

Let's try not to use shorthand since it gets confusing because there are a lot of things going on here.

The current plan is that I'm pushing out the code changes for the unified search results and that they will be behind the esunified waffle flag. Users won't see it until the search results are "good enough".

This bug is one of the things we'll be doing to get the search results "good enough".

I'm hesitant to add a waffle flag covering weighted and unweighted ES unified search results. Why is weight vs. unweighted an interesting contrast? Depending on the outcome of that, what's the next step?

Matt Grimes [:Matt_G]

Reporter

Comment 8

•

12 years ago

If we won't be unveiling any changes until the results are "good enough" then I am fine with your plan. My concern is that we assume adding the weights as implemented in Sphinx will be an improvement. In the off chance that it is not an improvement, we would want to know so that could either back the weights out or tweak them as needed. It sounds like all of that can still be done before any users see the change. Is that correct?

Will Kahn-Greene [:willkg] ET needinfo? me

Assignee

Comment 9

•

12 years ago

Matt: Seems like there's some confusion here, but I don't understand what's going on. Let me know when you're around and we can chat on skype, then summarize the conclusion here.

Will Kahn-Greene [:willkg] ET needinfo? me

Assignee

Comment 10

•

12 years ago

Talked to Matt via the phone.

All we're going to do here is add the weights that we used in Sphinx to the unified search results view.

After that, any further weight tweaking will be spun out from work done on the analysis bug #765768.

Will Kahn-Greene [:willkg] ET needinfo? me

Assignee

Comment 11

•

12 years ago

In a pull request now: https://github.com/mozilla/kitsune/pull/660

Will Kahn-Greene [:willkg] ET needinfo? me

Assignee

Comment 12

•

12 years ago

Landed in master in https://github.com/mozilla/kitsune/commit/d7ad2f8fb9bf6e2cabbb3a0824a26a10a12cb263

Target Milestone: 2012.12 → 2012.11

Will Kahn-Greene [:willkg] ET needinfo? me

Assignee

Comment 13

•

12 years ago

Pushed to production just now.

Status: NEW → RESOLVED

Closed: 12 years ago

Resolution: --- → FIXED

Bugzilla

Quick Search

Enable proper search weights in production

Categories

(support.mozilla.org :: Search, defect, P1)

Tracking

(Not tracked)

People

(Reporter: Matt_G, Assigned: willkg)

References

Details

(Whiteboard: u=user c=search p=1)

Crash Data

Security

(public)

User Story

Description

Comment 1

Comment 2

Comment 3

Comment 4

Comment 5

Comment 6

Comment 7

Comment 8

Comment 9

Comment 10

Comment 11

Comment 12

Comment 13