Closed Bug 781177 Opened 12 years ago Closed 12 years ago

[ia] Use ES "more like this" for related documents

Categories

(support.mozilla.org :: Knowledge Base Software, task, P2)

Tracking

(Not tracked)

RESOLVED FIXED
2012Q3

People

(Reporter: rrosario, Assigned: rrosario)

References

Details

(Whiteboard: u=user c=wiki p=3)

ElasticSearch has a nice feature for finding similar documents:
http://www.elasticsearch.org/guide/reference/api/more-like-this.html

We should use this and delete our cron job with the huge SQL statement that uses tags. We could convert this to use topics and products but it will be much easier to just use elastic search.

Elasticutils doesn't support this yet but I filed an issue:
https://github.com/mozilla/elasticutils/issues/53
Putting this into Q3 milestone for now. We'll add to a sprint once elasticutils gets this feature landed.
Target Milestone: --- → 2012Q3
pyes.ES implements morelikethis(). We may be able to just call to that directly.
My vote is add a method to search.models.SearchMixin for morelikethis and use the same args as the pyes ES morelikethis. It would call get_es() to get an ES and call morelikethis on that.

Then in wiki.models.Document, create a "related_documents" or a "morelikethis" that calls SearchMixin.morelikethis with the specifics (e.g. the fields, the instance id, ...).

In elasticutils, if/when we ditch pyes, we'll continue to have an ES and we'll make sure it has a morelikethis going forward.
Assignee: nobody → rrosario
I think this will be 3pts. I know I will run into random issues.
Whiteboard: u=user c=wiki p= → u=user c=wiki p=3
Priority: -- → P2
Depends on: 778118
Blocks: 778118
No longer depends on: 778118
This landed this morning in prod.
Status: NEW → RESOLVED
Closed: 12 years ago
Resolution: --- → FIXED
You need to log in before you can comment on or make changes to this bug.