ElasticSearch has a nice feature for finding similar documents: http://www.elasticsearch.org/guide/reference/api/more-like-this.html We should use this and delete our cron job with the huge SQL statement that uses tags. We could convert this to use topics and products but it will be much easier to just use elastic search. Elasticutils doesn't support this yet but I filed an issue: https://github.com/mozilla/elasticutils/issues/53
Putting this into Q3 milestone for now. We'll add to a sprint once elasticutils gets this feature landed.
Target Milestone: --- → 2012Q3
pyes.ES implements morelikethis(). We may be able to just call to that directly.
My vote is add a method to search.models.SearchMixin for morelikethis and use the same args as the pyes ES morelikethis. It would call get_es() to get an ES and call morelikethis on that. Then in wiki.models.Document, create a "related_documents" or a "morelikethis" that calls SearchMixin.morelikethis with the specifics (e.g. the fields, the instance id, ...). In elasticutils, if/when we ditch pyes, we'll continue to have an ES and we'll make sure it has a morelikethis going forward.
I think this will be 3pts. I know I will run into random issues.
Whiteboard: u=user c=wiki p= → u=user c=wiki p=3
This landed on master: https://github.com/mozilla/kitsune/commit/39841f51379da3aaa2d511f5b065dc7b98ae1d41 I'll deploy to prod on Monday.
This landed this morning in prod.
Status: NEW → RESOLVED
Last Resolved: 5 years ago
Resolution: --- → FIXED
You need to log in before you can comment on or make changes to this bug.