Closed Bug 182366 Opened 23 years ago Closed 16 years ago

Using machine learning to order autocomplete results

Categories: SeaMonkey :: Autocomplete
Type: defect
Priority: Not set
Severity: normal

Tracking: Not tracked
Status: RESOLVED FIXED

People

(Reporter: nisheeth_mozilla, Assigned: nisheeth_mozilla)

References

Details

Attachments

(4 files, 9 obsolete files)

The attached patch is a first cut at using machine learning techniques to better order URLs in the urlbar autocomplete dropdown. The current implementation uses a simple linear classifier (a perceptron) which is trained implicitly each time the user selects a URL from the autocomplete dropdown.

A set of numeric features (X1, X2, ... Xn) is calculated for each URL and fed to the perceptron along with the boolean value (B) of whether the URL was selected by the user. The perceptron maintains an array of numeric weights (W1, W2, ... Wn), one per input feature, and updates them as it sees training data, attempting to arrive at a set of weights such that:

  sigmoid( sum(i = 1 to n) of Wi * Xi ) = B,   where sigmoid(x) = 1 / (1 + exp(-x))

The goal is to train the perceptron with enough training tuples <X1, X2, ... Xn, B> that it converges on a set of weights which correctly outputs B for any new instance <X1, X2, ... Xn>. Once such a set of weights is reached, the perceptron can be used to predict whether a given URL will be selected by the user or not.

This is a very simple implementation and many ideas have been left out of this short description. I'll add more information here as questions are raised...
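For readers who prefer code to notation, here is a minimal, self-contained sketch of that idea: online gradient descent on a single sigmoid unit. The class and member names are placeholders for illustration, not the patch's actual nsSigmoidPerceptron, and the update rule shown is one standard way to realize the training described above rather than the patch's exact code.

#include <math.h>

class SigmoidPerceptron {
public:
  explicit SigmoidPerceptron(int aNumFeatures)
    : mNumWeights(aNumFeatures), mWeights(new double[aNumFeatures]) {
    for (int i = 0; i < mNumWeights; ++i)
      mWeights[i] = 0.0;
  }
  ~SigmoidPerceptron() { delete [] mWeights; }

  // Predicted probability of selection: sigmoid( sum(Wi * Xi) )
  double Test(const double* aInputs) const {
    double sum = 0.0;
    for (int i = 0; i < mNumWeights; ++i)
      sum += mWeights[i] * aInputs[i];
    return 1.0 / (1.0 + exp(-sum));
  }

  // One online training step toward the target output B (0 or 1).
  void Train(const double* aInputs, double aTargetOutput, double aLearnRate) {
    double output = Test(aInputs);
    double delta  = aTargetOutput - output;
    // output * (1 - output) is the derivative of the sigmoid at this point
    for (int i = 0; i < mNumWeights; ++i)
      mWeights[i] += aLearnRate * delta * output * (1.0 - output) * aInputs[i];
  }

private:
  int     mNumWeights;
  double* mWeights;
};

Each time the user picks (or ignores) a URL from the dropdown, Train() is called once per candidate URL with B = 1 for the selected URL and B = 0 for the rest; Test() can then be used to rank candidates by predicted selection probability.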
Attached patch Patch v0.1 (obsolete) — — Splinter Review
Version 0.1 of patch.
Comment on attachment 107650 [details] [diff] [review]
Patch v0.1

Heikki, please review the attached patch. Thanks!
Attachment #107650 - Flags: review?(heikki)
Adding Heikki to the cc list.
Joe Hewitt is the owner of urlbar autocomplete. CCing him as well. Joe, I'll get in touch with you to go over this patch interactively...
There is a patch in bug 180336 that should give you some pointers on how to replace the standard C++ file handling with Mozilla versions.
Heikki, I'm about to attach a revision of the patch that addresses the following comments from your past review:
- In FillInputFeatures, use NS_NewURL more for parsing the url.
- Class names, member variables, and function arguments should follow Mozilla naming conventions.
- File saving and loading should use nsIFile.
- Since the output format is so close to XML, make it XML.
- Fix tabbing.
Status: NEW → ASSIGNED
Attached patch Patch v0.2 (obsolete) — — Splinter Review
Fixes to first review from Heikki.
Attachment #107650 - Attachment is obsolete: true
Attached patch Patch v0.3 (obsolete) — — Splinter Review
IDs are now output to the data file to identify a URL in case the user doesn't want to output the url path.
Attachment #108111 - Attachment is obsolete: true
Attachment #108254 - Flags: review?(heikki)
Comment on attachment 108254 [details] [diff] [review]
Patch v0.3

You should try to test build these changes on all the major platforms at least, because you use more math etc. stuff than most of the code I've seen. Some math stuff we have in NSPR actually; I'll try to see if I can spot any which could be changed to NSPR.

>Index: autocomplete/resources/content/autocomplete.xml
>===================================================================

>Index: bookmarks/public/nsIBookmarksService.idl
>===================================================================
>+ void getAddDate(in string aBookmarkURL, out long long aAddDate);

Is there some Date data type that you could use instead? |long long| does not look like a date to me (although I know how it works, but still :). Also I'd like to see the javadoc-style comments for members in IDL files.

>Index: bookmarks/src/nsBookmarksService.cpp
>===================================================================

Will continue...
Comment on attachment 108254 [details] [diff] [review]
Patch v0.3

sr=me
Attachment #108254 - Flags: superreview+
> You should try to test build these changes on all the major platforms at least

I've started a checkout on Linux and I'm building on Windows.

> Is there some Date data type that you could use instead, |long long| does not
> look like a date to me

I agree it looks strange. I looked for an nsDate in xpcom/ds and the closest thing they have there is class nsTime, which uses the same PRTime (#defined to PRInt64) parameters that I am using. I couldn't find any usage of nsTime as a parameter in IDL files. I could use it in nsIBookmarksService.idl, but then I'd have to use the native qualifier and make the method unscriptable.

I did find usage of "long long" in IDL files to denote time (see the LXR snippet below). One way to fix the strangeness might be to rename the method from getAddDate() to getAddTime(). That's actually closer to what is really going on anyway... What do you think?

/xpfe/appshell/public/nsITimingService.idl, line 52 -- long long getElapsedTime(in string timername);
/xpfe/appshell/public/nsITimingService.idl, line 60 -- void setTimer(in string timername, in long long starttime);
/xpfe/components/download-manager/public/nsIDownloadManager.idl, line 80 -- in long long startTime,
/mailnews/base/public/nsIMsgGroupRecord.idl, line 49 -- in long long time,
/mailnews/base/public/nsIMsgGroupRecord.idl, line 61 -- in long long time,
/mailnews/base/public/nsIMsgGroupRecord.idl, line 85 -- readonly attribute long long addTime;

> Also I'd like to see the javadoc style comments for members in IDL files

Consider it done! :-)
Comment on attachment 108254 [details] [diff] [review]
Patch v0.3

Putting into approval queue as well... I've sent email out about this separately to drivers@mozilla.org.
Attachment #108254 - Flags: approval1.3a?
OK, the patch builds fine on Linux...
And it builds fine on the Mac OSX build as well... So, build wise we are all set for all the platforms built on the main tinderbox page...
Comment on attachment 108254 [details] [diff] [review]
Patch v0.3

>Index: history/src/nsGlobalHistory.cpp
>===================================================================
>+#define AC_NUM_URL_FEATURES 44

It would be nice to comment what these mean in practice:

>+const PRFloat64 LEARN_RATE = 0.5;
>+const PRFloat64 HISTORY_FAST_DECAY_CONSTANT = 0.2;
>+const PRFloat64 HISTORY_SLOW_DECAY_CONSTANT = 0.8;
>+const PRFloat64 BOOKMARK_FAST_DECAY_CONSTANT = 0.2;
>+const PRFloat64 BOOKMARK_SLOW_DECAY_CONSTANT = 0.8;

>+//----------------------------------------------------------------------
>+// Sigmoid perceptron definitions and implementation
>+// XXX The class definitions need to go in a separate header file
>+
>+class nsPerceptron

The comment talks only about the sigmoid perceptron but you have the plain one here as well, and it's first. Also you may want to change the name to tell which/what perceptron, but since it is the only one for now it doesn't matter that much.

>+ nsPerceptron(int aNumFeatures);

You might want to put the default constructor (without args) into the private members without an implementation, which would guarantee that this constructor will always be called or it won't compile.

>+ if (mNumWeights > 0 && mWeights != NULL)

We tend to group things explicitly, so add some parens.

>+ virtual Train(PRFloat64* aInputs, int aNumInputs, PRFloat64 aTargetOutput);
>+ virtual Test(PRFloat64* aInputs, int aNumInputs, PRFloat64* aOutput);
>+ int mNumWeights;

|PRInt32| instead of |int|? Same in nsSigmoidPerceptron.

>+static nsSigmoidPerceptron sAutoCompleteLearner(AC_NUM_URL_FEATURES);

Static objects are bad. Make this into a service or a normal member variable of the history/autocomplete service.

>+nsPerceptron::nsPerceptron(int aNumFeatures)
>+{
>+ mNumWeights = aNumFeatures;
>+ mWeights = NULL;
>+ if (mNumWeights > 0)
>+ {
>+ mWeights = new PRFloat64[aNumFeatures];
>+ }

Maybe you should have an Init method or something, because that |new| could fail and you can't inform the caller if you do this in the constructor. Also, set mNumWeights after the |new| succeeded; otherwise those two member variables are out of sync and you might cause a crash. Do not use |NULL|, use |nsnull| or |0|.

>+ double output = 0.0;

Strange, I don't seem to be able to find any PRDouble types. Maybe double is good then.
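A minimal sketch of the fallible Init() pattern suggested above. The names and exact layout here are placeholders for illustration, not the patch's code:

#include "nscore.h"
#include "nsError.h"
#include "prtypes.h"

class nsPerceptron {
public:
  nsPerceptron() : mNumWeights(0), mWeights(nsnull) {}
  ~nsPerceptron() { delete [] mWeights; }

  // Report allocation failure to the caller instead of hiding it in the
  // constructor; the caller can then disable learning gracefully.
  nsresult Init(PRInt32 aNumFeatures) {
    if (aNumFeatures <= 0)
      return NS_ERROR_INVALID_ARG;
    mWeights = new PRFloat64[aNumFeatures];
    if (!mWeights)
      return NS_ERROR_OUT_OF_MEMORY;
    mNumWeights = aNumFeatures;   // set only after the allocation succeeded
    return NS_OK;
  }

private:
  PRInt32    mNumWeights;
  PRFloat64* mWeights;
};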
Comment on attachment 108254 [details] [diff] [review]
Patch v0.3

>Index: history/src/nsGlobalHistory.cpp
>===================================================================
>+#define AC_NUM_URL_FEATURES 44

The way the features and this define are connected is quite loose, but I couldn't come up with a nice way to do this at the moment.

>+ for (int i = 0; i < mNumWeights; i++)

|int| to |PRInt32|; there seem to be more of these elsewhere as well. Also, you should be careful with initializing variables like that in the for loop; some compilers don't get the scope right, so if you use |i| later in the function it might have different values than you expect.

>+ nsresult rv = NS_GetSpecialDirectory(NS_APP_USER_PROFILE_50_DIR, getter_AddRefs(file));

Is that the correct profile dir nowadays? Didn't it change to ".../Profiles/..."?

>+ nsCOMPtr<nsILocalFile> local_file(do_QueryInterface(file));

We tend to use interCaps -> localFile.

>+ fscanf(from, "%lf", mWeights[i]);

I'm really suspicious of all file reads. Maybe this is good enough for now, though...

>+ local_file->AppendNative(NS_LITERAL_CSTRING("ac-weights.txt"));

Could you make that string a define or const or something, since you are using it in more than one location?

>+PRFloat64 nsSigmoidPerceptron::Sigmoid(PRFloat64 d)

We usually prepend 'a' to the interCaps parameter name. There are also other functions where you don't follow this convention.

>+ PRFloat64 m;

Btw, it seems PRFloat64 is a typedef for double, just so you know...

>+ nsAutoString url;
>+ CopyASCIItoUCS2(nsDependentCString(aURL), url);

The URL is UTF8. Use NS_ConvertUTF8toUCS2? This pattern is repeated later as well.

>+nsGlobalHistory::SetRowValue(nsIMdbRow *aRow, mdb_column aCol, const PRFloat64 aValue)

The last param doesn't really need to be const.

>+ if (err != 0) return NS_ERROR_FAILURE;

The convention is: if (err) return NS_ERROR_FAILURE;

>+ if (id == 0)

Either that or !id.

>+ mACFeatures = new PRFloat64[AC_NUM_URL_FEATURES];

Check if new failed and do something if it did.

>+ (*aResult)->AddRef();

Don't call AddRef directly, because that will confuse the refcount debugging tools that rely on the macros. Use NS_ADDREF().

>+// Feature 41: Is this a google search url?
>+// Feature 42: Is this a netscape search url?
>+// Feature 43: Is this a yahoo search url?

These URLs might obviously change in the future, so if these things were to be turned on indefinitely they would need to come from prefs.

>+ do_GetService("@mozilla.org/browser/bookmarks-service;1", &rv);

If there is a define for this progid you should use it.

>+ if (HasCell(mEnv, row, kToken_TypedColumn))
>+ aFeatures[2] = 1;
>+ else
>+ aFeatures[2] = 0;

For boolean settings you could simply do:

aFeatures[2] = HasCell(mEnv, row, kToken_TypedColumn);

>+ nsCAutoString curl;
>+ NS_NewURI(getter_AddRefs(uri), aUrl);
>+ uri->GetSpec(curl);

Check if you got the URI object before using it. Also, now that you have the URI object, don't use the spec for everything!

>+ nsAutoString url(NS_ConvertASCIItoUCS2(curl).get());
>+ ToLowerCase(url);

The URL is not ASCII, it is UTF8.

>+ if (FindInReadable(NS_LITERAL_STRING(".htm"), start, end))

uri->GetPath() and then look for that.

>+ if (FindInReadable(NS_LITERAL_STRING(".com"), start, end))
>+ if (FindInReadable(NS_LITERAL_STRING(".edu"), start, end))
>+ if (FindInReadable(NS_LITERAL_STRING(".org"), start, end))
>+ if (FindInReadable(NS_LITERAL_STRING(".net"), start, end))
>+ if (FindInReadable(NS_LITERAL_STRING(".gov"), start, end))

uri->GetHost() and then look for those.

>+ if (FindInReadable(NS_LITERAL_STRING("~"), start, end))

Look at the path.

>+ // Feature 18: Does the URL end in a two letter country code?
>+ if (url.RFindChar('.') == ((url.Length() - 1) - 2))

Look at the host.

>+ PRUint32 size, i;
>+ for ( ; start != end; start.advance(size))
>+ {
>+ const PRUnichar* buf = start.get();
>+ size = start.size_forward();

Uh, that looks needlessly complicated. Wouldn't you be able to do that in a single loop, starting with:

for ( ; start != end; ++start)

>+ fprintf(mURLDataFile, " path='%s'", NS_LossyConvertUCS2toASCII(aURL).get());

Use NS_ConvertUCS2toUTF8?

> nsGlobalHistory::OnAutoComplete(const PRUnichar *searchString,
> nsIAutoCompleteResults *previousSearchResult,
> nsIAutoCompleteListener *listener)
> {
>- return NS_OK;
>+ nsCOMPtr<nsISupportsArray> results;
>+ nsCOMPtr<nsIAutoCompleteItem> item;
>+ PRUint32 count;
>+ nsAutoString value;
>+ PRBool found = PR_FALSE;
>+ PRUint32 i;
>+ nsresult rv = NS_OK;
>+
>+ if (mLearningMode == ACL_NONE && mDataCaptureMode == UDC_NONE)
>+ return rv;

Declare the variables as close to the place where you use them.

>+ previousSearchResult->GetItems(getter_AddRefs(results));
>+ results->Count(&count);

Shouldn't you check if results exists before using it? There are other places with this same function call.

NS_LossyConvertUCS2toASCII(searchString).get());

UTF8.

>+ PRUnichar *comment;
...
>+ aItem->GetComment(&comment);
...
>+ nsMemory::Free(comment);

You might want to use nsXPIDLString, which would take care of the memory management.

>Index: history/src/nsGlobalHistory.h
>===================================================================

Done. Fix those and r=heikki. Please note that many of the things I commented on are fairly generic, so check for that same pattern elsewhere.
Attachment #108254 - Flags: review?(heikki) → review+
Attached patch Patch v0.4 incorporates Heikki's comments (obsolete) — — Splinter Review
Here's a new patch with the following changes:
- Javadoc comments for the new method in the bookmark service IDL file.
- Renamed the method to getAddTime and its parameter to aAddTime.
- sAutoCompleteLearner isn't a global static any more. It's now mAutoCompleteLearner, a member of nsGlobalHistory.
- Commented the constants in nsGlobalHistory.cpp.
- Added a private default constructor to nsPerceptron and nsSigmoidPerceptron.
- Replaced int with PRInt32.

>Maybe you should have Init method or something, because that |new| could fail
>and you can't inform the caller if you do this in the constructor.

This is something I'd like to do once and for all when I move nsPerceptron out into separate IDL, .h, and .cpp files. Then I'll have return error values for each function. Right now, since the perceptron will be off by default anyway, please let me check in without handling the edge case of running out of memory.

>Also, set the mNumWeights after the |new| succeeded, otherwise those two member
>variables are out of sync and you might cause crash. Do not use |NULL|, use
>|nsnull| or |0|.

Done!
Attachment #108254 - Attachment is obsolete: true
Attachment #108415 - Flags: superreview?(hewitt)
Attachment #108415 - Flags: review?(heikki)
Attachment #108415 - Flags: approval1.3a?
Attachment #108415 - Flags: superreview?(hewitt)
Attachment #108415 - Flags: review?(heikki)
Attachment #108415 - Flags: approval1.3a?
>Right now, since the perceptron will be off by default anyway, please let me
>check in without handling the edge case of running out of memory.

OK, but at least add an XXX comment saying we need to check for out-of-memory and move this to an Init() method.
I'm about to attach a patch that addresses everything in your comments above except for the following points, discussed below:

> Is that the correct profile dir nowadays? Didn't it change to
> ".../Profiles/..."?

I've seen the same usage in other places, and I've tested it and made sure that the file is created in the right directory...

>>+ PRUint32 size, i;
>>+ for ( ; start != end; start.advance(size))
>>+ {
>>+ const PRUnichar* buf = start.get();
>>+ size = start.size_forward();
>
> Uh, that looks needlessly complicated. Wouldn't you be able to do that in a
> single loop, starting with:
>
> for ( ; start != end; ++start)

Actually, the Mozilla string guide recommends the more complicated code because it is more performant. See http://www.mozilla.org/projects/xpcom/string-guide.html#Looping_Iterators.
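For reference, here is a small self-contained sketch of the fragment-at-a-time loop that guide describes. The character being counted ('/') is just an illustration, not one of the patch's actual features:

#include "nsString.h"

PRUint32 CountSlashes(const nsAString& aString)
{
  nsAString::const_iterator start, end;
  aString.BeginReading(start);
  aString.EndReading(end);

  PRUint32 count = 0;
  while (start != end) {
    const PRUnichar* buf = start.get();     // current underlying fragment
    PRUint32 size = start.size_forward();   // characters left in this fragment
    for (PRUint32 i = 0; i < size; ++i) {
      if (buf[i] == PRUnichar('/'))
        ++count;
    }
    start.advance(size);                    // hop to the next fragment
  }
  return count;
}

The per-fragment form touches the underlying buffers directly instead of paying the iterator bookkeeping cost on every character, which is the performance argument the string guide makes.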
This patch is what I will check into 1.3 alpha...
Attachment #108415 - Attachment is obsolete: true
Attachment #108254 - Flags: approval1.3a?
Comment on attachment 108426 [details] [diff] [review]
Patch v.05 incorporates 2nd batch of comments from Heikki

Moving review and superreview forward and adding a=asa for checkin to 1.3a.
Attachment #108426 - Flags: superreview+
Attachment #108426 - Flags: review+
Attachment #108426 - Flags: approval1.3a+
I just checked in Patch 108426 (v 0.5). Keeping this bug open because more work needs to happen here...
OS: Windows 2000 → All
Hardware: PC → All
So... is this actually enabled? Based on the checkins that were required to get this compiling (including rev 1.67 of nsGlobalHistory.cpp especially), I have to wonder how much testing this received... (the code rev 1.67 changed could not possibly have worked properly).
I'm sorry, but I backed out the checkin (after fruitless attempts to locate nisheeth last night and earlier today). This checkin was responsible for the crash on shutdown on the tinderboxes running the bloat and leak tests.

In a debug build on Linux, I saw the following on shutdown with this patch in my tree (prior to the crash):

###!!! ASSERTION: can't get xpt search path!: 'Error', file /home/bzbarsky/mozilla/profile/mozilla/xpcom/reflect/xptinfo/src/xptiInterfaceInfoManager.cpp, line 67
###!!! ASSERTION: null monitor: 'mMonitor', file ../../dist/include/xpcom/nsAutoLock.h, line 240
###!!! ASSERTION: null monitor: 'mMonitor', file ../../dist/include/xpcom/nsAutoLock.h, line 248
###!!! ASSERTION: Expected only one call!: '!gCallCount++', file /home/bzbarsky/mozilla/profile/mozilla/xpcom/reflect/xptinfo/src/xptiInterfaceInfoManager.cpp, line 232

These were repeated 5 or 6 times before the actual crash. Oh, and there was a startup assertion:

###!!! ASSERTION: nsDependentCString must wrap only null-terminated strings: '!*aEndPtr', file ../../../../dist/include/string/nsDependentString.h, line 160
Break: at file ../../../../dist/include/string/nsDependentString.h, line 160

Finally, please fix the perceptron constructor to _always_ initialize mNumWeights... the current code will do hideous things if '0' is passed to the constructor and then Train is called.
Comment on attachment 108426 [details] [diff] [review]
Patch v.05 incorporates 2nd batch of comments from Heikki

Please bear with me, this is the first time I've paid any attention to this bug. First, comments on comments:

>> + nsresult rv = NS_GetSpecialDirectory(NS_APP_USER_PROFILE_50_DIR, getter_AddRefs(file));
> Is that the correct profile dir nowadays?

No. That class set is going away (for more info see bug 156733); the correct API is to use the directory service. Sample code can be found all over the place. (There's currently a broken consumer in the xp filepicker, but there will be a correct patch in bug 125324 which you could work from.)

Nit #1: please don't check in trailing whitespace.

>+// Autocomplete learning modes
>+#define ACL_NONE 0

Please rename the ACL things to anything but ACL; ACL should mean Access Control List. AUTOCOMPLETE_NO_LEARNING, AUTOCOMPLETE_ENABLE_TRAINING, AUTOCOMPLETE_AFFECT_URL_LIST seem perfectly reasonable.

>+#define ACL_ENABLE_TRAINING 1
>+#define ACL_AFFECT_URL_LIST 2
>+#define UDC_NONE 0
>+#define UDC_WITHOUT_URL_INFO 1
>+#define UDC_WITH_URL_INFO 2

I can't remember what UDC stands for; please replace it with a longer string that is meaningful. Why not use consts, or perhaps IDL consts?

>+const PRInt32 AC_NUM_URL_FEATURES = 44;

Yuck. Consts are kInitialCaps. 44 appears to be a magic number; if it is, then please decompose it or comment it.

>+// XXX The implementations need to be moved out to their own .cpp file

This stuff should have been checked in *very* early in the alpha cycle, not during the freeze at the end.

>+// XXX This should move to an Init method so that error handling can happen.

This isn't acceptable.

>+nsPerceptron::nsPerceptron(PRInt32 aNumFeatures)
>+{

Why take a signed int?

>+ mWeights = nsnull;
>+ if (aNumFeatures > 0)
>+ {
>+ mWeights = new PRFloat64[aNumFeatures];
>+ }

>+nsPerceptron::Train(PRFloat64* aInputs, PRInt32 aNumInputs, PRFloat64 aTargetOutput)
>+{
>+ double output = 0.0;
>+ double delta = 0.0;

You just switched from NSPR types to compiler types...

>+nsPerceptron::LoadWeights()
>+{
>+ nsCOMPtr<nsIFile> file;
>+ FILE* from = 0;
>+ nsresult rv = NS_GetSpecialDirectory(NS_APP_USER_PROFILE_50_DIR, getter_AddRefs(file));

Again, this should be replaced with calls to the directory service.

>+ localFile->OpenANSIFileDesc("r", &from);

I think we prefer for people to use OpenNSPRFileDesc (and I'll note that the example patch heikki pointed to *did* use the NSPRFileDesc method) and the PR file APIs.

>- return PR_TRUE;
>+ return PR_TRUE;

You added trailing whitespace here -- I won't repeat the nit, but I do want to point out that it means blame was transferred a few times for something that never should have changed.

>+ fflush(mURLDataFile);

Flushing on the main thread is really mean when it's possible the file might live on an AFS volume (although I suppose a decent AFS would try to cache; I'm not certain NFS can, since the idea of flush is to guarantee success).

>+ nsresult rv = NS_OK;
>+ mURLID = PR_Now();

This should be a do {} while() loop.

>+ while (NS_SUCCEEDED(rv))
>+ {
>+ rv = FindRow(kToken_URLIDColumn, ++mURLID, getter_AddRefs(oldRow));
>+ }

>+ mACFeatures = NULL;

heikki's comments said not to use NULL.

>+ nsresult rv;
>+ rv = FindRow(aCol, aValue, aResult);

Early return is preferred, i.e.:

if (NS_FAILED(rv))
  return rv;

>+ if (NS_SUCCEEDED(rv))
>+ {
>+ rv = GetRowValue(*aResult, kToken_URLIDColumn, aRowID);
>+ }
>+ return rv;
>+}

This stuff would make a great javadoc comment:

/**
 * javadoc comments
 * ...
 */

>+// The input features into the autocomplete perceptron are as follows:
>+//
>+// Features 1 = Frequency and recency metric for page in history
>+// (domain = positive real numbers)
>+// Value decays fast with age of page
>+// Uses HISTORY_FAST_DECAY_CONSTANT

>+ // Now, calculate a bunch of url related features
>+ nsCOMPtr<nsIURI> uri;
>+ nsCAutoString curl, chost, cpath;
>+ rv = NS_NewURI(getter_AddRefs(uri), aUrl);
>+ if (NS_SUCCEEDED(rv) && uri)
>+ {
>+ uri->GetSpec(curl);
>+ uri->GetHost(chost);
>+ uri->GetPath(cpath);
>+ }

This is very very very wasteful and silly.

>+ nsAutoString url(NS_ConvertUTF8toUCS2(curl).get());
>+ nsAutoString path(NS_ConvertUTF8toUCS2(cpath).get());
>+ nsAutoString host(NS_ConvertUTF8toUCS2(chost).get());

You just converted things from char* to PRUnichar*...

>+ ToLowerCase(url);
>+ ToLowerCase(host);
>+ ToLowerCase(path);

...and now you're lowercasing them...

>+ // Feature 6: Whether url ends in .htm or .html
>+ nsAString::const_iterator start, end;
>+
>+ path.BeginReading(start);
>+ path.EndReading(end);

...and now you're comparing them with things which by all rights should be char*.

>+ aFeatures[5] = FindInReadable(NS_LITERAL_STRING(".htm"), start, end);

Please please please save on conversions. Don't promote to UCS2 unless you have a reason. In fact, you shouldn't be using FindInReadable; you should be comparing the last n chars with the strings.

>+ // Feature 7: Is it a .com URL?
>+ host.BeginReading(start);
>+ host.EndReading(end);
>+ aFeatures[6] = FindInReadable(NS_LITERAL_STRING(".com"), start, end);
>+
>+ // Feature 8: Is it a .edu URL?
>+ host.BeginReading(start);
>+ host.EndReading(end);
>+ aFeatures[7] = FindInReadable(NS_LITERAL_STRING(".edu"), start, end);
>+
>+ // Feature 9: Is it a .org URL?
>+ host.BeginReading(start);
>+ host.EndReading(end);
>+ aFeatures[8] = FindInReadable(NS_LITERAL_STRING(".org"), start, end);
>+
>+ // Feature 10: Is it a .net URL?
>+ host.BeginReading(start);
>+ host.EndReading(end);
>+ aFeatures[9] = FindInReadable(NS_LITERAL_STRING(".net"), start, end);
>+
>+ // Feature 11: Is it a .gov URL?
>+ host.BeginReading(start);
>+ host.EndReading(end);
>+ aFeatures[10] = FindInReadable(NS_LITERAL_STRING(".gov"), start, end);
>+

This is the first place that actually cares about a non-terminal:

>+ // Feature 12: Does the URL contain a ~ ?
>+ path.BeginReading(start);
>+ path.EndReading(end);
>+ aFeatures[11] = FindInReadable(NS_LITERAL_STRING("~"), start, end);

The next few things obviously care about beginning chars only. However, I wonder if there's some other way to do it; this doesn't scale well at all.

>+ // Feature 13: Does the URL start with http:// ?
>+ PRBool isScheme;
>+ aFeatures[12] = aFeatures[13] = aFeatures[14] = aFeatures[15] = aFeatures[16] = 0;
>+ if (NS_SUCCEEDED(uri->SchemeIs("http", &isScheme)))
>+ {
>+ aFeatures[12] = isScheme;
>+ }
...
>+
>+ // Feature 18: Does the URL end in a two letter country code?

Your comment's wrong. It's not the URL, it's the host.

>+ aFeatures[17] = (host.RFindChar('.') == ((host.Length() - 1) - 2));

Instead of doing a search which could cost O(N) for something like:

http://myververyverydshjfhsdkjfhsdakjlfhsalkjfhlsdjkahflkjsadhflkjsdhfljaksdfhlsdkjfhaslkdfhlonglocaldomain

also consider that the following URL should be considered valid:

http://www.state.ca.us./state/portal/myca_homepage.jsp

Ignoring that fact (because it isn't my job to write your code), the pseudo code to efficiently check for the case you cared about would be something like:

x = host.Length();
(x > 3 && host[x-3] == '.' && host[x-2] != '.' && host[x-1] != '.')

(There's a very simple way to handle the trailing '.'.)

I'm not sure if it makes sense to do the next counting using PRUnichar... (said counting snipped)

>+ // Calculate a bunch of hostname related features.
>+ // Feature 33: Number of .s in the hostname if we omit initial "www." or "ftp." (if any)
>+ // Feature 35: Number of .s in the hostname if we omit ending initial "www." or "ftp"

This comment looks wrong.

>+ if (chost.Find("www.") == 0 || chost.Find("ftp.") == 0)

Again, *please* don't use Find when you want to know if a specific set of characters match. Use Substring+Equals or strcmp.

>+ if (chost.RFindChar('.') == ((chost.Length() - 1) - 2))

Again, the same complaint about the tail... and isn't there a boolean lying around that you could check instead of recalculating? If not, then this stuff just isn't local to similar stuff... you should group related comparisons and share conditions.

>+ // Feature 41: Is this a google search url?

http://www.google.com/mac?q=is+a+google+search+url and I use it. XXX fix your code :)

>+ aFeatures[40] = FindInReadable(NS_LITERAL_STRING("http://www.google.com/search"), start, end);

XXX Find is bad.

>+ // Feature 43: Is this a yahoo search url?

There have to be other yahoo search URLs...

>+ aFeatures[42] = FindInReadable(NS_LITERAL_STRING("http://search.yahoo.com/bin/search"), start, end);

>+ // If searchString found in the previous search results, assume
>+ // that the user selected that url (searchString) from the previous
>+ // list of autocomplete results.

I wonder if you could ask the search service instead of relying on the evil hard coding you have...

>+ fprintf(mURLDataFile, " url='%s'", NS_ConvertUCS2toUTF8(searchString));

You just converted to char*, eek. No, you just converted to char* once.

>+ if (NS_SUCCEEDED(FindRowAndID(kToken_URLColumn,
>+ NS_ConvertUCS2toUTF8(searchString).get(),
>+ getter_AddRefs(row), &rowID)))

And you just did it again. 1. I don't know if you ever needed PRUnichar*, but if you did, please save a conversion by not doing it again.

--SX MARK i'll be back

>+ {
>+ if (rowID == 0)
>+ {
>+ AssignUniqueURLID(row);
>+ rowID = mURLID;
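To make that tail check concrete, here is a small standalone sketch (illustrative only, not code from the patch, and the helper name is made up) of a constant-cost check for a two-character final label, including the trailing-dot case mentioned above:

#include "plstr.h"
#include "prtypes.h"

// Does the host end in a two-character label (e.g. ".us"),
// tolerating a single trailing '.' as in "www.state.ca.us." ?
static PRBool EndsInTwoLetterLabel(const char* aHost)
{
  PRUint32 len = PL_strlen(aHost);
  if (len && aHost[len - 1] == '.')   // ignore one trailing dot
    --len;
  return len > 3 &&
         aHost[len - 3] == '.' &&
         aHost[len - 2] != '.' &&
         aHost[len - 1] != '.';
}

Like the pseudo code, this only looks at the last few characters of the host, so its cost does not grow with the length of the hostname.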
Attachment #108426 - Flags: review-
Attachment #108426 - Flags: review+
Attachment #108426 - Flags: approval1.3a+
Blocks: 183901
timeless, thanks for your comments. I'm about to attach a patch that addresses most of your concerns. First, some general replies related to your comments:

Your points about optimizing FillInputFeatures() make sense. But all the code in FillInputFeatures is only executed when a pref is turned on by the users that will participate in data collection. Also, all the code in FillInputFeatures will be discarded once we've collected the data. So I don't want to spend more time optimizing that code. Even though a lot of extra work is being done, there is no perceptible delay on my 750 MHz laptop. I'm hoping that users who participate in the data collection will have equal or better machines and will also not feel a lag in performance. I promise that if we decide to keep the code around or enable it by default, I will performance-optimize it to everyone's satisfaction.

I know that the perceptron code also needs more work. As with the data capture code, a perceptron is only created and used if a pref is set by the user. Please ignore the XXX comments, as they will be removed when the perceptron becomes its own component.

More specific replies to your comments follow:

>>> + nsresult rv = NS_GetSpecialDirectory(NS_APP_USER_PROFILE_50_DIR, getter_AddRefs(file));
>> Is that the correct profile dir nowadays?
> no. that class set is going away (for more info see bug 156733) the correct api
> is to use to directoryservice. sample code can be found all over the place.
> (there's currently a broken consumer in the xp filepicker, but there will be a
> correct patch in bug 125324 which you could work from).

I double-checked with Conrad Carlen about the above usage and he was fine with both NS_GetSpecialDirectory and NS_APP_USER_PROFILE_50_DIR. This code is temporary anyway, so it doesn't really matter.

> nit#1, please don't check in trailing whitespace

Agreed! I will exorcise these before checking in.

>>+// Autocomplete learning modes
>>+#define ACL_NONE 0
> please rename the ACL things to anything but ACL. acl should mean Access
> Control List. AUTOCOMPLETE_NO_LEARNING, AUTOCOMPLETE_ENABLE_TRAINING,
> AUTOCOMPLETE_AFFECT_URL_LIST seem perfectly reasonable.
>>+#define ACL_ENABLE_TRAINING 1
>>+#define ACL_AFFECT_URL_LIST 2
>>+#define UDC_NONE 0
>>+#define UDC_WITHOUT_URL_INFO 1
>>+#define UDC_WITH_URL_INFO 2
> I can't remember what UDC stands for, please replace it with a longer string
> that is meaningful.

Done! I've replaced them with more descriptive names.

> why not use consts, or perhaps idl consts?

I prefer #defines mostly because I want to use all-caps names without prefixing a "k". I don't want to use IDL constants because these aren't public constants intended for consumers other than nsGlobalHistory.

>>+ nsresult rv = NS_OK;
>>+ mURLID = PR_Now();
>>+ while (NS_SUCCEEDED(rv))
>>+ {
>>+ rv = FindRow(kToken_URLIDColumn, ++mURLID, getter_AddRefs(oldRow));
>>+ }
>
> this should be a do {}while() loop.

Not necessarily, because rv is initialized to NS_OK in the beginning, so NS_SUCCEEDED(rv) will always return TRUE and the loop will get executed at least once. do {} while() has the same semantics. I've changed this to a do {} while nevertheless, because we do save a condition test with the do-while...

>>+nsPerceptron::Train(PRFloat64* aInputs, PRInt32 aNumInputs, PRFloat64 aTargetOutput)
>>+{
>>+ double output = 0.0;
>>+ double delta = 0.0;
> you just switched from nspr types to compiler types...

I tried to use PRFloat64 but ran into weird behavior (don't remember exactly, calculations weren't happening properly or something) in my debugger. Switching to native types fixed the problem and I didn't explore it further. I'll fix this when I make the perceptron a proper component.

>>+ localFile->OpenANSIFileDesc("r", &from);
> I think we prefer for people to use OpenNSPRFileDesc (and i'll note that the
> example patch heikki pointed to *did* use the NSPRFileDesc method) and the
> prfile apis

The OpenANSIFileDesc method fit better with the code that I'd already written. All the data capture code is going to get thrown away once we've gotten data back from the users. Please ignore this nit for now.

>>+ fflush(mURLDataFile);
> flushing on the main thread is really mean when it's possible the file might
> live on an AFS volume (although i suppose a decent AFS would try to cache i'm
> not certain NFS can since the idea of flush is to guarantee success).

This flushing only happens when the user enables a pref for data capture. Users who participate in data collection can turn off the pref if they experience a significant slowdown. This code will go away after we've gotten data back from some users.

>>+ mACFeatures = NULL;
> heikki's comments said not to use NULL.

Fixed. I missed this one in my last patch.
Boris, thanks a lot for your comments. I've fixed the shutdown crash. It wasn't a problem with my code but was unmasked by it. I've also verified that my code doesn't cause new debug assertions to fire. >Finally, please fix the perceptron constructor to _always_ initialize >mNumWeights... the current code will do hideous things if '0' is passed to the >constructor and then Train is called. Done!
I've been biting my tongue on this comment, but I can't hold myself back any longer. I have to ask: is this amazing over-engineering worth the trouble? Can it really offer appreciably better results than the existing frequency-weighted order? (I find the existing autocomplete dropdown ordering to be thoroughly excellent.)

There are probably plenty of places in Mozilla where machine learning can be applied fruitfully. If the goal of this bug is to prove the mechanisms before applying them to bigger matters, then that sounds quite fair to me. Otherwise this seems like a large effort targeting a nonexistent problem.
Adam, you are exactly right. The long-term goal is to expand the use of machine learning to other parts of Mozilla. Autocomplete is a small baby step. I don't know whether we'll end up with better results than the existing mechanism. We might not. But I'm treating this as a research project. Once we have collected the data, we'll see if we can come up with a learning algorithm that performs better than the current algorithm.

Another goal is to get folks in the machine learning community to take notice of Mozilla. Hopefully, as we spread the word about small machine learning projects like this one, we will spark more ambitious ideas in the heads of machine learning gurus in academia. If even a few university professors start working on implementing their ideas on top of Mozilla, we will all benefit.

If you have ideas about where we can apply machine learning within Mozilla, let's start discussing them in the Mozilla newsgroups. Some ideas we've had till now are:

- Currently, Mozilla does LINK attribute based prefetching of web pages. Can we use a user's past surfing behavior to predict what to prefetch?
- Can we algorithmically identify ad images on a web page and block them out?
- How do the activities of email, instant messaging, newsgroup reading, and web browsing intersect and overlap with each other? Can we apply machine learning techniques to learn a user's habits and use that knowledge to filter out unnecessary information and reduce information overload as they conduct these activities? The Bayesian spam filter recently added to Mozilla is a big step in this direction.
- We run security hole scanners on the Mozilla codebase today and get a lot of false positives. Can we apply machine learning to automatically flag these mistaken errors and save developer time?
- Is it possible to identify security holes in the Mozilla codebase more intelligently than grep-like scanning by using artificial intelligence techniques?

Some of these ideas are hard research problems that are unlikely to have clear solutions. Exploring these ideas might not lead to shippable code. But we should still take a shot at it for the sake of learning more about difficult problems. If we come up with an interesting insight, we will be able to apply it not only to Mozilla but to other applications as well.
Some more replies to comments from timeless:

>>+ if (chost.RFindChar('.') == ((chost.Length() - 1) - 2))
> again, the same complaint about the tail... and isn't there a boolean lying
> around that you could check instead of recalculating? if not then this stuff
> just isn't local to similar stuff... you should group related comparisons and
> share conditions.

Good point. I'm now re-using the earlier boolean value.

> also consider that the following url should be considered valid:
> http://www.state.ca.us./state/portal/myca_homepage.jsp

I've added a check for this case in FillInputFeatures, although I doubt if there are many urls on the web with a trailing period in their hostname.
This patch contains the stuff I said I'd do in comments 26, 27 and 30. It also includes the following:

- We now get one instead of three autocomplete calls for each autocomplete selection.
- Fixed debug asserts that fired on shutdown in nsXPTIInterfaceManager.
- We now output the referrer url for each visited url to the data file.
- We now output Mozilla startup and exit times to the data file.
- Removed mURLID from nsGlobalHistory. It was unnecessary.
- nsGlobalHistory::HidePage outputs to the data file. We need this to weed out (i)frame loads within a page.
- Made all the changes needed to build the patch on all platforms when it was first checked in.
- Made the comment before FillInputFeatures a javadoc comment.
- With rjc@netscape.com's help, reworked code so that we no longer need to add the GetAddTime method to nsIBookmarksService. Now, pretty much all the code in this patch is in nsGlobalHistory.h/.cpp and a few lines in nsDocShell.h/.cpp.
Attachment #108426 - Attachment is obsolete: true
Attachment #110621 - Flags: review?(heikki)
Comment on attachment 110621 [details] [diff] [review]
Patch v0.6: Incorporates timeless and boris' comments

Sorry to be anal here, but in nsGlobalHistory.cpp, could you use this style:

  if (foo) {
    ...
  }
  else {
    ...
  }

rather than

  if (foo)
  {
    ...
  }
  else
  {
    ...
  }

I know the latter is prevalent in docshell (I really wish that mozilla.org would adopt a standard style that everyone should follow. Having source with multiple styles is worse IMO than reading a file that is not your preferred style), but nsGlobalHistory.cpp uses the former exclusively (except for one-liners, which have no braces) and "when-in-Rome" should apply here.
- Got rid of gcc warnings.
- Numeric url features are now output with 2 decimal places rather than 6.
Attachment #110621 - Attachment is obsolete: true
Attachment #110803 - Flags: review?(heikki)
Chris, I will fix the if-else syntax to match the rest of the file when I update the patch after heikki's review. I agree that the when-in-rome rule applies here. I've also compiled and run with my patch on Mac Classic without any problems.
Comment on attachment 110803 [details] [diff] [review]
Patch v0.7: Created after compile and test cycle on Linux.

>Index: history/public/nsIBrowserHistory.idl
>===================================================================
>+ /**
>+ * setReferrerURL
>+ * Sets the referrer url for a url in global history

I am concerned about this, even though our implementation does not do what I am really concerned about (namely, sending the referrer on URLs that the user goes to from global history, because that would be a privacy/security issue). It might be sufficient to add notes/warnings to the IDL and implementation that the referrer will/must not be sent when traversing to the URL.

(But in that light it begs the question: why is there a method for the referrer when it must not be used for its "intended" purpose? Maybe you need to add some verbiage to the IDL comment saying something about it being used for internal purposes or something like that. Am I being too picky?)

>Index: history/src/nsGlobalHistory.cpp
>===================================================================
>+ if (id == 0)

Why not:

if (!id)

This is a general pattern. If you find yourself comparing something to 0/null, you should change it to |if (foo)| or |if (!foo)|.

>+ // Sanity check the URL
>+ PRInt32 len = PL_strlen(aURL);
>+ NS_ASSERTION(len != 0, "no URL");
>+ if (! len)
>+ return NS_ERROR_INVALID_ARG;

That could simply be written as:

if (!*aURL)
  return NS_ERROR_INVALID_ARG;

>+ // XXX Handle out of memory condition
>+ mAutoCompleteLearner = new nsSigmoidPerceptron(AC_NUM_URL_FEATURES);
>+ mACFeatures = new PRFloat64[AC_NUM_URL_FEATURES];
>+
>+ if (!mAutoCompleteLearner || !mACFeatures)
>+ {
>+ // Disable learning
>+ mLearningMode = AUTOCOMPLETE_NO_LEARNING;
>+ mAutoCompleteLearner = nsnull;
>+ mACFeatures = nsnull;
>+ }

This is sort of handling OOM conditions already, but doing it in a buggy way. If mACFeatures failed, then we will leak mAutoCompleteLearner, and vice versa. Please fix this.

>- (*aResult)->AddRef();
>+ NS_ADDREF(*aResult);

If you see anyone calling Release() directly, fix it with NS_RELEASE at the same time. Otherwise this change will show up as a new "leak".

>+nsGlobalHistory::FindRow(mdb_column aCol,
>+ err = mStore->FindRow(mEnv, kToken_HistoryRowScope,
>+ aCol, &yarn,
>+ &rowId, getter_AddRefs(row));

You are not doing anything with |err| after this. Please fix.

>+ nsCAutoString curl, chost, cpath;
>+ rv = NS_NewURI(getter_AddRefs(uri), aUrl);
>+ if (NS_SUCCEEDED(rv) && uri)
>+ {
>+ uri->GetSpec(curl);
>+ uri->GetHost(chost);
>+ uri->GetPath(cpath);
>+ }
>+ nsAutoString url(NS_ConvertUTF8toUCS2(curl).get());
>+ nsAutoString path(NS_ConvertUTF8toUCS2(cpath).get());
>+ nsAutoString host(NS_ConvertUTF8toUCS2(chost).get());
>+ // Feature 6: Whether url ends in .htm or .html
>+ nsAString::const_iterator start, end;
>+
>+ path.BeginReading(start);
>+ path.EndReading(end);
>+ aFeatures[5] = FindInReadable(NS_LITERAL_STRING(".htm"), start, end);

Rather than go through all this trouble, don't we have nsACString equivalents? If not, I am fine with it.

>+ // Feature 12: Does the URL contain a ~ ?
>+ path.BeginReading(start);
>+ path.EndReading(end);
>+ aFeatures[11] = FindInReadable(NS_LITERAL_STRING("~"), start, end);

I just realized that a path would start with a tilde, wouldn't it? A tilde anywhere else in the path shouldn't be anything special. But a minor point, fine either way (I don't expect to see tildes anywhere else in paths).

>Index: history/src/nsGlobalHistory.h
>===================================================================
>+ if ((mNumWeights > 0) && (mWeights != NULL))
>+ {
>+ delete [] mWeights;
>+ }

NULL again. Besides, mNumWeights shouldn't matter. Suppose we allocated mWeights but mNumWeights is, or becomes, 0; we'd leak. Write the if like this:

>+ if (mWeights)

Done. After those changes, r=heikki
Attachment #110803 - Flags: review?(heikki) → review+
About to attach a patch that addresses Heikki's review. Heikki, here are my responses to your comments:

>Index: history/public/nsIBrowserHistory.idl
>===================================================================
>+ /**
>+ * setReferrerURL
>+ * Sets the referrer url for a url in global history
>
>I am concerned about this, even though our implementation does not do what I am
>really concerned about (namely, sending the referrer on URLs that the user goes
>to from global history, because that would be a privacy/security issue). It
>might be sufficient to add notes/warnings to the IDL and implementation that
>the referrer will/must not be sent when traversing to the URL.
>
>(But in that light it begs the question: why is there a method for referrer
>when it must not used for its "intended" purpose? Maybe you need to add some
>verbiage to the IDL comment saying something about used for internal purposes
>or something like that. Am I being too picky?)

No, you aren't being too picky. This method is a big hack. I should really use a standalone service for logging data as needed and call that service from the places where I need to output data. But this method will go away after the data collection phase, so I'm taking a shortcut. I've added a comment to this effect in the IDL file and renamed the method to OutputReferrerURL.

>Index: history/src/nsGlobalHistory.cpp
>===================================================================
>+ if (id == 0)
>
>Why not:
>
>if (!id)
>
>This is a general pattern. If you find yourself comparing something to 0/null,
>you should change it to |if(foo)| or |if(!foo)|.

Done.

>+ // Sanity check the URL
>+ PRInt32 len = PL_strlen(aURL);
>+ NS_ASSERTION(len != 0, "no URL");
>+ if (! len)
>+ return NS_ERROR_INVALID_ARG;
>
>That could simply be written as:
>
>if (!*aURL)
> return NS_ERROR_INVALID_ARG;

Agreed. I had cut and pasted this from existing code in AddPageToDatabase(). I'm changing my function but leaving AddPageToDatabase() alone.

>+ // XXX Handle out of memory condition
>+ mAutoCompleteLearner = new nsSigmoidPerceptron(AC_NUM_URL_FEATURES);
>+ mACFeatures = new PRFloat64[AC_NUM_URL_FEATURES];
>+
>+ if (!mAutoCompleteLearner || !mACFeatures)
>+ {
>+ // Disable learning
>+ mLearningMode = AUTOCOMPLETE_NO_LEARNING;
>+ mAutoCompleteLearner = nsnull;
>+ mACFeatures = nsnull;
>+ }
>
>This is sorta handling OOM conditions already, but doing it in a buggy way. If
>mACFeatures failed, then we will leak mAutoCompleteLearner, and vice versa.
>Please fix this.

Good catch. Fixed.

>- (*aResult)->AddRef();
>+ NS_ADDREF(*aResult);
>
>If you see anyone calling Release() directly, fix it with NS_RELEASE at the
>same time. Otherwise this change will show up as new "leak".

I didn't find any place in nsGlobalHistory.cpp. I'll check other consumers of FindRow() and fix if necessary.

>+nsGlobalHistory::FindRow(mdb_column aCol,
>+ err = mStore->FindRow(mEnv, kToken_HistoryRowScope,
>+ aCol, &yarn,
>+ &rowId, getter_AddRefs(row));
>
>You are not doing anything with |err| after this. Please fix.

This code is cut and pasted from the other FindRow methods in nsGlobalHistory.cpp. I don't know enough about the error values returned from the mork database store to make this change. If the other FindRows are ok with ignoring err (which is of type mdb_err, not nsresult, so I can't use the NS_SUCCEEDED macro on it), then I am too. What do you think?

>+ path.BeginReading(start);
>+ path.EndReading(end);
>+ aFeatures[5] = FindInReadable(NS_LITERAL_STRING(".htm"), start, end);
>
>Rather than go through all this trouble, don't we have nsACString equivalents?
>If not, I am fine with it.

I remember trying to look for them but couldn't find them easily, so I resorted to the nsAString methods. I'll leave this as is for now but fix it up if we decide to enable this code path by default.

>Index: history/src/nsGlobalHistory.h
>===================================================================
>+ if ((mNumWeights > 0) && (mWeights != NULL))
>+ {
>+ delete [] mWeights;
>+ }
>
>NULL again. Besides, mNumWeights shouldn't matter. Suppose we allocated
>mWeights but mNumWeights is, or becomes 0; we'd leak. Write the if like this:
>
>+ if (mWeights)

Oops, sorry for the NULL. I found that another one had crept in as well and changed it too. This habit is gonna die hard! I've also changed the if condition like you suggested.
>I didn't find any place in nsGlobalHistory.cpp. I'll check other consumers of >FindRow() and fix if necessary. Duh, the only possible consumers of nsGlobalHistory::FindRow() are in nsGlobalHistory.cpp because FindRow() is a protected method. So, we are ok.
Fixed the if-else syntax and made the changes I mentioned in comment 36.
Attachment #110803 - Attachment is obsolete: true
Attachment #110901 - Flags: superreview?(hewitt)
Attachment #110901 - Flags: review?(heikki)
Attachment #110901 - Flags: approval1.0.x?
While it's great that you're experimenting with how learning approaches could improve the user experience, mozilla needs to keep the core stable. Perhaps this work would be better suited to a branch, extensions/ (not in default build), or mozdev? This would also allow you to avoid running the r/sr gauntlet while doing the fine-tuning. Once you've got a technique working nicely and have enough testing to have confidence in it we'd of course welcome it in the mozilla tree. This need for testing/confidence especially applies to the 1.0.x branch, which I notice you've flagged this patch for.
Oops, sorry, tor. I am new to the Bugzilla review/approval process. I thought that I needed to get approval for getting this checked into the trunk. Heikki tells me that I don't. I'll go remove the approval request.
Attachment #110901 - Flags: approval1.0.x?
tor, I just realized that I didn't address the second part of your comment. The reason I'm not doing this work on a branch is that I want this to be a part of the 1.3 beta release which will get into the hands of lots of Mozilla community members. I then hope to convince some fraction of the downloaders to turn on the data capture pref, run with it on for a couple of weeks, and then send me back the data file. The data collection phase of this project needs wide distribution and an easy turn on/off mechanism for data capture. I won't be able to meet these requirements if I don't ship this in the default mozilla builds. I remember that I sent out an email to drivers about this. I'm going to paste that email on this bug because it provides relevant info for people cc'd here.
<snippet> The long term work plan is to use machine learning techniques to better order the autocomplete results in the url bar. Right now, the goal is to capture data (when the user chooses to turn on a pref) about urls visited, urls seen in the autocomplete dropdown, and urls selected in the autocomplete session by the user. When we get the data back, Prof. Andrew Ng (the CS professor I'm working with at Stanford) and I will try to make sense of it and identify the set of url features and the learning algorithm that best predict which url the user will visit next. There are obvious privacy concerns about this. To deal with them, there are two data capture modes. In one, url data is captured. In the other, only numeric features about the url are captured so that it is impossible to determine which url the person visited. The url data files are human readable so people who participate in the data gathering phase will be able to see exactly what they are sending back to us. </snippet> I hope to weed out bugs over the next couple of weeks and then include a blurb in the 1.3 beta release notes requesting Mozilla volunteers to help in data collection. Any suggestions on this plan are very welcome!
Attached patch Patch v0.9: Does better out of memory checking. (obsolete) — — Splinter Review
Heikki, the only thing that I want you to take a look at in this new patch is the following code snippet, which handles the out of memory condition for the heap allocated variables:

+  if (mLearningMode > AUTOCOMPLETE_NO_LEARNING ||
+      mDataCaptureMode > URLDATACAPTURE_NONE) {
+    // Create perceptron and feature array.
+    mAutoCompleteLearner = new nsSigmoidPerceptron(AC_NUM_URL_FEATURES);
+    if (mAutoCompleteLearner) {
+      mACFeatures = new PRFloat64[AC_NUM_URL_FEATURES];
+      if (!mACFeatures) {
+        delete mAutoCompleteLearner;
+        mAutoCompleteLearner = nsnull;
+        mLearningMode = AUTOCOMPLETE_NO_LEARNING;
+      }
+    }
+    else {
+      mLearningMode = AUTOCOMPLETE_NO_LEARNING;
+    }
+  }
+  else {
+    mAutoCompleteLearner = nsnull;
+    mACFeatures = nsnull;
+  }

Apart from this, your review+ of the previous patch carries forward to this one. Thanks!
Attachment #110901 - Attachment is obsolete: true
Attachment #110916 - Flags: superreview?(hewitt)
The above code snippet is making the assumption that mAutoCompleteLearner and mACFeatures are initialized to nsnull in the constructor.
Comment on attachment 110916 [details] [diff] [review]
Patch v0.9: Does better out of memory checking.

More drive-by comments:

Operators |delete| and |delete[]| are both null-safe, so there is no need to do:

if (foo)
  delete foo;

Rather, just do:

delete foo;

Also, you missed switching some braces in

nsresult
nsGlobalHistory::GetRowValue(nsIMdbRow *aRow, mdb_column aCol, PRFloat64 *aResult)

In the middle of

+nsresult
+nsGlobalHistory::FillInputFeatures(nsAString &aUrl,
+                                   PRFloat64 *aFeatures)

around this code:

+  nsCOMPtr<nsIRDFDataSource> bookmarkDS = do_QueryInterface(bs, &rv);
+  if (NS_SUCCEEDED(rv) && bookmarkDS) {
^^^^ Evil tab characters, please remove in favor of spaces.
+    nsCOMPtr<nsIRDFNode> nodeType;

In NS_IMETHODIMP nsGlobalHistory::OnAutoComplete(const PRUnichar *searchString, nsIAutoCompleteResults *previousSearchResult, nsIAutoCompleteListener *listener):

+  nsCOMPtr<nsISupportsArray> results;

nsISupportsArray is considered deprecated and efforts to remove it from the tree have already been started. Looking at your usage, you want to use nsCOMArray instead.

Still in the same method:

+  // If searchString found in the previous search results, assume
+  // that the user selected that url (searchString) from the previous
+  // list of autocomplete results.
+  if (found)
+    {

Please fix the brace. And towards the end of that method:

+    if (mLearningMode >= AUTOCOMPLETE_ENABLE_TRAINING)
+      {
+        mAutoCompleteLearner->SaveWeights();
+      }
+  }
+
+  return rv;

Again the brace.

Also, on a separate note, you may wish to consider using ++foo rather than foo++ as it is more efficient, and you use those quite frequently.
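For what it's worth, a rough sketch of what the suggested nsCOMArray usage could look like. This is assumed usage for illustration (the function name is made up), not code from the patch:

#include "nsCOMArray.h"
#include "nsIAutoCompleteItem.h"

// The array AddRefs on append and Releases on destruction,
// so no manual reference counting or QueryInterface per element is needed.
void Example(nsIAutoCompleteItem* aItem)
{
  nsCOMArray<nsIAutoCompleteItem> items;
  items.AppendObject(aItem);                          // AddRefs aItem

  for (PRInt32 i = 0; i < items.Count(); ++i) {
    nsIAutoCompleteItem* item = items.ObjectAt(i);    // non-owning pointer
    // ... inspect |item| here ...
  }
}                                                     // items released here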
Comment on attachment 110916 [details] [diff] [review]
Patch v0.9: Does better out of memory checking.

sr=me
Attachment #110916 - Flags: superreview?(hewitt) → superreview+
Nisheeth, that looks ok once you remove the tabs.
I would expect that the same group of people who would be willing to mail back a data collection file would also be willing to download a special testing build.
Attached patch Patch v1.0: Final patch — — Splinter Review
This patch contains the following changes:
- Fixed tab characters and if-else braces pointed out by Chris and Heikki.
- Did not replace nsISupportsArray because it is a parameter in a method in nsIAutoCompleteResults, an interface that existed before my patch.
- Replaced post-increments with pre-increments in FillInputFeatures().
Attachment #110916 - Attachment is obsolete: true
Patch v1.0 checked in.
ugh. this is way cool but it bloated us by a whopping 22k of static footprint. I really like the concept but I'm really disappointed in the bloat. I think this one of those features where we need to decide if its worth the feature for the footprint hit? how about runtime footprint, do we have any idea what the impact was there?
Alec, runtime footprint isn't affected for the user unless some prefs are enabled that switch on data capture or learning. The 21k static footprint increase is temporary. Once we've received data back from enough mozilla users who participate in the data collection phase, we'll remove the data capture code. My hope is to take the data capture code out after the 1.3 beta release and before the 1.3 final release. The learning code might also be taken out if data analysis indicates that the same set of feature weights can be used for all users. Then, we will simply use the hard coded weights rather than dynamically adjust them for each user. So, the question before us is if we can accept a temporary 21 K increase in static footprint for this research experiment. What do you think?
Thanks for the clarification - I'm fine with the temporary code inclusion (and in fact, I'm now interested in participating!), but there are only a few days left to mozilla1.3 and I'd hate to see experimental/temporary code bloating a release - can we make sure this gets backed out by the 1.3 beta release? Is a week of data gathering enough? (hopefully!)
If this needs community participation in a hurry, better write up a web page on mozilla.org that describes how to turn this on in its various modes (I can't figure it out) and push it to mozillazine.org etc.
People are waiting for your signal, Nisheeth :) Just tell us what to do...
Alec Flett:
> can we make sure this gets backed out by the 1.3 beta release? Is a week
> of data gathering enough? (hopefully!)

I was under the impression that the plan was to include this in 1.3 Beta (see comment 42).

Adam D. Moss:
> If this needs community participation in a hurry, better write up a web page
> on mozilla.org that describes how to turn this on in its various modes (I
> can't figure it out) and push it to mozillazine.org etc.

A MozillaZine article is an excellent idea and should get a lot of people participating.
I stumbled across some of this code in Purify. Specifically, mDataCaptureMode and mLearningMode are not initialized in nsGlobalHistory::Init (I don't have these prefs) and they are used later on. Not sure what the state of this bug is, but I thought I'd post here since it's still open. Let me know if I need to file a separate bug.
David, thanks for the heads up. I'm looking into the bug. Bug 184167 is probably a result of this...
David, I've checked in a fix. Thanks for finding the bug! I hope to have a document describing this stuff written up for mozillazine and mozilla.org in time for the 1.3 beta release. Till then, if you can run the nightly builds with the "browser.history.url.datacapture.mode" pref set to 2, you will start seeing a file called "url-data.txt" in your profile directory. If some of you can try this out and report problems back to me, that would be great! I'd really like to ferret out lurking bugs before the 1.3 beta release. Thanks!
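For convenience, here are the two prefs mentioned in this bug as they would appear in a profile's prefs.js (or user.js). The data capture pref and its values 1/2 come straight from the comments here; treating 2 as the capture mode to try first and adding the learning pref alongside it is just an example setup, not an official recommendation:

// Capture autocomplete training data into url-data.txt in the profile
// directory; 1 and 2 select the two capture modes described on the project page.
user_pref("browser.history.url.datacapture.mode", 2);

// Optional: also enable the experimental learning-based ordering of results.
user_pref("browser.urlbar.autocomplete.learning.mode", 2);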
Alright, I've been running CVS-HEAD (with mode set to 1, not 2) for a few days now. Haven't seen any problems. I'm turning it off for the moment because url-data.txt is starting to uncomfortably fill up my /home partition. :) Is there a place where I can usefully send my url-data.txt or should I start over when the data collection 'officially' begins?
Adam, I'd love to take a peek at the data. Would you zip it up and email it to me, please? Thanks a lot!

Not sure if I'll be able to get to this, but ideally there should be a web page so that the data file can be submitted anonymously. Anyone up for creating and hosting such a thing? It would probably be a simple CGI that accepts the data file and emails it to me. If it can double-check that the data file is zipped and, if not, zip it up before emailing it, even better!
> Not sure if I'll be able to get to this, but ideally there should be a web
> page so that the data file can be submitted anonymously. Anyone up for
> creating and hosting such a thing? It would probably be a simple CGI that
> accepts the data file and emails it to me. If it can double-check that the
> data file is zipped and, if not, zip it up before emailing it, even better!

You mean like this? http://www.mozillazine.org/misc/bug-182366-submit-anon-url-data.html

It'll need more testing before going into production and I'd have to check with Kerz that it's okay for MozillaZine to host it. The wording, of course, can be updated (and probably should be).

The script (it's PHP, not CGI) just grabs the zip file and mails it to nisheeth@netscape.com as an attachment named url-data.zip. The script checks that the data is zipped (i.e. that it's receiving a file of the type application/x-zip-compressed) but it doesn't zip up the data if it isn't. It also doesn't check that the zip file contains a url-data.txt file. But essentially, it should work (though, as I said, it will need testing).

How big are these (zipped) url-data files going to be? I think the form's limit is 2MB (though I believe this can be changed).
Here's the PHP script so you can all suggest any necessary changes (or just mock my coding skills).
Attachment #111782 - Attachment description: THe PHP script behind the anonymous form → The PHP script behind the anonymous form
Alex, this is really cool! Thanks a lot for whipping this up! Adam, please try and submit your data file from Alex's page. Let the testing begin!
I can't use the form to submit the data, because I deleted the data after I emailed it to you. :)
patch v0.9 caused an embedding smoketest blocker today (see bug 189222). not sure why it took so long to see the regression.
Alex, there is an error in your script. You should probably change the line

} elseif ($_FILES['data']['type'] != "application/x-zip-compressed") {

to accept the "application/zip" type too. I tried to send my data today and received this error: "You forgot to zip up your data before sending it! We told you that it wouldn't work. The type of file you tried to send was application/zip. A zip file is application/x-zip-compressed."
> Alex, there is an error in your script. You should probably change the line
>
> } elseif ($_FILES['data']['type'] != "application/x-zip-compressed") {
>
> to accept the "application/zip" type too.

Whoops... should be fixed now. Try sending your data again. I don't suppose there are any other MIME types that zip files can be?
well, if you're unlucky, they can be application/octet-stream, I suppose
Will this bug better sort the autocomplete results when starting to type in a recipient's name? (I read most of the bug and did a "Find" for "mail" but found nothing conclusive.)
Peter, you should consider looking at the patch. This bug has nothing to do with mail so far (though the results could perhaps be applied there sometime in the future if the experiment is a success).
As promised, here's the link to the project page titled "Applying machine learning to autocomplete" on mozilla.org. Any feedback on the page is very welcome! Please participate in the data gathering process! Thanks! http://www.mozilla.org/projects/ml/autocomplete/index.html
OK, I've updated the anon form to link to the page. I've checked with Kerz and he's okay with MozillaZine hosting the form as long as it doesn't spike the traffic too much, so I'll be keeping an eye on that over the next few days.

As the form hasn't really been tested much, it would be great if anyone here who tries it could post a comment or email me about whether it a) worked or b) didn't, at least for the first few days. Sorry, should have sorted this earlier.

Nisheeth, do you want me to modify the form so that it also emails the data to ang@cs.stanford.edu? Also, if there are any other changes you want made, just yell.
Thanks for updating the text on the page, Alex! I think it looks great! Yes, please add ang@cs.stanford.edu to the recipient list for submitted data files.
> Yes, please add ang@cs.stanford.edu to the recipient list for submitted data
> files.

OK, the script should now Cc all the data files to him. I've also updated all the mailto: links to include ang@cs.stanford.edu.

> Any feedback on the page is very welcome!

A couple of things:

* You variously use "I" and "us" on the page. Probably best if you spell out who you mean, as nobody reads the authors line at the top.

* In point 3 of the 'Data Collection' section, you could tell people to use about:config rather than manually altering prefs.js (which is much easier to screw up):

===
Type about:config in Mozilla's Location Bar to bring up a list of all your preferences. Right-click on any of them (Ctrl + Click if you're using a Mac) to bring up the context menu and then select 'Integer' from the 'New' submenu. Enter the following as the preference name: <code>browser.history.url.datacapture.mode</code>. When asked for the value, enter <code>2</code> or <code>1</code>. Both settings will dump data into a file called url-data.txt in your profile directory. The different data capture modes are explained below.
===

* If you really wanted, you could include the anonymous submission form directly on the page:

<form action="http://www.mozillazine.org/misc/bug-182366-submit-anon-url-data.html" enctype="multipart/form-data" method="POST">
<p><input type="file" name="data"> <input type="submit" name="send" value="Send"></p>
</form>

That should work, assuming I'm not forgetting about any security measures surrounding uploaded data from a form on one domain being processed by a script on another.

* You might want to include a brief explanation of exactly what machine learning is.
so are we backing this out, now that 1.3beta has shipped?
Oh. My. God. Before you try to make autocomplete be all DWIMmy AI, how about first making it be something (anything) less than completely incomprehensible? See very simple and obvious proposal in bug http://bugzilla.mozilla.org/show_bug.cgi?id=175725
Alex, I just got this message from your script:

> The type of file you tried to send was application/zip.
> A zip file is application/x-zip-compressed or application/zip.

Nice, huh?
Alex, I updated http://www.mozilla.org/projects/ml/autocomplete/index.html with your prose about "about:config".

alecf, yes, the plan is still to back this out for 1.3 final. Bug 193347 is a 1.3 final blocker and will take care of the code bloat.
> Alex, I just got this message from your script:
> > The type of file you tried to send was application/zip.
> > A zip file is application/x-zip-compressed or application/zip.
>
> Nice, huh?

Sorry, looking into it.
Andreas, do you want to try sending your file again? I think the problem's fixed now.
Alex: ok, same file worked now.
> Alex: ok, same file worked now.

Cool. It was a bracket thing. I didn't have enough brackets so the NOT operator's scope was restricted to just the left-hand side of the... wait, nobody cares!
What's the status of backing this out? At this point it needs to come out of both 1.3 branch and 1.4a trunk.....
> What's the status of backing this out? At this point it needs to come out of
> both 1.3 branch and 1.4a trunk.....

See bug 193347. I believe the fix landed on the trunk before 1.3 was branched.
I have enabled ("browser.urlbar.autocomplete.learning.mode", 2) and the most evident change is a delay when typing in the location bar. In particular, the first character of any URL freezes the browser (ID:2003021008, Linux) for several seconds. Perhaps the search space would be much smaller if the proposed URL were only calculated once at least two or three characters are already known (apart from http(s)://, www, etc.). For those first characters, the classical algorithm could still be applied.
Gunnar, I've received the performance complaint from a couple of other people as well who've had large history files. The learning code wasn't streamlined for performance at all and was meant only as a fun prototype for people to play around with. Once we get the data back from Mozilla volunteers, and if we find that a learning algorithm works better than the current algorithm, we'll implement it with performance in mind.

It would really help us to know the processor speed and memory of your Linux box. It would also help to get our hands on your history file so that we can test our learning algorithm against it...
Is the autocomplete code still in the Mozilla builds? I downloaded RC1 and tried to activate it, but the feature seems to be broken. Or was the code removed for the 1.3 release?
no, it was removed from all builds after the 1.3beta release.
Does the project still need additional learning data? If so, is it possible to get a post-1.3 version with this turned on?
Gordon, the project has received enough data (around 400 data files), so there isn't any need to put the code back in. That said, more data is always good, so if you want to participate in this for a couple of weeks, please use the 1.3 beta builds for that time to collect data. We should still be able to use whatever data you submit to us a couple of weeks from now. Thanks!
This project is very important; in fact, it is the first solid step towards creating a really intelligent user interface for Mozilla applications. It is a strategic project, so the code shouldn't be removed, even if there were some bugs.

As for training data: I think the data should mostly come from the user, as the result of user-specific experience: Mozilla applications can collect data from the user and use it for autocompletion. Thanks, Nisheeth!
Blocks: 168902
Comment on attachment 110901 [details] [diff] [review] Patch v0.8: Patch that addresses Heikki and Chris Aillon's comments removing obsolete review request
Attachment #110901 - Flags: superreview?(hewitt)
What's going on with this? Is it planned for 1.4? 1.5?
We're in the middle of data analysis on the data that we got back. A quick update: we have written a Perl script to convert the data file into a stream of URL feature vectors, and a Matlab script to run a softmax regression probabilistic model on it. The short-term plan is to compare how this model does against the current Mozilla algorithm on the data. If anyone is interested in more detail about what we are doing, please contact me. I think results from this work will start showing up in the 1.5 timeframe.
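For readers unfamiliar with the term, softmax regression is the multi-class generalization of logistic regression. The formula below is the standard textbook form, not necessarily the exact parameterization used in the Matlab script; how candidate URLs and their feature vectors are mapped onto the classes is described in the report:

P(y = c \mid x) = \frac{\exp(w_c^\top x)}{\sum_{j=1}^{k} \exp(w_j^\top x)}, \qquad c = 1, \ldots, k

where x is the feature vector and w_1, ..., w_k are the learned per-class weight vectors. With k = 2 this reduces to plain logistic regression (the sigmoid 1 / (1 + exp(-w^T x))).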
Blocks: 205158
Came across this from bug 177360. I've heard a couple people ask about it in the mozillazine forums, and 1.5a is out. Inquiring minds want to know: what's up with this project?
Is this working now in moz 1.5? I didn't get any url-data.txt file or any effect by enabling user_pref("browser.urlbar.autocomplete.learning.mode", 2);
>Is this working now in moz 1.5? no
How about setting a target milestone for this project? Prog.
the author/driver behind this project is gone now. I suggest we mark this WONTFIX.
Actually, he just needs to update his bugmail address, I think. I talked to Nisheeth recently and he is still working on this at Stanford, though I'm not sure what pace or timeframe he is looking at, or if he is doing it purely for academic reasons. I'll see if I can get him to comment here.
I would still be interested to see his results, although I am quite happy with how the address bar works at the moment. I also fear that it would make Mozilla a bit slower again.
caillon: Any progress in getting Nisheeth's status? Is this still to be considered an 'assigned' bug?
Sorry for the long silence. I was back in school and had very little time away from course work. I worked on this off and on last year and then continued this work as part of my class project for the machine learning class I took earlier this year.

Happily, we are now at a stage where we can make a call on whether or not to use the machine learning approach. The final iteration of the machine learning approach saves the user an average of one keystroke per autocomplete event (on a data set of 237 users) as compared to the current Mozilla algorithm. I am about to upload the project report and some slides to mozilla.org that describe the different ideas we tried and the results we achieved.

The final result is good but not great. The question we need to discuss now is whether a one-keystroke-per-event advantage justifies changing the current Mozilla autocomplete code... I'd love some input on this question from all of you who've been following the progress of this bug. I will post links to the project report and the slides (which basically summarize the report) once I've uploaded them. Please read them and post your comments to this bug. Thanks a lot!
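As one plausible reading of that number (this is an illustrative definition of the metric, not necessarily the one used in the report): if for autocomplete event e the user must type k_cur(e) characters before the intended URL reaches the top of the dropdown under the current algorithm, and k_ml(e) characters under the learned model, then the average saving over N events is

\text{saving} = \frac{1}{N} \sum_{e=1}^{N} \bigl( k_{\mathrm{cur}}(e) - k_{\mathrm{ml}}(e) \bigr)

so "one keystroke per autocomplete event" means this average is roughly one character of typing.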
Thanks for the update. In my experience, Mozilla already has a feature that provides about a one-keystroke-per-event advantage compared to its default autocomplete: it's called "Autocomplete best match as you type" (browser.urlbar.autoFill=true). The only problem I find with this feature is that it's disabled by default, and this brings me to where machine learning could really help Mozilla: better default prefs. I firmly believe that gathering information from users (about their browsing patterns) can help Mozilla choose better defaults. I won't tire anyone with this off-topic issue, as it probably requires a new bug. Prog.
Thanks for the update. I look forward to seeing what you have.
One keystroke isn't much, but it may still feel like it works better than the default behaviour or the autoFill behaviour. I would still like to experiment with it. Maybe it should be an installable addon?
Just uploaded the report at http://www.mozilla.org/projects/ml/autocomplete/ac-report.html. Also, a PDF version of the slides that summarize the report is at http://www.mozilla.org/projects/ml/autocomplete/ac-preso.pdf.
I tried to configure the Mozilla web browser for data capture mode '2' to participate in this autocomplete project, but the file "url-data.txt" was not created.
Product: Core → Mozilla Application Suite
Nisheeth, are you still working on this?
QA Contact: claudius → nobody
Now that we have the smart location bar (T3h Awesomebar) with the Places frecency algorithm, I'm marking this bug FIXED and closing.
Status: ASSIGNED → RESOLVED
Closed: 16 years ago
Resolution: --- → FIXED
No longer blocks: 205158