Raising the estimate to 13 points because I may run into scalability issues. The Moreover corpus contains about 6M classified URLs, which may take a while to process. If DFR rule checking turns out to be too slow, I may have to go back and rework how rules are matched.
Points: 8 → 13
Assignee: nobody → mzhilyaev
Iteration: --- → 37.2
OS: Mac OS X → All
Hardware: x86 → All
Summary: Write script to run DFR rules on Moreover corpus and collect precision/recall stats per category and ruleset → Write script to run DFR rules on Moreover corpus to collect precision/recall stats per category and ruleset
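For reference, the per-category precision/recall collection described in the summary could be sketched roughly as below. This is only an illustration, not the script from the pfeed repository: `collect_stats`, `precision_recall`, the `classify` callback, and the `(url, expected_categories)` record shape are all hypothetical names and assumptions. Records are consumed one at a time, so a corpus of roughly 6M URLs would not need to be held in memory.

```python
from collections import defaultdict


def collect_stats(records, classify):
    """Accumulate per-category true/false positives and false negatives.

    records:  iterable of (url, expected_categories) pairs (assumed shape).
    classify: hypothetical callback mapping a URL to the categories
              that the rules assign to it.
    """
    counts = defaultdict(lambda: {"tp": 0, "fp": 0, "fn": 0})
    for url, expected in records:
        predicted = set(classify(url))
        expected = set(expected)
        for cat in predicted & expected:   # rule fired and label agrees
            counts[cat]["tp"] += 1
        for cat in predicted - expected:   # rule fired, label disagrees
            counts[cat]["fp"] += 1
        for cat in expected - predicted:   # label present, no rule fired
            counts[cat]["fn"] += 1
    return counts


def precision_recall(c):
    """Return (precision, recall); None where the denominator is zero."""
    tp, fp, fn = c["tp"], c["fp"], c["fn"]
    precision = tp / (tp + fp) if tp + fp else None
    recall = tp / (tp + fn) if tp + fn else None
    return precision, recall
```

To break the numbers down per ruleset as well, the dictionary key could be a `(ruleset, category)` tuple instead of the category alone.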
Commit pushed to master at https://github.com/mzhilyaev/pfeed

https://github.com/mzhilyaev/pfeed/commit/cd3f1fb20517cc77690ec59a589bed00d86efdcf
Merge pull request #8 from mzhilyaev/dfr-test

Closes Bug 1109977 - script to run DFR rules on Moreover corpus to collect precision/recall stats
Status: NEW → RESOLVED
Last Resolved: 4 years ago
Resolution: --- → FIXED