[back-end] Test effects of classification on non-content data (Common Crawl)

NEW
Unassigned

Status

Content Services Graveyard
Classification Engine
3 years ago
3 years ago

People

(Reporter: maxim zhilyaev, Unassigned)

Tracking

Firefox Tracking Flags

(Not tracked)

Details

(Reporter)

Description

3 years ago
Whatever classification is developed off Moreover corpus, it needs to be validated on other types of content.  Common Crawl could be a proper corpus to test precision/recall of content-base categorization.

Alternatively, classification may be turned off for sites not represented in Moreover corpus
You need to log in before you can comment on or make changes to this bug.