Utilize Intl.Segmenter to Segment Translations Source Text
Categories
(Firefox :: Translations, enhancement, P2)
Tracking
()
Tracking | Status | |
---|---|---|
firefox135 | --- | fixed |
People
(Reporter: nordzilla, Assigned: nordzilla)
References
(Depends on 1 open bug, Blocks 3 open bugs)
Details
Attachments
(5 files)
48 bytes,
text/x-phabricator-request
|
Details | Review | |
48 bytes,
text/x-phabricator-request
|
Details | Review | |
Bug 1917851 - Improve Left Double Quote Segmentation for CJK Translations r=#translations-reviewers!
48 bytes,
text/x-phabricator-request
|
Details | Review | |
48 bytes,
text/x-phabricator-request
|
Details | Review | |
48 bytes,
text/x-phabricator-request
|
Details | Review |
Description
Bug 1917849 will provide an API through which Bergamot can accept pre-segmented text as input.
At this point, we can utilize the Intl.Segmenter
to segment the text in a CJK-compatible way before sending it to Bergamot.
Steps to implement
- Modify the way that translation requests are sent to segment the text into sentences prior to sending them to Bergamot.
- Ensure this code is tested.
Assignee | ||
Updated•8 months ago
|
Assignee | ||
Comment 1•8 months ago
|
||
This patch updates the Translations typescript definintions
to conform to the updated WASM bindings in Bergamot version 2.0
Assignee | ||
Comment 2•8 months ago
|
||
This patch updates the TranslationsEngine code to utilize the
the updated WASM bindings in Bergamot version 2.0
Depends on D230443
Assignee | ||
Comment 3•8 months ago
|
||
This patch adds logic to our text-cleaning algorithm to improve
segmentation in a specific edge-case scenario where the Intl.Segmenter
algorithm produces a sentence break that is not ideal for translation.
Depends on D230444
Assignee | ||
Comment 4•8 months ago
|
||
This bug pulls in the latest generated JavaScript WASM
glue code for Bergamot version 2.0 and bumps the major-version
constant to ensure that only 2.x versions are used from here on.
Depends on D230445
Assignee | ||
Comment 5•8 months ago
|
||
This patch updates the URL of the Translations end-to-end test
artifact fetches to uniqlize the latest version of Bergamot,
compiled from https://github.com/mozilla/translations
Depends on D230446
Updated•8 months ago
|
Updated•8 months ago
|
Updated•8 months ago
|
Updated•8 months ago
|
Updated•8 months ago
|
Comment 7•7 months ago
|
||
bugherder |
https://hg.mozilla.org/mozilla-central/rev/f5f727dd3676
https://hg.mozilla.org/mozilla-central/rev/441819130698
https://hg.mozilla.org/mozilla-central/rev/9fe8b6c56c4c
https://hg.mozilla.org/mozilla-central/rev/c1117994ec4b
https://hg.mozilla.org/mozilla-central/rev/e4dbc6234960
Description
•