Closed Bug 1842763 Opened 2 years ago Closed 1 year ago

Improve translation of sentences containing numbers

Categories

(Firefox :: Translations, enhancement)

RESOLVED MOVED

People

(Reporter: marco, Unassigned)

This might be fixed by data cleaning and data augmentation.

For example, our datasets contain "bad" sentence pairs in which the two sides use different units of measurement (e.g. pounds in English, kg in Italian). It's even worse when there is no fixed formula to convert between the units (e.g. dollars in en-US, euros in Italian).
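One way the data-cleaning idea could look in practice is a filter that drops sentence pairs whose two sides mention different units. This is only a rough sketch: the `UNIT_FORMS` table and the function names are hypothetical, the vocabulary is deliberately tiny, and a real pipeline would need a much larger per-language unit list (and would have to deal with ambiguous words such as "pound").

```python
import re

# Hypothetical, deliberately tiny vocabulary: surface form -> canonical unit.
# A real cleaning step would need a far larger table per language.
UNIT_FORMS = {
    "lb": "pound", "lbs": "pound", "pound": "pound", "pounds": "pound",
    "libbra": "pound", "libbre": "pound",
    "kg": "kilogram", "kilogram": "kilogram", "kilograms": "kilogram",
    "chilogrammo": "kilogram", "chilogrammi": "kilogram",
    "$": "dollar", "dollar": "dollar", "dollars": "dollar",
    "dollaro": "dollar", "dollari": "dollar",
    "€": "euro", "euro": "euro", "euros": "euro",
}

# Currency symbols, or runs of letters (Unicode-aware, digits excluded).
TOKEN_RE = re.compile(r"[$€]|[^\W\d_]+")


def units_in(sentence):
    """Return the set of canonical units mentioned in a sentence."""
    tokens = TOKEN_RE.findall(sentence.lower())
    return {UNIT_FORMS[t] for t in tokens if t in UNIT_FORMS}


def has_unit_mismatch(source, target):
    """True when at least one side names a unit and the two sides disagree."""
    src, tgt = units_in(source), units_in(target)
    return bool(src or tgt) and src != tgt


def clean_pairs(pairs):
    """Keep only the sentence pairs whose unit mentions agree."""
    return [(s, t) for s, t in pairs if not has_unit_mismatch(s, t)]
```

For instance, `has_unit_mismatch("It weighs 10 pounds.", "Pesa 4,5 kg.")` flags the pair, while a pair that uses kg on both sides is kept. Pairs with no recognised unit on either side pass through unchanged.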

Blocks: 1842765
Duplicate of this bug: 1797667

Aside from unit conversions, there are cases where the translation drops digits from numbers, for example turning 244890 into 24,890. Since this is a silent error, and an order-of-magnitude one, it could be a problem...

See bug 1797667.

I wonder if we should also write some deterministic code to verify and audit numbers in translations, since the translation models are just statistical. This may be something to do on the Bergamot engine side of things.
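A minimal sketch of what such a deterministic check might look like, written in Python for illustration only (this is not Bergamot code, and `audit_numbers` is a hypothetical name). It assumes that comparing the bare digit sequences of each number is good enough, so that locale differences in grouping and decimal separators are ignored. That assumption means it cannot tell 1.5 from 15, and it does not handle spelled-out numbers.

```python
import re
from collections import Counter

# A number-like run: starts and ends with a digit, and may contain
# '.' or ',' in between (grouping or decimal separators).
NUMBER_RE = re.compile(r"\d(?:[\d.,]*\d)?")


def digit_strings(text):
    """Count the numbers in text, each reduced to its bare digits."""
    return Counter(re.sub(r"\D", "", m) for m in NUMBER_RE.findall(text))


def audit_numbers(source, translation):
    """Report digit sequences present on one side but not the other."""
    src, out = digit_strings(source), digit_strings(translation)
    return {
        "missing": sorted((src - out).elements()),
        "unexpected": sorted((out - src).elements()),
    }
```

On the example from this bug, `audit_numbers("The population is 244890.", "La popolazione è 24,890.")` reports `244890` as missing and `24890` as unexpected, whereas `1,234.50` and `1.234,50` compare as equal. A check like this could only flag a translation for review; it does not say which side is right.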

An order-of-magnitude error is pretty concerning.

No longer blocks: 1842762, 1842765
Status: NEW → RESOLVED
Closed: 1 year ago
Resolution: --- → MOVED