Open Bug 1932407 Opened 7 days ago Updated 7 days ago

Inference Engine: Memory-Efficient Model Downloading

Product: Core
Component: Machine Learning
Type: enhancement
Priority: Not set
Severity: --
Status: NEW
Reporter: atossou
Assigned: atossou
Whiteboard: [genai]

Aristide Tossou (Assignee)
Description • 7 days ago

Currently, the model is fully downloaded into an ArrayBuffer, which can use too much memory for large models.

I suggest we stream the download directly to a file so that memory usage stays constant regardless of model size.

An implementation is already available here: https://phabricator.services.mozilla.com/D229641#change-F8nGVnX6zdic
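
For illustration only, the idea described above (write each chunk to disk as it arrives instead of buffering the whole model) might look roughly like the sketch below. This is not the code from the linked patch, and it is written in Python rather than the JavaScript the engine uses. The function name `stream_to_file`, the 1 MiB chunk size, and the `.part` temporary-file suffix are all hypothetical choices made for this example.

```python
import hashlib
import os
import tempfile
from typing import BinaryIO, Tuple

# Hypothetical chunk size; peak memory is bounded by roughly one chunk.
CHUNK_SIZE = 1024 * 1024  # 1 MiB


def stream_to_file(
    source: BinaryIO, dest_path: str, chunk_size: int = CHUNK_SIZE
) -> Tuple[int, str]:
    """Copy `source` to `dest_path` one chunk at a time.

    Returns (bytes_written, sha256_hex). Only one chunk is held in
    memory at any moment, regardless of the total size.
    """
    digest = hashlib.sha256()
    total = 0
    # Write to a temporary file in the destination directory so that a
    # partial download never appears under the final name.
    dest_dir = os.path.dirname(os.path.abspath(dest_path))
    fd, tmp_path = tempfile.mkstemp(dir=dest_dir, suffix=".part")
    try:
        with os.fdopen(fd, "wb") as out:
            while True:
                chunk = source.read(chunk_size)
                if not chunk:  # empty bytes means end of stream
                    break
                out.write(chunk)
                digest.update(chunk)
                total += len(chunk)
        # Rename into place only after the whole stream was written.
        os.replace(tmp_path, dest_path)
    except BaseException:
        # Clean up the partial file, then re-raise the original error.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
    return total, digest.hexdigest()
```

Any object with a `read(n)` method can serve as `source`, for example the response returned by `urllib.request.urlopen(url)`. Computing the hash while streaming is an optional extra assumed here so that integrity can be checked without reading the file back into memory.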

Jira Integration Bot
Updated • 7 days ago

See Also: → https://mozilla-hub.atlassian.net/browse/GENAI-402
