feature/fast-embeddings #405

gecBurton · 2024-05-17T12:32:59Z

Context

As a user, I want the model to be loaded once at startup rather than on every embedding, so that embeddings are computed quickly.

Changes proposed in this pull request

Previously we used FastAPI's Depends to handle dependency injection for the ML model, but this loaded the model on every function call.

We could not switch to FastAPI's lifespan because it is incompatible with FastStream.

By reverting to pure FastStream we can use its own lifespan, which does work, but this requires switching from the HTTP health check to a container healthcheck.
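
The difference between loading per call and loading once at startup can be illustrated with a small, framework-free sketch. It is not the code from this pull request: FakeEmbeddingModel, get_model, on_startup and the state dictionary are hypothetical names, and on_startup merely stands in for whichever startup hook the framework provides. It only mirrors the behaviour described above, where a per-call dependency rebuilds the model each time.

```python
import asyncio

load_count = 0  # counts how many times the (fake) model is constructed


class FakeEmbeddingModel:
    """Hypothetical stand-in for an embedding model that is expensive to load."""

    def __init__(self) -> None:
        global load_count
        load_count += 1  # a real model would read weights from disk here

    def embed(self, text: str) -> list[float]:
        # Toy embedding: a single number, the length of the text.
        return [float(len(text))]


# Pattern 1: the dependency builds the model, so every call pays the load cost.
def get_model() -> FakeEmbeddingModel:
    return FakeEmbeddingModel()


async def embed_per_call(text: str) -> list[float]:
    return get_model().embed(text)


# Pattern 2: a startup hook loads the model once into shared state.
state: dict[str, FakeEmbeddingModel] = {}


async def on_startup() -> None:
    state["model"] = FakeEmbeddingModel()


async def embed_shared(text: str) -> list[float]:
    return state["model"].embed(text)


async def demo() -> tuple[int, int]:
    """Return (loads with pattern 1, loads with pattern 2) for three calls each."""
    global load_count

    load_count = 0
    for text in ("a", "bb", "ccc"):
        await embed_per_call(text)
    per_call_loads = load_count

    load_count = 0
    await on_startup()
    for text in ("a", "bb", "ccc"):
        await embed_shared(text)
    return per_call_loads, load_count
```

Calling asyncio.run(demo()) compares the two patterns over three embedding calls each: the per-call dependency constructs the model three times, while the startup hook constructs it once.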

Guidance to review

Relevant links

depends on https://github.com/i-dot-ai/i-ai-core-infrastructure/pull/260

Things to check

I have added any new ENV vars in all deployed environments
I have tested any code added or changed
I have run integration tests https://github.com/i-dot-ai/redbox-copilot/actions/runs/9130721587

use container healthcheck

95080c0

gecBurton marked this pull request as draft May 17, 2024 12:33

George Burton added 3 commits May 17, 2024 14:03

removed fastapi

c632371

switched to faststream

7592c82

restored image_tag

daddad1

gecBurton temporarily deployed to release May 17, 2024 13:27 — with GitHub Actions Inactive

fix typo

9e84625

gecBurton temporarily deployed to release May 17, 2024 13:27 — with GitHub Actions Inactive

gecBurton changed the title ~~use container healthcheck~~ feature/fast-embeddings May 17, 2024

gecBurton marked this pull request as ready for review May 17, 2024 13:51

pinned torch

4ffc1af

gecBurton temporarily deployed to release May 17, 2024 13:55 — with GitHub Actions Inactive

lmwilkigov approved these changes May 17, 2024


George Burton added 3 commits May 17, 2024 16:16

fixing bugs in dockerfile

317598d

corected test value

701a5ee

remove debug step

d22f0ca

gecBurton merged commit a5609e3 into main May 17, 2024
10 checks passed

gecBurton deleted the feature/container-health-check branch May 17, 2024 16:15
