Should we build a remote interface for UForm? #13
-
Guys, should we build and maintain an API for embedding inference?
Replies: 2 comments
-
Of course! The question should be: how exactly do we build the remote interface?

At this point, all of the checkpoints pre-packaged into UForm are tiny. They are easy to deploy in any embedded setup. But some of the networks we are currently baking may require a custom setup for efficient inference at scale.

```python
from ujrpc.rich_posix import Server
import uform
import numpy
import PIL.Image

server = Server()
model = uform.get_model('unum-cloud/uform-vl-multilingual')

@server
def vectorize(description: str, photo: PIL.Image.Image) -> numpy.ndarray:
    # Preprocess both modalities and return the joint multimodal embedding
    image = model.preprocess_image(photo)
    tokens = model.preprocess_text(description)
    joint_embedding = model.encode_multimodal(image=image, text=tokens)
    return joint_embedding.cpu().detach().numpy()
```

Let's use UJRPC for those Remote Procedure Calls. On DGX-A100 servers with the current models, we can squeeze over 300K inferences per second. FastAPI, and even gRPC, can't sustain that; only UJRPC can. Luckily, there is now support for Pillow images, so the whole deployment script fits in roughly 10 lines of Python.
-
Great, let's do that!