WASM AI

Everything you need to run LLMs natively in the browser
and look good doing it.

Live Demo • Key Features • One-click deploy • Usage


WASM AI is a quickstart template for running large language models entirely in the browser. Modern 7B LLMs (even quantized to q4) are incredibly intelligent: good enough for text-to-SQL search, creative writing, analysis, NLP and other tasks, or to be a friend on an airplane. You can now run them in the browser for complete privacy, with fast local inference, without a cent of cloud costs.

WASM AI puts together work from far more talented people (like the folks at MLC LLM, who built the library to compile Hugging Face models into other formats, and Vercel, who made Vercel AI and the chatbot template).

Key Features

This repo is meant to be a quickstart for building and iterating on local, open-source models in the browser, and even distributing them as part of larger apps. We have a few things here that might be useful:

Two smart, compiled models
- Dolphin 2.2.1 and OpenHermes-2.5 are provided as compiled wasm-compatible models to test. I can compile other models on request, when I get the time.
Swap between local and cloud easily
- I kept things as compatible as I could with Vercel's AI library, which has useful things like backpressure and streaming. You can swap between local and cloud inference by changing these two constants. That's it. This should make testing and validation easier for your apps.
Web workers
- This took some figuring out, but the local model and inference sit inside a worker, so the UI runs more smoothly.
UI bells and whistles
- Live code and Markdown formatting, scroll to bottom, etc. I got most of these from the chatbot template, but I've cleaned out everything else and done a fresh migration.
Local Transcription with Whisper
- In the spirit of doing everything in the browser, Whisper-turbo is now integrated for voice chat directly in the browser. If you'd like just the base chat features, pull the just-chat branch.
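
To illustrate the web worker item above, here is a minimal sketch of the general pattern, not the code in this repo. It assumes a small message protocol between the UI thread and the worker; every name in it (`WorkerRequest`, `WorkerResponse`, `applyResponse`, `handleRequest`, `fakeGenerate`) is hypothetical, and `fakeGenerate` is a stand-in that echoes the prompt instead of calling a real model.

```typescript
// Hypothetical message protocol; none of these names come from this repo.
type WorkerRequest = { type: "generate"; id: string; prompt: string };
type WorkerResponse =
  | { type: "token"; id: string; text: string }
  | { type: "done"; id: string }
  | { type: "error"; id: string; message: string };

interface ChatState {
  text: string;
  status: "idle" | "streaming" | "done" | "error";
  error?: string;
}

// UI side: fold each worker message into the chat state.
function applyResponse(state: ChatState, msg: WorkerResponse): ChatState {
  switch (msg.type) {
    case "token":
      return { ...state, text: state.text + msg.text, status: "streaming" };
    case "done":
      return { ...state, status: "done" };
    case "error":
      return { ...state, status: "error", error: msg.message };
  }
}

// Stand-in for the real model: echoes the prompt back one word at a time.
async function fakeGenerate(
  prompt: string,
  onToken: (text: string) => void
): Promise<void> {
  for (const word of prompt.split(" ")) {
    await Promise.resolve(); // yield between tokens, as a real decode step would
    onToken(word + " ");
  }
}

// Worker side: run one request and report progress through `post`.
async function handleRequest(
  req: WorkerRequest,
  post: (msg: WorkerResponse) => void
): Promise<void> {
  try {
    await fakeGenerate(req.prompt, (text) =>
      post({ type: "token", id: req.id, text })
    );
    post({ type: "done", id: req.id });
  } catch (err) {
    const message = err instanceof Error ? err.message : String(err);
    post({ type: "error", id: req.id, message });
  }
}

// In an actual worker file the wiring would be roughly:
//   self.onmessage = (e) => handleRequest(e.data, (msg) => self.postMessage(msg));
```

Keeping `handleRequest` independent of `self.postMessage` means the same function can be exercised outside a worker, and the explicit `error` message gives the UI something to show when inference fails.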

This repo is the work of one overworked dev, and is meant for educational purposes. Use at your own risk!

For other projects, check out wishful search, or say hi on Twitter!

One-click deploy

Deploy your own to Vercel with a single click:

Usage

Clone the repo. Then:

yarn
yarn dev

That's it!

Not done yet

Error handling - Sometimes things fail. I haven't handled those times yet. For all the other times, there's Masterca-
More support - There's a crypto.randomUUID issue on mobile even on WebGPU-enabled Chrome. I'm torn between patching the web-llm package or asking them to help.
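
One possible workaround for the `crypto.randomUUID` issue, sketched here as an assumption rather than a verified fix for this repo: `crypto.randomUUID` is only exposed in secure contexts (HTTPS or localhost), while `crypto.getRandomValues` is not restricted that way, so a fallback can build a version 4 UUID from random bytes. The helper names `uuidV4FromBytes` and `safeRandomUUID` are hypothetical, and whether the secure-context restriction is the actual cause on mobile has not been confirmed.

```typescript
// Minimal shape of the Web Crypto object that this sketch relies on.
interface CryptoLike {
  randomUUID?: () => string;
  getRandomValues?: (array: Uint8Array) => Uint8Array;
}

// Formats 16 bytes as an RFC 4122 version 4 UUID string.
function uuidV4FromBytes(bytes: Uint8Array): string {
  if (bytes.length !== 16) throw new Error("expected exactly 16 bytes");
  const b = Uint8Array.from(bytes); // copy so the caller's buffer is untouched
  b[6] = (b[6] & 0x0f) | 0x40; // set the version nibble to 4
  b[8] = (b[8] & 0x3f) | 0x80; // set the variant bits to 10xx
  const hex = Array.from(b, (x) => x.toString(16).padStart(2, "0")).join("");
  return [
    hex.slice(0, 8),
    hex.slice(8, 12),
    hex.slice(12, 16),
    hex.slice(16, 20),
    hex.slice(20),
  ].join("-");
}

// Uses the native randomUUID when present, otherwise falls back to
// getRandomValues. Throws if neither is available.
function safeRandomUUID(): string {
  const c = (globalThis as unknown as { crypto?: CryptoLike }).crypto;
  if (c && typeof c.randomUUID === "function") return c.randomUUID();
  if (c && typeof c.getRandomValues === "function") {
    return uuidV4FromBytes(c.getRandomValues(new Uint8Array(16)));
  }
  throw new Error("no secure random source available");
}
```

A fallback like this could be applied at the call site in app code; patching `crypto.randomUUID` globally so that the web-llm package picks it up is a separate decision.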

seanbirchall/wasm-ai
