mixtral-inference

Inference code for Mistral's Mixtral 8x7B mixture-of-experts model. Largely based on the Mistral 7B inference repository. Requires ~100GB of VRAM.
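
The ~100GB figure is consistent with a rough estimate of the memory taken by the weights alone. The sketch below assumes 16-bit weights and the 46.7B total parameter count that Mistral AI published for Mixtral 8x7B; neither number is stated in this README, so treat the result as an approximation rather than a measurement.

```python
# Back-of-the-envelope estimate of weight memory. Both constants are assumptions:
# they do not come from this repository.
TOTAL_PARAMS = 46.7e9   # total parameters across all experts, as published by Mistral AI
BYTES_PER_PARAM = 2     # assumes 16-bit weights (fp16 or bf16)


def weight_memory_gb(num_params: float, bytes_per_param: int) -> float:
    """Return the memory needed for the weights alone, in decimal gigabytes."""
    return num_params * bytes_per_param / 1e9


if __name__ == "__main__":
    gb = weight_memory_gb(TOTAL_PARAMS, BYTES_PER_PARAM)
    print(f"weights alone: ~{gb:.1f} GB")
```

Under these assumptions the weights come to roughly 93 GB. The rest of the budget goes to activations and the key-value cache, which grow with batch size and sequence length.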

Dependencies

PyTorch, SentencePiece, and xformers. Install them with:

pip install -r requirements.txt

Usage

The script assumes 8 CUDA devices are available. To change this, edit the code near the bottom of main.py.

python main.py
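
Before launching, it can help to confirm that the machine matches these assumptions. The following is a minimal sketch of such a preflight check and is not part of this repository: `find_problems` and `query_vram_per_device_gb` are hypothetical helper names, and the thresholds are taken from the figures quoted in this README.

```python
# Hypothetical preflight check; not part of this repository.
REQUIRED_DEVICES = 8     # from the README: main.py assumes 8 CUDA devices
REQUIRED_VRAM_GB = 100   # from the README: roughly 100GB of VRAM in total


def find_problems(vram_per_device_gb,
                  required_devices=REQUIRED_DEVICES,
                  required_vram_gb=REQUIRED_VRAM_GB):
    """Return a list of human-readable problems; an empty list means none found."""
    problems = []
    if len(vram_per_device_gb) < required_devices:
        problems.append(
            f"found {len(vram_per_device_gb)} CUDA device(s), expected {required_devices}"
        )
    total_gb = sum(vram_per_device_gb)
    if total_gb < required_vram_gb:
        problems.append(
            f"total VRAM is {total_gb:.0f} GB, expected about {required_vram_gb} GB"
        )
    return problems


def query_vram_per_device_gb():
    """Return the total memory of each visible CUDA device, in decimal gigabytes."""
    import torch  # imported lazily so find_problems stays usable without PyTorch

    if not torch.cuda.is_available():
        return []
    return [
        torch.cuda.get_device_properties(i).total_memory / 1e9
        for i in range(torch.cuda.device_count())
    ]
```

Calling `find_problems(query_vram_per_device_gb())` and printing the result before running main.py surfaces a device-count or memory shortfall early, instead of partway through loading the weights.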

vikhyat/mixtral-inference
