Almost safe wrapper for BitNet-cpp based on llama-cpp
See bitnet-cpp-sys
- minimize crate size
- rewrite sampler PR 9294
- look into Metal implementation of MatMul kernerls for BitNet
- add more examples
- replace all remaining python from BitNet repo
- add better division between
arm64
andx86_64
- move python code generation into
patch
files
arm64 | x64 | OS | comments |
---|---|---|---|
✅ | ❌ | MacOS | tested fully on Apple Silicon macs |
❌ | Linux | should test on ARM based linux | |
❓ | ❌ | Windows | should test on ARM based windows |
Heavily inspired by llama-cpp-rs
© 2024, Eugene Hauptmann