Skip to content

Latest commit

 

History

History
27 lines (17 loc) · 4.6 KB

Acknowledgement.md

File metadata and controls

27 lines (17 loc) · 4.6 KB

Acknowledgement

Design

Code

We referenced or used the following projects:

# Project Description Location License
1 Unsloth calculate_settings to determine block size and warp; We reuse it for Norm and MLP Liger Kernel Utils Apache
2 Unsloth We modified and added dW calculation on top of Unsloth implementation Liger Kernel RMS Norm Apache
3 Triton tutorial We modified on top of triton tutorials Liger Kernel RMS Norm MIT
4 tiny shakespeare dataset We use tiny shakespeare dataset to conduct convergence test on mini model Liger Kernel Convergence N/A
5 Efficient Cross Entropy We use the idea of gradient-in-forward and chunking Liger Kernel Linear Cross Entropy MIT
6 Flash attn We take many optimization ideas from the work, such as tiling and recomputation BSD
7 AutoAWQ We reference the design of automodel Liger Kernel Auto Model MIT
8 llm.c We reference the design of end-to-end testing Liger Kernel Convergence Tests MIT

Many thanks to the contributors to these projects for their invaluable work that helped make Liger possible.