- @claire_yishan for the LOGO design
- Wave Snippets for generating the animated code snippets
We referenced or used the following projects:
# | Project | Description | Location | License |
---|---|---|---|---|
1 | Unsloth | calculate_settings to determine block size and warp; We reuse it for Norm and MLP |
Liger Kernel Utils | Apache |
2 | Unsloth | We modified and added dW calculation on top of Unsloth implementation | Liger Kernel RMS Norm | Apache |
3 | Triton tutorial | We modified on top of triton tutorials | Liger Kernel RMS Norm | MIT |
4 | tiny shakespeare dataset | We use tiny shakespeare dataset to conduct convergence test on mini model | Liger Kernel Convergence | N/A |
5 | Efficient Cross Entropy | We use the idea of gradient-in-forward and chunking | Liger Kernel Linear Cross Entropy | MIT |
6 | Flash attn | We take many optimization ideas from the work, such as tiling and recomputation | BSD | |
7 | AutoAWQ | We reference the design of automodel | Liger Kernel Auto Model | MIT |
8 | llm.c | We reference the design of end-to-end testing | Liger Kernel Convergence Tests | MIT |
Many thanks to the contributors to these projects for their invaluable work that helped make Liger possible.