Skip to content
Change the repository type filter

All

    Repositories list

    • lorax

      Public
      Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
      Python
      Apache License 2.0
      1432.2k13019Updated Nov 15, 2024Nov 15, 2024
    • Python
      21820Updated Sep 5, 2024Sep 5, 2024
    • Jupyter Notebook
      1300Updated Mar 3, 2024Mar 3, 2024
    • Best practices for distilling large language models.
      Jupyter Notebook
      2939300Updated Feb 1, 2024Feb 1, 2024
    • The official Python client for the Huggingface Hub.
      Python
      Apache License 2.0
      554000Updated Dec 18, 2023Dec 18, 2023
    • volcano

      Public archive
      A Cloud Native Batch System (Project under CNCF)
      Go
      Apache License 2.0
      969001Updated Dec 4, 2023Dec 4, 2023
    • punica

      Public
      Serving multiple LoRA finetuned LLM as one
      Cuda
      46100Updated Nov 24, 2023Nov 24, 2023
    • volcano-apis

      Public archive
      The API (CRD) of Volcano
      Go
      Apache License 2.0
      59000Updated Nov 8, 2023Nov 8, 2023
    • LlamaIndex (GPT Index) is a data framework for your LLM applications
      Python
      MIT License
      5.3k000Updated Aug 1, 2023Aug 1, 2023
    • langchain

      Public
      ⚡ Building applications with LLMs through composability ⚡
      Python
      MIT License
      15k000Updated Jul 20, 2023Jul 20, 2023
    • Kubernetes Image Puller is used for caching images on a cluster. It creates a DaemonSet downloading and running the relevant container images on each node.
      Go
      Eclipse Public License 2.0
      30000Updated Apr 20, 2023Apr 20, 2023
    • PyBump

      Public
      Bump version in Helm Chart.yaml and setup.py files
      Python
      Apache License 2.0
      8000Updated Dec 22, 2022Dec 22, 2022
    • server

      Public
      The Triton Inference Server provides an optimized cloud and edge inferencing solution.
      Python
      BSD 3-Clause "New" or "Revised" License
      1.5k000Updated Oct 22, 2022Oct 22, 2022
    • An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models
      HTML
      Apache License 2.0
      831000Updated Aug 24, 2022Aug 24, 2022
    • dask-sql

      Public
      Distributed SQL Engine in Python using Dask
      Python
      MIT License
      72100Updated Apr 5, 2022Apr 5, 2022
    • Python
      BSD 3-Clause "New" or "Revised" License
      13100Updated Feb 23, 2022Feb 23, 2022
    • neuropod

      Public
      A uniform interface to run deep learning models from multiple frameworks
      C++
      Apache License 2.0
      77300Updated Feb 23, 2022Feb 23, 2022
    • GitHub action for identifying the last successful commit for a given workflow and branch.
      JavaScript
      49000Updated Jan 5, 2021Jan 5, 2021