    Repositories list

    • DeepSpeed

      Public
      DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
      Python
      Apache License 2.0
      4.1k002 Updated Nov 12, 2024
    • NeMo

      Public
      A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
      Python
      Apache License 2.0
      2.5k000 Updated Aug 2, 2024
    • NeMo Megatron launcher and tools
      Python
      Apache License 2.0
      140000 Updated Aug 2, 2024
    • llama-moe

      Public
      ⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)
      Python
      Apache License 2.0
      4688340 Updated Jun 25, 2024
    • OpenCompass is an LLM evaluation platform, supporting a wide range of models (LLaMA, LLaMa2, ChatGLM2, ChatGPT, Claude, etc) over 50+ datasets.
      Python
      Apache License 2.0
      437001 Updated Nov 3, 2023
    • Ongoing research training transformer language models at scale, including: BERT & GPT-2
      Python
      Other
      2.4k001 Updated Oct 13, 2023
    • llama

      Public
      Inference code for LLaMA models
      Python
      GNU General Public License v3.0
      9.6k100 Updated Apr 15, 2023