stefan-it/model-garden-lms

🏡 TensorFlow Model Garden LMs

BERT with TensorFlow Model Garden

🔎 Overview

This repository showcases language model pretraining with the awesome TensorFlow Model Garden library.

The following LMs are currently supported:

💡 Features

Additionally, the following features are provided:

  • A cheatsheet for TPU VM creation (including all necessary dependencies to pretrain models with TF Model Garden library), which can be found here.
  • An extended pretraining data generation script that allows, for example, the use of tokenizers from the Hugging Face Model Hub or different data packing strategies (Original BERT packing or RoBERTa-like packing), which can be found here.
  • Conversion scripts that convert TF Model Garden weights to Hugging Face Transformers-compatible models, which can be found here.
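The difference between the two packing strategies mentioned above can be illustrated with a minimal sketch. This is not the repository's data generation script: the function names `pack_per_document` and `pack_across_documents` and the `sep_id` parameter are hypothetical. It also simplifies heavily, ignoring sentence-pair construction, special tokens such as `[CLS]`, and padding. It only assumes that BERT-style packing keeps each sequence inside one document, while RoBERTa-like packing fills sequences from a continuous token stream that may cross document boundaries.

```python
from typing import List


def pack_per_document(docs: List[List[int]], max_len: int) -> List[List[int]]:
    """BERT-like sketch: chunk each document on its own.

    No sequence ever contains tokens from two different documents,
    so the last chunk of a document may be shorter than max_len.
    """
    sequences = []
    for doc in docs:
        for start in range(0, len(doc), max_len):
            sequences.append(doc[start:start + max_len])
    return sequences


def pack_across_documents(
    docs: List[List[int]], max_len: int, sep_id: int
) -> List[List[int]]:
    """RoBERTa-like sketch: concatenate all documents into one stream.

    A separator token (hypothetical sep_id) marks each document boundary,
    and the stream is then cut into chunks of max_len tokens.
    """
    stream: List[int] = []
    for i, doc in enumerate(docs):
        if i > 0:
            stream.append(sep_id)  # mark the boundary between documents
        stream.extend(doc)
    return [stream[s:s + max_len] for s in range(0, len(stream), max_len)]


# Toy token ids standing in for tokenizer output.
docs = [[1, 2, 3], [4, 5], [6, 7, 8, 9, 10]]
per_doc = pack_per_document(docs, max_len=4)
across = pack_across_documents(docs, max_len=4, sep_id=0)
```

With these toy inputs, the per-document variant yields four sequences, some of them short, while the cross-document variant yields three full sequences. That reflects the usual trade-off: packing across documents wastes less of the sequence length, at the cost of mixing unrelated documents in one training example.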

🏡 Model Zoo

FineWeb-LMs

The following LMs were pretrained on the 10BT subsets of the famous FineWeb and FineWeb-Edu datasets:

All models can be found in the TensorFlow Model Garden LMs organization on the Model Hub and in this collection.

Detailed evaluation results with the ScandEval library are available in this repository.

❤️ Acknowledgements

This repository is the outcome of the last two years of working with TPUs from the awesome TRC program and the TensorFlow Model Garden library.

Made from Bavarian Oberland with ❤️ and 🥨.
