Fauno - The Italian Large Language Model that will leave you senza parole!

Get ready to meet Fauno - the Italian language model crafted by the RSTLess Research Group from the Sapienza University of Rome.

The research team behind Fauno includes Andrea Bacciu, Dr. Giovanni Trappolini, Professor Emanuele Rodolà, Andrea Santilli, and Professor Fabrizio Silvestri.

Fauno represents a cutting-edge development in open-source Italian Large Language Modeling. It's trained on extensive Italian synthetic datasets, encompassing a wide range of fields such as medical data 🩺, technical content from Stack Overflow 💻, Quora discussions 💬, and Alpaca data 🦙 translated into Italian.

Hence, our model can answer your questions in Italian 🙋, fix your buggy code 🐛 and understand medical literature 💊.

Compared to Camoscio, this model should deliver a conversational experience closer to ChatGPT's, since it is trained on more conversational data, generated by having ChatGPT converse with itself and then translated into Italian (see the Datasets section and the Baize paper for more details).

The 🇮🇹 open-source version of ChatGPT!

Discover the capabilities of Fauno and experience the evolution of Italian language models for yourself.

Why the name Fauno?

We started with a model called Baize, named after a legendary creature from Chinese literature. Continuing along this thematic line, we developed our Italian model based on Baize and named it Fauno, inspired by an iconic figure from Roman mythology. This choice underlines the link between the two models, while maintaining a distinctive identity rooted in Italian culture.

🔎 Model's details

Fauno is a fine-tuned version of the LoRA weights of Baize, which is itself an improved version of LLaMA.

We translated and cleaned the Baize data, then fine-tuned the 7B model on a single RTX A6000 (48 GB of VRAM); one epoch took 19 hours.
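To give a rough sense of why LoRA fine-tuning fits on a single GPU, the sketch below counts the trainable parameters a LoRA adapter adds to one weight matrix. The hidden size of 4096 matches LLaMA 7B; the rank of 8 is a hypothetical value chosen for illustration, not a setting confirmed by this README.

```python
def lora_param_count(d_out: int, d_in: int, rank: int) -> int:
    """Parameters in the two low-rank factors B (d_out x rank) and A (rank x d_in)."""
    return rank * (d_out + d_in)


def full_param_count(d_out: int, d_in: int) -> int:
    """Parameters in the full weight matrix W (d_out x d_in)."""
    return d_out * d_in


# Assumption: one 4096 x 4096 attention projection (LLaMA 7B hidden size)
# and a hypothetical LoRA rank of 8.
hidden_size = 4096
rank = 8

lora = lora_param_count(hidden_size, hidden_size, rank)
full = full_param_count(hidden_size, hidden_size)

print(f"LoRA parameters: {lora:,}")
print(f"Full parameters: {full:,}")
print(f"Trainable fraction: {lora / full:.4%}")
```

Under these assumptions the adapter trains well under one percent of the weights in each adapted matrix, while the base weights stay frozen.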

7B Model: https://huggingface.co/andreabac3/Fauno-Italian-LLM-7B
13B Model: https://huggingface.co/andreabac3/Fauno-Italian-LLM-13B
Fauno 30B and 60 are coming soon!

Datasets

We uploaded the four Italian datasets used to train Fauno to Hugging Face.

Did you know that you can run Fauno on the base tier of Colab?

Follow this link to access a Colab notebook with our 7B version!

🖥 Local Usage

Installation

pip3 install -r requirements.txt

Inference

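from peft import PeftModel
from transformers import LlamaForCausalLM, LlamaTokenizer

# Load the base LLaMA 7B weights in 8-bit precision to reduce memory use.
model = LlamaForCausalLM.from_pretrained(
    "decapoda-research/llama-7b-hf",
    load_in_8bit=True,
    device_map="auto",
)
tokenizer = LlamaTokenizer.from_pretrained(
    "decapoda-research/llama-7b-hf", add_eos_token=True
)
# Apply the Fauno LoRA adapter on top of the base model.
model = PeftModel.from_pretrained(model, "andreabac3/Fauno-Italian-LLM-7B")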
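Once the adapter is loaded, a reply can be generated roughly as follows. This is a minimal sketch, not the project's own demo code: `build_prompt` and `generate_reply` are hypothetical helper names, and the `[|Human|]` / `[|AI|]` turn markers assume that Fauno keeps Baize's conversation format, which this README does not state. Check the code in `demo/` for the prompt the authors actually use.

```python
from typing import List, Tuple

# Assumption: Baize-style turn markers; verify against the code in demo/.
HUMAN_TAG = "[|Human|]"
AI_TAG = "[|AI|]"


def build_prompt(history: List[Tuple[str, str]], user_message: str) -> str:
    """Join past (human, ai) turns and the new message into a single prompt."""
    lines = []
    for human, ai in history:
        lines.append(f"{HUMAN_TAG}{human}")
        lines.append(f"{AI_TAG}{ai}")
    lines.append(f"{HUMAN_TAG}{user_message}")
    lines.append(AI_TAG)  # leave the AI turn open for the model to complete
    return "\n".join(lines)


def generate_reply(model, tokenizer, prompt: str, max_new_tokens: int = 256) -> str:
    """Generate a continuation of `prompt` and return only the newly generated text."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Drop the prompt tokens so only the model's reply is decoded.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


if __name__ == "__main__":
    # Building the prompt needs no model, so this part runs anywhere.
    print(build_prompt([], "Qual è la capitale d'Italia?"))
```

With the `model` and `tokenizer` from the snippet above, `generate_reply(model, tokenizer, build_prompt([], "Ciao!"))` would return the model's answer as a string. Note that a tokenizer created with `add_eos_token=True` may append an end-of-sequence token to the prompt, so for generation you may prefer a tokenizer loaded without that flag.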

🔧 Fine-tuning

To fine-tune LLaMA, run the following command:

python3 finetune.py --exp_name all_data --model_size 7b --data_folder data_ITA/
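If you want to launch fine-tuning from a Python script, the command above can be assembled programmatically. This is only a sketch: `build_finetune_command` and `run_finetune` are hypothetical helpers, and they use just the three flags shown above. Whether `finetune.py` accepts other flags or values is not covered here.

```python
import subprocess
import sys
from typing import List


def build_finetune_command(exp_name: str, model_size: str, data_folder: str) -> List[str]:
    """Return the argv list for finetune.py, using only the flags shown in the README."""
    return [
        sys.executable, "finetune.py",
        "--exp_name", exp_name,
        "--model_size", model_size,
        "--data_folder", data_folder,
    ]


def run_finetune(exp_name: str, model_size: str, data_folder: str) -> None:
    """Run finetune.py from the repository root; raises CalledProcessError on failure."""
    subprocess.run(build_finetune_command(exp_name, model_size, data_folder), check=True)


if __name__ == "__main__":
    # Dry run: print the command instead of starting a multi-hour training job.
    print(" ".join(build_finetune_command("all_data", "7b", "data_ITA/")))
```

Passing the arguments as a list avoids shell quoting issues if the experiment name or data folder contains spaces.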

🌐 Website

cd demo/ && python3 app.py decapoda-research/llama-7b-hf andreabac3/Fauno-Italian-LLM-7B

📖 Cite our work

We are delighted to announce that the Fauno paper has been accepted at IIR 2023. If you use our work, our translated datasets, or our model weights in your research, please cite us.

@article{bacciu2023fauno,
  title={Fauno: The Italian Large Language Model that will leave you senza parole!},
  author={Bacciu, Andrea and Trappolini, Giovanni and Santilli, Andrea and Rodol{\`a}, Emanuele and Silvestri, Fabrizio},
  journal={arXiv preprint arXiv:2306.14457},
  year={2023}
}

🔑 License

This project is a derivative of Baize, and we adhere to the licensing constraints imposed by both Baize's creators and the authors of LLaMA.

⚠️ Hallucinations

It is important to note that current-generation models are prone to hallucinations, so we advise you not to rely on their answers without verifying them.

Who are we?

We are RSTLess, a research group focused on Robustness, Safety, and Transparency in multiple fields such as Machine Learning, LLMs and NLP, Graph Learning, Web Search, and Recommendation Systems.

👏 Acknowledgement

LLaMA - Meta AI: https://github.com/facebookresearch/llama
Baize: https://github.com/project-baize/baize-chatbot
Stanford Alpaca: https://github.com/tatsu-lab/stanford_alpaca
Camoscio: https://github.com/teelinsan/camoscio
