Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Samples] merge LLM samples to "text_generation" folder #1411

Open
wants to merge 5 commits into
base: master
Choose a base branch
from

Conversation

olpipi
Copy link
Collaborator

@olpipi olpipi commented Dec 19, 2024

No description provided.

@github-actions github-actions bot added category: GHA CI based on Github actions category: cmake / build Cmake scripts category: samples GenAI samples labels Dec 19, 2024
@olpipi olpipi force-pushed the samples_movement branch 2 times, most recently from 9e7f861 to 3901fbb Compare December 19, 2024 11:22
samples/cpp/text_generation/CMakeLists.txt Outdated Show resolved Hide resolved
samples/cpp/text_generation/README.md Outdated Show resolved Hide resolved
samples/cpp/text_generation/README.md Outdated Show resolved Hide resolved
samples/cpp/text_generation/README.md Outdated Show resolved Hide resolved
- **Main Feature:** Demonstrates simple text continuation.
- **Run Command:**
```bash
./text_generation -m <model> -i "Hello, how are you?" -d CPU
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  1. If it's an example command, it must be specific, replace <model> with TinyLlama. If it's a help message, align it with what the sample would print.
  2. There's no text_generation target.
  3. The samples don't take device as an arg. They also don't have named args

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If it's a help message, align it with what the sample would print.

Align them to match exactly. For example greedy would print greedy_causal_lm <MODEL_DIR> "<PROMPT>" (remove Usage: because you have *Run Command:).

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

changed them according to your comments

samples/cpp/text_generation/README.md Outdated Show resolved Hide resolved
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same must be done for python

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I suggest to apply all comments first. And then make similar changes to python samples (or in the next PR)

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I suggest to apply all comments first. And then make similar changes to python samples (or in the next PR)

Please change the structure of cpp sample and python in the same PR. It will be more convenient to update OpenVINO documentation and other public materials, which reference these samples.

samples/cpp/text_generation/README.md Show resolved Hide resolved
@ilya-lavrenov ilya-lavrenov changed the title Samples movement [Samples] merge LLM samples to "text_generation" folder Dec 20, 2024
COMPILE_PDB_NAME prompt_lookup_decoding_lm
# Ensure out of box LC_RPATH on macOS with SIP
INSTALL_RPATH_USE_LINK_PATH ON)
# Don't install prompt_lookup_decoding_lm because it doesn't use openvino_genai library and is not verified yet.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it's obsolete information and now prompt_lookup_decoding_lm uses GenAI.
Please, uncomment this code.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

@@ -2,14 +2,8 @@
# SPDX-License-Identifier: Apache-2.0
#

add_subdirectory(cpp/beam_search_causal_lm)
add_subdirectory(cpp/benchmark_genai)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

added


This example showcases inference of text-generation Large Language Models (LLMs): `chatglm`, `LLaMA`, `Qwen` and other models with the same signature. The application doesn't have many configuration options to encourage the reader to explore and modify the source code. For example, change the device for inference to GPU. The sample fearures `ov::genai::LLMPipeline` and configures it to run the simplest deterministic greedy sampling algorithm. There is also a Jupyter [notebook](https://github.com/openvinotoolkit/openvino_notebooks/tree/latest/notebooks/llm-chatbot) which provides an example of LLM-powered Chatbot in Python.
# OpenVINO AI Text Generation Samples
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
# OpenVINO AI Text Generation Samples
# OpenVINO GenAI Text Generation Samples

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done


These samples showcase the use of OpenVINO's inference capabilities for text generation tasks, including different decoding strategies such as beam search, multinomial sampling, and speculative decoding. Each sample has a specific focus and demonstrates a unique aspect of text generation.
The applications don't have many configuration options to encourage the reader to explore and modify the source code. For example, change the device for inference to GPU.
There is also a Jupyter [notebook](https://github.com/openvinotoolkit/openvino_notebooks/tree/latest/notebooks/llm-chatbot) that provides an example of LLM-powered text generation in Python.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we have multiple text generation notebooks in openvino_notebooks?
If yes, we need per-sample reference to notebook

CC @eaidova @sbalandi

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

## Table of Contents
1. [Download and Convert the Model and Tokenizers](#download-and-convert-the-model-and-tokenizers)
2. [Running the Samples](#running-the-samples)
3. [Using encrypted models](#using-encrypted-models)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this section is missed.

BTW, it not a dedicated section, it's also just a sample as others.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fixed. This was a legacy section from old revision

@@ -13,32 +24,99 @@ pip install --upgrade-strategy eager -r ../../requirements.txt
optimum-cli export openvino --trust-remote-code --model TinyLlama/TinyLlama-1.1B-Chat-v1.0 TinyLlama-1.1B-Chat-v1.0
```

## Run
## Running the Samples
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Run Command in each sample already demonstrates how to run

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fixed

@@ -13,32 +24,99 @@ pip install --upgrade-strategy eager -r ../../requirements.txt
optimum-cli export openvino --trust-remote-code --model TinyLlama/TinyLlama-1.1B-Chat-v1.0 TinyLlama-1.1B-Chat-v1.0
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I suppose we cannot use just one model for all use cases.

For chat sample - chat model
For typical generation - either instruct or typical model
For speculative decoding we need 2 models and it's not shown at all what models to use

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

added model examples:
chat_sample - meta-llama/Llama-2-7b-chat-hf
speculative_decoding_lm - meta-llama/Llama-2-13b-hf as main model and TinyLlama/TinyLlama-1.1B-Chat-v1.0 as draft model
other samples - meta-llama/Llama-2-7b-hf

github-merge-queue bot pushed a commit to openvinotoolkit/openvino that referenced this pull request Jan 3, 2025
github-merge-queue bot pushed a commit to openvinotoolkit/openvino that referenced this pull request Jan 3, 2025
@olpipi
Copy link
Collaborator Author

olpipi commented Jan 3, 2025

If there is no more major comments, I will make the similar changes to python samples

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
category: cmake / build Cmake scripts category: GHA CI based on Github actions category: samples GenAI samples no-match-files
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants