Skip to content

ionet-official/io-ray-serve-chat-demo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

io-ray-serve-chat-demo

a Ray Serve Chat Demo Serving Hugging Face Models

How to get started

  1. Open Up io.net account

  2. Follow through standard procedure on launching a Ray Cluster. Select a small cluster, for example 4 T4.

  3. When the cluster is ready, select Visual Studio Code (VSCode) IO Cloud Ray Cluster T4

  4. Launch Visual studio code terminal and clone this repo

git clone https://github.com/ionet-official/io-ray-serve-chat-demo.git
  1. Go to the folder
cd  io-ray-serve-chat-demo
  1. Start the chat server via
serve run chat.yaml

Serve run chat

  1. Wait till the Ray serve deploys the chat app across workers. You will see on the terminal a "Model loaded" message.
  2. Test your Chatbot from the cluster. Open a new terminal and run the sample chat client
python chat_client.py

chat client example

  1. Test your Chatbot server endpoint from outside the Cluster
    1. Server endpoint: https://exposed-service-[YOUR-CLUSTER-SUFFIX].tunnels.io.systems/
    2. If your cluster suffix is 1d47a, then: https://exposed-service-1d47a.tunnels.io.systems/
    3. One way to identify your prefix is from the the VSCode URL, which looks like https://vscode-1d47a.tunnels.io.systems/
    4. You can use below code snippet to interact with the Ray serve application created (update the endpoint to your server)
import requests
SERVER_ENDPOINT = "https://exposed-service-1d47a.tunnels.io.systems/"
message = "What is the capital of France?"
history = []
response = requests.post(SERVER_ENDPOINT, json={"user_input": message, "history": history})
print(response.json())

or on a terminal:

curl -X POST https://exposed-service-1d47a.tunnels.io.systems/ \
-H "Content-Type: application/json" \
-d '{"user_input": "What is the capital of France?", "history": []}'

About

a Ray Serve Chat Demo Serving Hugging Face Models

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages