[Feature Request]: Add Groq to model providers #1432
KevinHuSh added a commit that referenced this issue on Jul 12, 2024:
#1432 #1447 This PR adds support for the Groq LLM provider. Groq is an AI solutions company delivering ultra-low-latency inference with the first-ever LPU™ Inference Engine. The Groq API enables developers to integrate state-of-the-art LLMs, such as Llama-2 and llama3-70b-8192, into low-latency applications, subject to the request limits below. Learn more at [groq.com](https://groq.com/).

Supported models:

| ID | Requests per Minute | Requests per Day | Tokens per Minute |
|--------------------|---------------------|------------------|-------------------|
| gemma-7b-it | 30 | 14,400 | 15,000 |
| gemma2-9b-it | 30 | 14,400 | 15,000 |
| llama3-70b-8192 | 30 | 14,400 | 6,000 |
| llama3-8b-8192 | 30 | 14,400 | 30,000 |
| mixtral-8x7b-32768 | 30 | 14,400 | 5,000 |

---------
Co-authored-by: paresh0628 <[email protected]>
Co-authored-by: Kevin Hu <[email protected]>
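For reference, a minimal sketch of calling one of the supported models through the official `groq` Python SDK (`pip install groq`); the chosen model ID, prompt, and reliance on a `GROQ_API_KEY` environment variable are illustrative assumptions, not part of this PR:

```python
import os

from groq import Groq

# The SDK also reads GROQ_API_KEY from the environment by default;
# it is passed explicitly here for clarity.
client = Groq(api_key=os.environ["GROQ_API_KEY"])

# One chat completion against a model ID from the table above.
response = client.chat.completions.create(
    model="llama3-70b-8192",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Explain what an LPU is in one sentence."},
    ],
    max_tokens=256,
)

print(response.choices[0].message.content)
```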
Halfknow pushed a commit to Halfknow/ragflow that referenced this issue on Nov 11, 2024 (same commit message as above).
**Is there an existing issue for the same feature request?**

**Is your feature request related to a problem?**
No response

**Describe the feature you'd like**
Add Groq to model providers

**Describe implementation you've considered**
Integrate it the same way Ollama or XInference are integrated to serve and manage models.
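Because Groq exposes an OpenAI-compatible endpoint, one plausible integration path is a thin provider wrapper like those used for other backends. The sketch below is hypothetical: the class and method names are not RAGFlow's actual interface, and only the `https://api.groq.com/openai/v1` base URL and the standard `openai` SDK calls are taken as given.

```python
from openai import OpenAI


class GroqChatProvider:
    """Hypothetical wrapper routing chat requests to Groq's
    OpenAI-compatible API, mirroring an Ollama/XInference-style provider."""

    def __init__(self, api_key: str, model: str = "llama3-8b-8192"):
        self.model = model
        self.client = OpenAI(
            api_key=api_key,
            base_url="https://api.groq.com/openai/v1",
        )

    def chat(self, messages: list[dict], **gen_conf) -> str:
        # gen_conf passes through sampling options such as
        # temperature or max_tokens.
        resp = self.client.chat.completions.create(
            model=self.model, messages=messages, **gen_conf
        )
        return resp.choices[0].message.content
```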
**Documentation, adoption, use case**
No response

**Additional information**
No response