
Added Groq LLM model support #1447

Closed
Conversation

paresh2806
Contributor

@paresh2806 paresh2806 commented Jul 9, 2024

What problem does this PR solve?

This PR adds support for the Groq LLM (Large Language Model).

Groq is an AI solutions company delivering ultra-low latency inference with the first-ever LPU™ Inference Engine. The Groq API enables developers to integrate state-of-the-art LLMs, such as Llama-2 and llama3-70b-8192, into low latency applications with the request limits specified below. Learn more at groq.com.

| ID                 | Requests per Minute | Requests per Day | Tokens per Minute |
|--------------------|---------------------|------------------|-------------------|
| gemma-7b-it        | 30                  | 14,400           | 15,000            |
| gemma2-9b-it       | 30                  | 14,400           | 15,000            |
| llama3-70b-8192    | 30                  | 14,400           | 6,000             |
| llama3-8b-8192     | 30                  | 14,400           | 30,000            |
| mixtral-8x7b-32768 | 30                  | 14,400           | 5,000             |
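For context, Groq exposes an OpenAI-compatible chat-completions endpoint, so integrating it typically amounts to pointing a standard chat client at Groq's base URL with a Groq API key. A minimal sketch of such a call is below; the `build_payload` helper and the `GROQ_API_KEY` environment variable name are illustrative assumptions, not code from this PR.

```python
import json
import os
import urllib.request

# Groq's OpenAI-compatible chat-completions endpoint.
GROQ_CHAT_URL = "https://api.groq.com/openai/v1/chat/completions"


def build_payload(model: str, prompt: str, temperature: float = 0.7) -> dict:
    """Build an OpenAI-style chat-completion request body."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": temperature,
    }


def chat(prompt: str, model: str = "llama3-70b-8192") -> str:
    """Send one chat turn to Groq; expects GROQ_API_KEY in the environment."""
    req = urllib.request.Request(
        GROQ_CHAT_URL,
        data=json.dumps(build_payload(model, prompt)).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {os.environ['GROQ_API_KEY']}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    # Response follows the OpenAI schema: first choice, assistant message.
    return body["choices"][0]["message"]["content"]
```

Because the request/response shape matches OpenAI's, a provider abstraction that already supports OpenAI can often reuse the same client code and swap only the base URL and key.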

Type of change

  • New Feature
  • Other (added Groq LLM model support)

@paresh2806
Contributor Author

@JinHai-CN & @KevinHuSh

Should I add support for Gemini APIs to model providers in the same pull request, or should I create a new pull request for it?

@KevinHuSh
Collaborator

> @JinHai-CN & @KevinHuSh
>
> Should I add support for Gemini APIs to model providers in the same pull request, or should I create a new pull request for it?

Gemini's already supported.

@KevinHuSh
Collaborator

> @JinHai-CN & @KevinHuSh
>
> Should I add support for Gemini APIs to model providers in the same pull request, or should I create a new pull request for it?

Why closed?

@paresh2806
Contributor Author

paresh2806 commented Jul 11, 2024 via email

I was in the middle of merging conflicts.

KevinHuSh added a commit that referenced this pull request Jul 12, 2024
#1432  #1447 
This PR adds support for the GROQ LLM (Large Language Model).

Groq is an AI solutions company delivering ultra-low latency inference
with the first-ever LPU™ Inference Engine. The Groq API enables
developers to integrate state-of-the-art LLMs, such as Llama-2 and
llama3-70b-8192, into low latency applications with the request limits
specified below. Learn more at [groq.com](https://groq.com/).
Supported Models


| ID                 | Requests per Minute | Requests per Day | Tokens per Minute |
|--------------------|---------------------|------------------|-------------------|
| gemma-7b-it        | 30                  | 14,400           | 15,000            |
| gemma2-9b-it       | 30                  | 14,400           | 15,000            |
| llama3-70b-8192    | 30                  | 14,400           | 6,000             |
| llama3-8b-8192     | 30                  | 14,400           | 30,000            |
| mixtral-8x7b-32768 | 30                  | 14,400           | 5,000             |

---------

Co-authored-by: paresh0628 <[email protected]>
Co-authored-by: Kevin Hu <[email protected]>
Halfknow pushed a commit to Halfknow/ragflow that referenced this pull request Nov 11, 2024
infiniflow#1432  infiniflow#1447 