Update samples for the In-Memory VectorStore Driver #110

joehu21 · 2024-07-10T20:36:25Z

Updated samples for the In-Memory VectorStore Driver, and added the missing PDF test file. Manually tested with MemoryDB 7.1.1 cluster.

samples/inmemory/retriever.ipynb

3coins · 2024-07-11T17:44:38Z

samples/inmemory/retriever.ipynb

    "from langchain_aws.embeddings import BedrockEmbeddings\n",
-    "from langchain_aws.llms.bedrock import Bedrock\n",
+    "from langchain_aws.llms.bedrock import BedrockLLM\n",


Can simplify the import.

from langchain_aws.llms import BedrockLLM

Also, why are we not using the ChatBedrock class?

Tried changing it to ChatBedrock, but the RetrievalQA code below failed. I think the code below also needs to change. Will investigate when I get some free time. It is a learning opportunity for me. We can leave it as is for now.

qa_prompt = RetrievalQA.from_chain_type( llm=llm, chain_type="stuff", retriever=vds.as_retriever(), return_source_documents=True, chain_type_kwargs={"prompt": PROMPT}, #verbose="true" ) query = "How do i create a MemoryDB cluster" result = qa_prompt({"query": query})

RetrievalQA chain has been deprecated. Use this instead.

from langchain.chains import create_retrieval_chain from langchain.chains.combine_documents import create_stuff_documents_chain from langchain_core.prompts import ChatPromptTemplate from langchain_aws import ChatBedrock retriever = ... # Your retriever llm = ChatBedrock(...) system_prompt = ( "Use the given context to answer the question. " "If you don't know the answer, say you don't know. " "Use three sentence maximum and keep the answer concise. " "Context: {context}" ) prompt = ChatPromptTemplate.from_messages( [ ("system", system_prompt), ("human", "{input}"), ] ) question_answer_chain = create_stuff_documents_chain(llm, prompt) chain = create_retrieval_chain(retriever, question_answer_chain) chain.invoke({"input": query})

Still getting the same error:

ValueError: Error raised by bedrock service: An error occurred (ValidationException) when calling the InvokeModel operation: Malformed input request: #: subject must not be valid against schema {"required":["messages"]}#: extraneous key [max_tokens_to_sample] is not permitted, please reformat your input and try again.

code:

system_prompt = ( "Use the given context to answer the question. " "If you don't know the answer, say you don't know. " "Use three sentence maximum and keep the answer concise. " "Context: {context}" ) prompt = ChatPromptTemplate.from_messages( [ ("system", system_prompt), ("human", "{input}"), ] ) question_answer_chain = create_stuff_documents_chain(llm, prompt) chain = create_retrieval_chain(retriever, question_answer_chain) query = "How do i create a MemoryDB cluster?" chain.invoke({"input": query})

Thanks for the offline help. Pushed the 3rd commit to change to ChatBedrock and ChatPromptTemplate.

3coins · 2024-07-11T17:46:47Z

samples/inmemory/retriever.ipynb

    "from langchain_aws.embeddings import BedrockEmbeddings\n",
-    "from langchain_aws.llms.bedrock import Bedrock\n",
+    "from langchain_aws.llms.bedrock import BedrockLLM\n",
    "load_dotenv()"


See my comment above. Even if you want to use the dotenv, the correct syntax would be to use the magics in a new cell. You don't need the import above for this.

%load_ext dotenv %dotenv

Good to know.

3coins · 2024-07-11T17:48:33Z

samples/inmemory/retriever.ipynb

-    "        model_id=\"anthropic.claude-v2\", #use the Anthropic Claude model\n",
+    "# use the Anthropic Claude model\n",
+    "llm = BedrockLLM(\n",
+    "        model_id=\"anthropic.claude-v2\",\n",


Users are most likely to use claude-3 models.

Tried to change it to claude-3, but it fails:

ValueError: Error raised by bedrock service: An error occurred (ValidationException) when calling the InvokeModel operation: The provided model identifier is invalid.

Here's my test:

from langchain_aws import BedrockLLM # initialize the Bedrock LLM llm = BedrockLLM( model_id="anthropic.claude-v3" ) prompt = "What is the largest city in Vermont?" # return a response to the prompt response = llm.invoke(prompt) print(response)

Any idea why the above fails?

BTW, langchain-aws README still uses claude-2.

That model id seems wrong. You can use one of these.

anthropic.claude-3-haiku-20240307-v1:0 anthropic.claude-3-opus-20240229-v1:0 anthropic.claude-3-sonnet-20240229-v1:0

Also, can you use ChatBedrock.

samples/inmemory/retriever.ipynb

Update samples for the In-Memory VectorStore Driver

1d27a16

3coins reviewed Jul 11, 2024

View reviewed changes

Joe Hu added 3 commits July 12, 2024 04:37

Clear outputs of all cells. Remove unused code.

ee283c9

Change to ChatBedrock and ChatPromptTemplate

5b4a60c

Remove the old code of RetrievalQA

70493d9

3coins approved these changes Jul 26, 2024

View reviewed changes

3coins merged commit 767fa5a into langchain-ai:main Jul 26, 2024
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update samples for the In-Memory VectorStore Driver #110

Update samples for the In-Memory VectorStore Driver #110

joehu21 commented Jul 10, 2024 •

edited

Loading

3coins Jul 11, 2024

joehu21 Jul 12, 2024

3coins Jul 12, 2024

joehu21 Jul 13, 2024

joehu21 Jul 16, 2024

3coins Jul 11, 2024

joehu21 Jul 12, 2024

3coins Jul 11, 2024

joehu21 Jul 12, 2024

3coins Jul 12, 2024

joehu21 Jul 16, 2024

Update samples for the In-Memory VectorStore Driver #110

Update samples for the In-Memory VectorStore Driver #110

Conversation

joehu21 commented Jul 10, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

joehu21 commented Jul 10, 2024 •

edited

Loading