Add stuff-summarisation using runnable #558

andy-symonds · 2024-06-11T22:40:04Z

Context

First pass as adding stuff summarisation functionality using Langchain Expression Language (LCEL) runnables.

Changes proposed in this pull request

I pulled in changs from REDBOX 337 chat file selection #556 and from Turning RAG into a runnable function #554
Keep a single build chain method in chat.py which uses the routing layer to decide with chain to build.
Pulled out specific build_'x'_chains into a separate file, called build_chains.py. In build_chains.py there is the original build_vanilla_chain and build_retrieval_chain (both of which we will want to change to LCEL runnables as soon as possible, but that is not the focus of this PR)
The ROUTING_RESPONSES dict has been updated, so that if the summarisation route is chosen, the build_chain selected is build_stuff_chain. The idea behind this is it would be extensive to many different types of chains mapped to different routes in future.
This is a draft! So build_stuff_chain is not yet complete. It uses the make_stuff_document_runnable. @wpfl-dbt could you add retrieval of the documents for summarisation?

Guidance to review

Not ready for PR review yet - first pass. Please could you sense check all of this!
I was trying to figure this out and seeing what would work, so have not got to tests yet. Sharing this early, so you can pick it up in the morning :)

Relevant links

Things to check

I have added any new ENV vars in all deployed environments
I have tested any code added or changed
I have run integration tests

…e through the chat view.

…d-stuff-summarisation Pulling in work from REDBOX-337 so I can unblock myself developing summarisation

wpfl-dbt

I initially wasn't sold on the direction, but the more I think about it, the more I like it.

Overall, I think:

We should have a principle of one build_* function per route (even the mundane ones), and these functions should use common keyword arguments and datatypes, and identical output shapes
This enables build_chain to focus on what it's really about -- routing logic -- because it makes chain building trivial
The summary chain needs work
If Turning RAG into a runnable function #554 gets in before this PR, why not bring LCEL on the original runnables into this PR? As I see it, this is the "everything's in, plumb it up" PR

core_api/src/routes/chat.py

wpfl-dbt · 2024-06-12T03:55:35Z

core_api/src/routes/chat.py

+        if callable(route_response):
+            # if route_response is not None:
+            build_chain = route_response
+            chain, params = await build_chain(


See my previous comment about **kwargs. Here you could then safely do...

build_chain( **{ "chat_request": chat_request, "user_uuid": user_uuid, "llm": llm, "vector_store": vector_store, "storage_handler": storage_handler } )

...content in the knowledge it'll work for every single build_* function. No more treating the chat function differently, no more worrying about how to handle future use cases!

wpfl-dbt · 2024-06-12T04:06:40Z

core_api/src/build_chains.py

+async def build_stuff_chain(
+    chat_request: ChatRequest,
+    user_uuid: UUID,
+    llm: ChatLiteLLM,
+    vector_store: ElasticsearchStore,
+):


-> tuple[Runnable, dict[str, Any]] on all these to help understand what they're doing

wpfl-dbt · 2024-06-12T04:12:59Z

core_api/src/build_chains.py

+async def build_vanilla_chain(
+    chat_request: ChatRequest,
+) -> ChatPromptTemplate:


Add **kwargs to all these function signatures

Ensure (as you currently have) that when they use a common keyword argument, it takes the same data

Now you can call them all with unpacked dictionaries of keyword arguments

This means that build_vanilla_chain can be called in just the same way as all the other build_* functions even though it only needs a fraction of the arguments.

wpfl-dbt · 2024-06-12T04:23:20Z

core_api/src/build_chains.py

+    chat_request: ChatRequest,
+    user_uuid: UUID,
+    llm: ChatLiteLLM,
+    vector_store: ElasticsearchStore,


Suggest generalising to build_summary_chain -- I think these build_* functions should match the routes even if they wrangle multiple runnables to do so.

See summarise() in the summarise notebook for my take on this function. You're going to need a storage_handler: ElasticsearchStorageHandler arg so you can retrieve the documents to summarise using core_api.src.format.get_file_chunked_to_tokens().

wpfl-dbt · 2024-06-12T04:25:20Z

core_api/src/build_chains.py

+    params = {
+        "question": question,
+        "content": context,
+        "messages": [(msg.role, msg.text) for msg in previous_history],
+    }


See the unit test for what this should be (and actually a good example of what you need in this function overall)

wpfl-dbt · 2024-06-12T04:26:10Z

core_api/src/routes/chat.py

+        # elif route_response is a Runnable
+        if callable(route_response):
+            # if route_response is not None:
+            build_chain = route_response


This overwrites its own function name -- change it

wpfl-dbt · 2024-06-12T04:26:34Z

core_api/src/routes/chat.py

-
-    return docs_with_sources_chain, params
-
-
 async def build_chain(


Get an output signature on this.

I also think a name like route_input_to_chain() would be more helpful to understand what's going on here.

As I suggest above, I think this function's main role is to deal with routing logic. Getting from route -> chain can be made trivial, especially in a-build_-function-per-route world.

wpfl-dbt · 2024-06-12T04:49:30Z

core_api/src/routes/chat.py

@@ -54,72 +56,13 @@
    "ability": ChatPromptTemplate.from_template(ABILITY_RESPONSE),


⚠️ I am 50/50 on the quality of this suggestion

Imo make this a nice consistent dict[str, Callable] -- make all values functions. Ways you might tackle this:

build_precanned_chain(response: ChatPromptTemplate, ...), and functools.partial a version with the response filled into each route

Start the pattern of "one build_* function per route" on the principle that it makes them all extensible, and most of them will one day need to be

I like this because it means in the function that deals with routing, once you've got the route name, you can just

... return ROUTE_RESPONSES.get(route.name)( **{ "chat_request": chat_request, "user_uuid": user_uuid, "llm": llm, "vector_store": vector_store, "storage_handler": storage_handler } )

This means that function can focus on the routing logic, which will get more complex as user override is added, and the "what chain do I need" stuff is moved out of the way.

Tbh, if you got to the point where the above was possible, consider changing build_chain to pure routing logic, returning a string, because that little snippet would be all you needed to connect routes with the configured runnables you need. Just put it in the endpoint directly.

As I say, 50/50 on this one...

Although strap in for mypy's take on that code...

… WIP, as needs integrating with streaming

wpfl-dbt · 2024-06-13T14:36:04Z

Merged as part of #570

brunns and others added 30 commits June 7, 2024 12:49

Add ChatMessage many-to-many relation to File, and make them availabl…

faed993

…e through the chat view.

Show selected files on chats page.

3add66a

Save selected file, and send to core, for streamed chat version.

1b8e317

Use checkboxes for selecting files

f7ff925

Move "Files to use" to sidebar

3eb716f

Tests for saving & sending selected files.

bffcb44

Setup document selection for streaming client-side

cf63656

Save selected file, and send to core, for streamed chat version.

2c913e0

Enable chat streaming in all tests.

c4aa4f3

add eval results visualisation and calculate uncertainity

8c254cc

remove inline outputs

6b14a74

Address Will's PR comments

e8562eb

Remove streaming demo

f842c3e

Save non-rag responses to DB - tactical fix.

26b199d

Revert tactical fix - we are doing it properly here.

99252a2

Recieve selected file list in core API for streaming.

0f1bfe6

Merge branch 'main' into feature/REDBOX-337-chat-file-selection

4fe6d67

Merge branch 'main' into feature/REDBOX-337-chat-file-selection

6fab42f

Add selected files to e2e tests.

0a5c6aa

Bug - ensure latest question is always the one answered.

f7d555a

Merge branch 'main' into feature/REDBOX-337-chat-file-selection

3f9c028

Unit tests not working but the core plumbing is there

117b152

Pulled changes to core-api only

43cef05

Reverting to chat from main

d1df1ca

Reverting to test chat from main

f45efec

Merge branch 'main' into feature/REDBOX-337-chat-file-selection

00f8d44

Post merge formatting.

aa66c80

wip

bfb193e

test now passing

9a4a3bb

Working runnables and unit tests

c7e68d2

andy-symonds added 4 commits June 11, 2024 21:03

Merge branch 'feature/REDBOX-337-chat-file-selection' into feature/ad…

0725140

…d-stuff-summarisation Pulling in work from REDBOX-337 so I can unblock myself developing summarisation

resolved conflicts

66bf852

first pass at adding stuff summarisation using runnables

2496854

wip

0c2edb7

wpfl-dbt reviewed Jun 12, 2024

View reviewed changes

andy-symonds and others added 6 commits June 12, 2024 14:32

[REDBOX-324] | WL, GB, AS | Set up stuff summarisation for streaming.…

463b72c

… WIP, as needs integrating with streaming

stuff

bb98c8a

Fix chunk retrieval

bad7251

More fixes to the build stuff chain

4d90b97

Fixed unit tests

db63aed

reverted django change

ac115d0

wpfl-dbt mentioned this pull request Jun 12, 2024

Added summarisation to core api [rebase 2] #570

Merged

3 tasks

wpfl-dbt closed this Jun 13, 2024

gecBurton deleted the feature/add-stuff-summarisation branch July 15, 2024 07:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add stuff-summarisation using runnable #558

Add stuff-summarisation using runnable #558

andy-symonds commented Jun 11, 2024

wpfl-dbt left a comment •

edited

Loading

wpfl-dbt Jun 12, 2024

wpfl-dbt Jun 12, 2024

wpfl-dbt Jun 12, 2024

wpfl-dbt Jun 12, 2024

wpfl-dbt Jun 12, 2024

wpfl-dbt Jun 12, 2024

wpfl-dbt Jun 12, 2024

wpfl-dbt Jun 12, 2024 •

edited

Loading

wpfl-dbt Jun 12, 2024

wpfl-dbt commented Jun 13, 2024


		return docs_with_sources_chain, params


		async def build_chain(

		@@ -54,72 +56,13 @@
		"ability": ChatPromptTemplate.from_template(ABILITY_RESPONSE),

Add stuff-summarisation using runnable #558

Add stuff-summarisation using runnable #558

Conversation

andy-symonds commented Jun 11, 2024

Context

Changes proposed in this pull request

Guidance to review

Relevant links

Things to check

wpfl-dbt left a comment • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

wpfl-dbt Jun 12, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

wpfl-dbt commented Jun 13, 2024

wpfl-dbt left a comment •

edited

Loading

wpfl-dbt Jun 12, 2024 •

edited

Loading