-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Internet search #10
Comments
Integrating search into the chat while preserving user privacy is no small task. I've broken down my thoughts about the relevant challenges and how we could implement the feature. But first a quick recap on how search is integrated into an AI chat.
Now, let's evaluate the privacy challenges with this approach: Fetching web page contents: The challenge here is twofold. We are fetching the content of the returned web pages. This implies a potential exposure to the various site trackers or cookies. Sadly, we cannot do a lot of things against the tracker on the web pages. Also there is the issue of metadata leakage that can reveal which websites are queried (by inspecting the network of the service fetching the content). Here we have basically two options we can do the search locally or in an enclave. Option 1 : Local implementation of the search ? Option 2: Use an enclave |
Interesting insights @clauverjat My feedback:
My assumption is that given that people already use web search a lot, both for professional and personal, and that good privacy solutions exist through VPNs as you mentioned or DuckDuckGo, the question is more: can the use of our service with web browser expose data to us, and in some way to those other services we rely on. If we assume we just do synthesis generation and the user gets the content to be synthesized by the LLM in an enclave, then there is no more exposure to us than usual. I think the easiest way to move forward is to ask our community what they want for search, and what they want to protect from whom but good mapping :) |
No description provided.
The text was updated successfully, but these errors were encountered: