Use mozilla/readability to extract the text content of webpages #84
tomasgvivo
started this conversation in
Ideas
Replies: 1 comment
-
Already testing |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I would like the extension to include the websites content, not only the headlines.
The package https://github.com/mozilla/readability allows you to extract the text from a website for readability (this is what Firefox uses on it's "reader mode"). This may simplify the process of extracting the texts from the websites, but obviously you will get to the problem of token limit for a message.
After reading other issues, I think that if you divide the process of accessing the web in various steps/prompts you might be able to avoid that limit by separating the results in multiple messages.
A couple of days ago, I tried something like this:
At first it worked and responded with "..." after the first text to ingest, but after the second text, it jumped to conclusions and tried to give an opinion about the text I provided. I think this is just a problem with the initial prompt and after some tweaks it should work most of the times.
So what I imagine is:
Beta Was this translation helpful? Give feedback.
All reactions