Markdowner ⚡📝

A fast tool to convert any website into LLM-ready markdown data.

👀 Why?

I'm building an AI app called Supermemory - https://git.new/memory. Where users can store website content in the app and then query it using AI. One thing I noticed was - when data is structured and predictable (in markdown format), the LLM responses are much better.

There are other solutions available for this - https://r.jina.ai, https://firecrawl.dev, etc. But they are either:

too expensive / proprietary
or too limited.
very difficult to deploy

Here's a quote from my friend @nexxeln

So naturally, we fix it ourselves ⚡

Features 🚀

Convert any website into markdown
LLM Filtering
Detailed markdown mode
Auto Crawler (without sitemap!)
Text and JSON responses
Easy to self-host
... All that and more, for FREE!

Usage

To use the API, just make GET a request to https://md.dhr.wtf

Usage example:

$ curl 'https://md.dhr.wtf/?url=https://example.com'

REQUIRED PARAMETERS

url (string) -> The website URL to convert into markdown.

OPTIONAL PARAMETERS

enableDetailedResponse (boolean: false) -> Toggle for detailed response with full HTML content. crawlSubpages (boolean: false) -> Crawl and return markdown for up to 10 subpages. llmFilter (boolean: false) -> Filter out unnecessary information using LLM.

Response Types

Add Content-Type: text/plain in headers for plain text response. Add Content-Type: application/json in headers for JSON response.

Tech

Under the hood, Markdowner utilises Cloudflare's Browser rendering and Durable objects to spin up browser instances and then convert it to markdown using Turndown.

Self hosting

You can easily self host this project. To use the browser rendering and Durable Objects, you need the Workers paid plan

Clone the repo and download dependencies

git clone https://github.com/dhravya/markdowner
npm i

Run this command:

npx wrangler kv:namespace create md_cache

Open Wrangler.toml and change the IDs accordingly
Run npm run deploy
That's it 👍

Support

Support me by simply starring this repository! ⭐

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
src		src
.editorconfig		.editorconfig
.gitignore		.gitignore
.prettierrc		.prettierrc
LICENSE		LICENSE
README.md		README.md
code_of_conduct.md		code_of_conduct.md
package.json		package.json
tsconfig.json		tsconfig.json
worker-configuration.d.ts		worker-configuration.d.ts
wrangler.toml		wrangler.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Markdowner ⚡📝

👀 Why?

Features 🚀

Usage

REQUIRED PARAMETERS

OPTIONAL PARAMETERS

Response Types

Tech

Self hosting

Support

About

Releases

Packages

Languages

License

supermemoryai/markdowner

Folders and files

Latest commit

History

Repository files navigation

Markdowner ⚡📝

👀 Why?

Features 🚀

Usage

REQUIRED PARAMETERS

OPTIONAL PARAMETERS

Response Types

Tech

Self hosting

Support

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages