🤖 Chat with PDF locally using Ollama + LangChain

A powerful local RAG (Retrieval Augmented Generation) application that lets you chat with your PDF documents using Ollama and LangChain. This project includes both a Jupyter notebook for experimentation and a Streamlit web interface for easy interaction.

📺 Video Tutorial

✨ Features

🔒 Fully local processing - no data leaves your machine
📄 PDF processing with intelligent chunking
🧠 Multi-query retrieval for better context understanding
🎯 Advanced RAG implementation using LangChain
🖥️ Clean Streamlit interface
📓 Jupyter notebook for experimentation

🚀 Getting Started

Prerequisites

Install Ollama
- Visit Ollama's website to download and install
- Pull required models:
```
ollama pull llama2  # or your preferred model
ollama pull nomic-embed-text
```

Clone Repository

git clone https://github.com/tonykipkemboi/ollama_pdf_rag.git
cd ollama_pdf_rag

Set Up Environment

python -m venv venv
source venv/bin/activate  # On Windows: .\venv\Scripts\activate
pip install -r requirements.txt

🎮 Running the Application

Option 1: Streamlit Interface

streamlit run streamlit_app.py

Then open your browser to http://localhost:8501

Streamlit interface showing PDF viewer and chat functionality

Option 2: Jupyter Notebook

jupyter notebook

Open updated_rag_notebook.ipynb to experiment with the code

💡 Usage Tips

Upload PDF: Use the file uploader in the Streamlit interface or try the sample PDF
Select Model: Choose from your locally available Ollama models
Ask Questions: Start chatting with your PDF through the chat interface
Adjust Display: Use the zoom slider to adjust PDF visibility
Clean Up: Use the "Delete Collection" button when switching documents

🤝 Contributing

Feel free to:

Open issues for bugs or suggestions
Submit pull requests
Comment on the YouTube video for questions
Star the repository if you find it useful!

⚠️ Troubleshooting

Ensure Ollama is running in the background
Check that required models are downloaded
Verify Python environment is activated
For Windows users, ensure WSL2 is properly configured if using Ollama

Common Errors

ONNX DLL Error

If you encounter this error:

DLL load failed while importing onnx_copy2py_export: a dynamic link Library (DLL) initialization routine failed.

Try these solutions:

Install Microsoft Visual C++ Redistributable:
- Download and install both x64 and x86 versions from Microsoft's official website
- Restart your computer after installation

If the error persists, try installing ONNX Runtime manually:

pip uninstall onnxruntime onnxruntime-gpu
pip install onnxruntime

CPU-Only Systems

If you're running on a CPU-only system:

Ensure you have the CPU version of ONNX Runtime:

pip uninstall onnxruntime-gpu  # Remove GPU version if installed
pip install onnxruntime  # Install CPU-only version

You may need to modify the chunk size in the code to prevent memory issues:
- Reduce chunk_size to 500-1000 if you experience memory problems
- Increase chunk_overlap for better context preservation

Note: The application will run slower on CPU-only systems, but it will still work effectively.

📝 License

This project is open source and available under the MIT License.

⭐️ Star History

Built with ❤️ by Tony Kipkemboi

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
.gitignore		.gitignore
LICENSE		LICENSE
LLMs.pdf		LLMs.pdf
README.md		README.md
WEF_The_Global_Cooperation_Barometer_2024.pdf		WEF_The_Global_Cooperation_Barometer_2024.pdf
local_ollama_rag.ipynb		local_ollama_rag.ipynb
requirements.txt		requirements.txt
scammer-agent.pdf		scammer-agent.pdf
st_app_ui.png		st_app_ui.png
streamlit_app.py		streamlit_app.py
updated_rag_notebook.ipynb		updated_rag_notebook.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🤖 Chat with PDF locally using Ollama + LangChain

📺 Video Tutorial

✨ Features

🚀 Getting Started

Prerequisites

🎮 Running the Application

Option 1: Streamlit Interface

Option 2: Jupyter Notebook

💡 Usage Tips

🤝 Contributing

⚠️ Troubleshooting

Common Errors

ONNX DLL Error

CPU-Only Systems

📝 License

⭐️ Star History

About

Releases

Packages

Contributors 2

Languages

License

tonykipkemboi/ollama_pdf_rag

Folders and files

Latest commit

History

Repository files navigation

🤖 Chat with PDF locally using Ollama + LangChain

📺 Video Tutorial

✨ Features

🚀 Getting Started

Prerequisites

🎮 Running the Application

Option 1: Streamlit Interface

Option 2: Jupyter Notebook

💡 Usage Tips

🤝 Contributing

⚠️ Troubleshooting

Common Errors

ONNX DLL Error

CPU-Only Systems

📝 License

⭐️ Star History

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages