Convert PDF documents into Excel files with this PDF to Excel converter built using Python and Flask.
The PDF to Excel Converter is a web application developed using Flask, a micro web framework for Python. This application enables users to upload PDF files and convert them into Excel spreadsheets. It utilizes the pdfplumber and tabula-py libraries to extract textual content and tables from PDFs, then formats and exports them into Excel files.
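As a rough illustration of the text path described above, the sketch below reads each page with pdfplumber and writes the non-empty lines to a single worksheet. The function names, the column names, and the use of pandas (with openpyxl installed) as the Excel writer are assumptions made for this example; the project's own code may be organized differently.

```python
def text_rows(page_texts):
    """Turn per-page text into (page_number, line) rows for one worksheet."""
    rows = []
    for page_number, text in enumerate(page_texts, start=1):
        # pdfplumber's extract_text() can return None for pages without text.
        for line in (text or "").splitlines():
            if line.strip():
                rows.append((page_number, line.strip()))
    return rows


def pdf_text_to_excel(pdf_path, xlsx_path):
    """Write every non-empty text line of a PDF into a sheet named 'Text'."""
    # Imported here so text_rows() stays usable without these packages.
    import pandas as pd
    import pdfplumber

    with pdfplumber.open(pdf_path) as pdf:
        page_texts = [page.extract_text() for page in pdf.pages]

    # Column names are hypothetical; pick whatever suits the spreadsheet.
    frame = pd.DataFrame(text_rows(page_texts), columns=["page", "text"])
    frame.to_excel(xlsx_path, sheet_name="Text", index=False)
```

Keeping the row-building step separate from the file handling makes it possible to check the output shape without a real PDF.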
- Convert entire PDF text content into Excel format.
- Extract tables from PDF pages and export them as separate sheets in Excel.
- Easy-to-use web interface for uploading PDF files.
- Two processing options: All Text and Tables Only.
- Well-structured codebase for easy customization and extension.
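The "separate sheets" feature above could look roughly like the following sketch, which uses pdfplumber's `extract_tables()` and writes one worksheet per detected table. Treating the first row as a header, the `p1_t1` sheet naming, and the helper names are assumptions for illustration only. tabula-py's `read_pdf` is an alternative for the extraction step, but it requires a Java runtime.

```python
def split_header(table):
    """Treat the first row as the header and replace empty cells with ''."""
    cleaned = [[(cell or "").strip() for cell in row] for row in table]
    header, body = cleaned[0], cleaned[1:]
    names, seen = [], set()
    for index, name in enumerate(header, start=1):
        # Blank or repeated header cells get a positional name instead.
        if not name or name in seen:
            name = f"column_{index}"
        seen.add(name)
        names.append(name)
    return names, body


def pdf_tables_to_excel(pdf_path, xlsx_path):
    """Write each table found in the PDF to its own worksheet."""
    # Imported here so split_header() stays usable without these packages.
    import pandas as pd
    import pdfplumber

    frames = {}
    with pdfplumber.open(pdf_path) as pdf:
        for page_number, page in enumerate(pdf.pages, start=1):
            for table_number, table in enumerate(page.extract_tables(), start=1):
                if not table:
                    continue
                names, body = split_header(table)
                # Short names stay within Excel's 31-character sheet-name limit.
                frames[f"p{page_number}_t{table_number}"] = pd.DataFrame(body, columns=names)

    if not frames:
        # A workbook needs at least one sheet, so report the empty case instead.
        raise ValueError("no tables found in PDF")

    with pd.ExcelWriter(xlsx_path) as writer:
        for name, frame in frames.items():
            frame.to_excel(writer, sheet_name=name, index=False)
    return list(frames)
```

The sketch assumes each extracted table is rectangular, meaning every row has as many cells as the header.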
- Clone the repository:
  git clone https://github.com/yourusername/pdf-to-excel-converter.git
- Navigate to the project directory:
  cd pdf-to-excel-converter
- Install dependencies:
  pip install -r requirements.txt
- Start the Flask server:
  python app.py
- Open your web browser and navigate to http://localhost:5000.
- Upload a PDF file using the provided form.
- Choose the processing option:
  - All Text: Convert the entire text content of the PDF into an Excel file.
  - Tables Only: Extract tables from the PDF and export them as separate sheets in Excel.
- Click the "Convert" button and wait for the conversion to complete.
- Download the generated Excel file.
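The upload, convert, and download flow in the steps above might be handled on the server by something like this minimal sketch. The `/convert` route, the `file` and `mode` form field names, the `text`/`tables` values, and the `convert` callable are all hypothetical and are not taken from the project's `app.py`.

```python
import tempfile
from pathlib import Path

ALLOWED_MODES = {"text", "tables"}  # hypothetical form values for the two options


def validate_upload(filename, mode):
    """Return an error message for a bad request, or None if it looks fine."""
    if not filename or not filename.lower().endswith(".pdf"):
        return "Please upload a .pdf file."
    if mode not in ALLOWED_MODES:
        return "Unknown processing option."
    return None


def create_app(convert):
    """Build a Flask app; convert(pdf_path, xlsx_path, mode) does the real work."""
    # Imported here so validate_upload() stays usable without Flask.
    from flask import Flask, abort, request, send_file

    app = Flask(__name__)

    @app.post("/convert")
    def convert_route():
        upload = request.files.get("file")
        mode = request.form.get("mode", "text")
        error = validate_upload(upload.filename if upload else "", mode)
        if error:
            abort(400, description=error)

        # Fixed file names avoid trusting the client-supplied name on disk.
        # The temporary directory is not cleaned up in this sketch.
        workdir = Path(tempfile.mkdtemp())
        pdf_path = workdir / "upload.pdf"
        xlsx_path = workdir / "converted.xlsx"
        upload.save(pdf_path)
        convert(pdf_path, xlsx_path, mode)
        return send_file(xlsx_path, as_attachment=True, download_name="converted.xlsx")

    return app
```

Under these assumptions, calling `create_app(my_convert).run(port=5000)` with a suitable `my_convert` function would serve the endpoint on the port mentioned in the steps above.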
Contributions are welcome! Please follow these steps to contribute:
- Fork the repository.
- Create a new branch (git checkout -b feature/yourfeature).
- Make your changes.
- Commit your changes (git commit -am 'Add new feature').
- Push to the branch (git push origin feature/yourfeature).
- Create a new Pull Request.
This project is licensed under the MIT License - see the LICENSE file for details.