-
Notifications
You must be signed in to change notification settings - Fork 360
Cannot use on password protected files #162
Comments
There are two issues with this:
I have a potential pull request for camelot to add the mediocre level of support offered by PyPDF2, but am uncertain whether that approach is desirable. @vinayak-mehta: given the lack of maintenance have you considered moving to an alternative PDF library? PikePDF looks promising but would require backporting for python 2.7 |
@rbares Thanks for the detailed comment! I've faced issues with PyPDF2 decryption in the past (I remember the same problem of it not being able to support some encryption types). I've found @heroic Maybe you can add a preprocessing step that decrypts all your PDFs at once with qpdf and a wildcard (*.pdf), before extracting tables using Camelot? |
Update API and CLI to accept ASCII passwords to decrypt PDFs encrypted by algorithm code 1 or 2 (limited by support from PyPDF2). Update documentation and unit tests accordingly. Example document health_protected.pdf generated as follows: qpdf --encrypt userpass ownerpass 128 -- health.pdf health_protected.pdf Issue atlanhq#162
Explicitly check passwords for None rather than falsey. Correct read_pdf documentation for Owner/User password. Issue atlanhq#162
* [MRG] Add basic support for encrypted PDF files Update API and CLI to accept ASCII passwords to decrypt PDFs encrypted by algorithm code 1 or 2 (limited by support from PyPDF2). Update documentation and unit tests accordingly. Example document health_protected.pdf generated as follows: qpdf --encrypt userpass ownerpass 128 -- health.pdf health_protected.pdf Issue #162 * Support encrypted PDF files in python3 Issue #162 * Address review comments Explicitly check passwords for None rather than falsey. Correct read_pdf documentation for Owner/User password. Issue #162 * Correct API documentation changes for consistency Issue #162 * Move error tests from test_common to test_errors Issue #162 * Add qpdf example * Remove password is not None check * Fix merge conflict * Fix pages example
any update on this ? |
@Fabian1337 Does the the |
Most of the files that we have, have a password. PdfFileReader does support decrypting the file, but without an option to pass the password in read_pdf it cannot be done
The text was updated successfully, but these errors were encountered: