GitHub - morganrivers/problem_set_ocr: OCR using gpt4 to go from handwritten to latex psets

This codebase is for uploading psets using gpt4 vision.

Before:

After:

Requirements:

GPT4 api key. For me, in adddition to the $20 a month gpt4 charge, gpt4 vision is about $0.01 per query, so the cost will be much less than a dollar per pset.
pdflatex installed
acroread installed (optional, but it's nice for viewing the latex when they are generated)

Tested on Debian. Your mileage may vary.

Usage:

rename the example_data/ to data/. Add your api key in the data/params.json file.
Get a bunch of images of your pset and download them to a folder within psets/ subfolder
Now navigate to src/ and run
```
python3 gpt4_to_tex.py
```
That will let you select the pset folder, and then for all the images in the pset
1. show them to you before you upload them to gpt4
2. render gpt4's latex version in latex using pdflatex (option to ask gpt4 to try again if the continuation is not valid latex)
3. show you the rendered pdf with acroread
Note: to get out of a compile error and have gpt4 try again to generate valid latex, just enter "x" and that exits pdflatex in a civilized way.

Finally, all the latex docs are consolidated into one large latex file which is the latex version of your problem set!

Also! Useful command (unrelated to anything above):

python -c "import subprocess; subprocess.run(['pdflatex', 'output.tex'], check=True)"

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
example_data		example_data
psets		psets
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback