Python Highlight OCR

Extract highlighted text from book images using Python, OpenCV, and Tesseract OCR.

Note

I have ported this project to JavScript to run in the browser without any installation. Check out highlights.

Usage

Install dependencies:

pip install pytesseract opencv-python numpy

Run with an image:

python main.py path/to/your/book-image.jpg