This project is a Python application with a graphical user interface (GUI) that allows users to drag and drop or click to select a PDF file for Optical Character Recognition (OCR) processing. The application processes the selected PDF file and displays the results.
my-python-gui-app
├── src
│ ├── main.py # Entry point of the application
│ ├── ocr_processor.py # Logic for processing PDF files using OCR
│ └── gui
│ └── app.py # GUI components and file selection handling
├── requirements.txt # List of dependencies
└── README.md # Project documentation
To run this application, you need to install the following dependencies:
- tkinter
- pillow
- pytesseract
- pdf2image
You can install the required packages using pip:
pip install -r requirements.txt
- Clone the repository or download the project files.
- Navigate to the project directory.
- Install the required dependencies as mentioned above.
- Run the application using the following command:
python src/main.py
- A window will open where you can drag and drop a PDF file or click to select one.
- The application will process the PDF file and display the OCR results.
This project is licensed under the MIT License. See the LICENSE file for more details.