The Document Processing Web App is a tool designed to upload and process documents with ease. It allows users to upload images, and retrieve both processed images and extracted text files. The app supports functionality such as drag-and-drop uploads, previewing drafts and downloading results.
- Donald Xu
- Shanette Presas
- Hanna De Los Santos
- Ken Cheng
- Jersey To
- Auto Crop and Warp Image
- Automatically detects the edges of an image, crops it, and applies perspective warping for better alignment.
- Auto Extract Text
- Utilizes Tesseract OCR to extract text from the processed image, enabling easy text retrieval and editing.
- Framework: React.js
- CSS: Tailwind CSS
- HTTP Client: Axios
- Framework: Flask (Python)
- Libraries:
-
os
for file handling -send_from_directory
for file serving - Hosting: Local server on
http://localhost:5000
cv2
for image processingimutils
for simplifying image processing taskspytesseract
for integrating with Tesseract OCR to extract text from imagesnumpy
for numerical operations
-
Install Tesseract OCR
-
Activate the virtual environment:
.\venv\Scripts\activate
- Run the Flask server:
python server.py
The server will be available at http://localhost:5000
.
- Navigate to the
frontend
folder. - Install dependencies:
npm install
- Start the React development server:
npm run dev
The app will be available athttp://localhost:3000
.
<name>
name of the document with file extension
-
http://localhost:5000/uploads/<name>
: Download processed output files. -
http://localhost:5000/drafts/<name>
: Retrieve draft images. -
http://localhost:5000/texts/<name>
: Retrieve text files.
http://localhost:5000/delete
: Clears all session-related files from the backend.
-
Launch both the frontend and backend servers.
-
Navigate to
http://localhost:3000
. -
Upload an image file.
-
View draft previews and download processed outputs.
- Donald Xu: Backend development and API implementation.
- Shanette Presas: Frontend UI/UX design and integration.
- Hanna De Los Santos: Testing, debugging, and documentation.
- Ken Cheng: Backend development and API implementation.
- Jersey To: Frontend UI/UX design and integration.
- Support for additional file formats (e.g., PDFs).
- Temporary Cloud storage integration.
This project is open-source and licensed under the MIT License.