Extract Text from Image and Generate Word Document

Overview

This is a Streamlit application that allows users to upload an image, extracts text from the image using Azure Computer Vision, and then generates a Word document with the extracted text. The Word document is formatted with a step-by-step guide based on the extracted text, and it also includes the uploaded image as a screenshot. The application is built by Tony Esposito.

Features

Upload an image in PNG, JPEG, or JPG format.
Extract text from the image using Azure Computer Vision.
Generate a Word document that includes the extracted text and the image.
Download the generated Word document.

Prerequisites

Python 3.x
Streamlit
Azure Cognitive Services SDK
PIL (Pillow)
python-docx
OpenAI Python package

Setup and Installation

Clone the repository:
```
git clone <repository-url>
```
Navigate to the project directory:
```
cd <project-directory>
```
Install the required packages:
```
pip install -r requirements.txt
```
Set up environment variables:
- AZURE_SUBSCRIPTION_KEY: Your Azure subscription key for Computer Vision.
- AZURE_ENDPOINT: Endpoint URL for Azure Computer Vision.
- OPENAI_API_KEY: Your OpenAI API key.
You can set these variables in a .env file or directly in your system's environment variables.
Run the Streamlit application:
```
streamlit run <your-script-name>.py
```

Usage

Open the Streamlit application in your web browser.
Upload an image using the file uploader.
The application will display the text extracted from the image.
A Word document will be generated, and a download button will appear.
Click the download button to get the generated Word document.

Code Structure

generate_word_document(text, image_path, doc_path): Function to generate the Word document.
Streamlit UI: Code for rendering the Streamlit interface.
Azure Computer Vision: Code for extracting text from images.
Word Document Generation: Code for creating and saving the Word document.

Contributions

Built by Tony Esposito.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Extract Text from Image and Generate Word Document

Overview

Features

Prerequisites

Setup and Installation

Usage

Code Structure

Contributions

About

Uh oh!

Releases

Packages

Uh oh!

Languages

fbanespo1/nycers-procs

Folders and files

Latest commit

History

Repository files navigation

Extract Text from Image and Generate Word Document

Overview

Features

Prerequisites

Setup and Installation

Usage

Code Structure

Contributions

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages