Segmentation Project

This project processes and segments OCR (Optical Character Recognition) data. It includes tools and scripts that parse OCR output and prepare it for further analysis or processing.

Features

Extract text from videos of documents.
Parse raw OCR outputs into structured formats.
Clean and preprocess OCR data for downstream AI/ML.
Export video scans to PDF.

Parsing text from videos

The make parse_OCR target parses OCR data and extracts meaningful information. It automates handling raw OCR output, cleaning the data, and organizing it into a structured format.
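To make "cleaning the data and organizing it into a structured format" more concrete, here is a minimal sketch of what such a step could look like. It is an illustration only, not the project's implementation: the `OcrLine` record, the `clean_text` and `structure_page` helpers, and the choice of one record per non-empty line are all assumptions made for this example.

```python
from __future__ import annotations

import re
from dataclasses import dataclass


@dataclass
class OcrLine:
    """Hypothetical structured record for one line of OCR text."""
    page: int
    line_no: int
    text: str


def clean_text(raw: str) -> str:
    # Collapse runs of whitespace (spaces, tabs) into single spaces and trim the ends.
    return re.sub(r"\s+", " ", raw).strip()


def structure_page(page: int, raw_text: str) -> list[OcrLine]:
    """Turn the raw OCR text of one page into a list of cleaned line records."""
    records: list[OcrLine] = []
    for raw_line in raw_text.splitlines():
        text = clean_text(raw_line)
        if text:  # skip lines that are empty after cleaning
            records.append(OcrLine(page=page, line_no=len(records) + 1, text=text))
    return records
```

Calling `structure_page(1, raw_text)` on the raw text of a page returns a list of `OcrLine` records that can be serialized or passed to later processing steps.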

Put your videos in Videos/ before continuing. Each video should show the pages of a document being flipped through. OCR is run on these videos, and the output is saved in the test_frames/ directory.

Usage

Once the videos are in the appropriate directory, simply execute the following command (from the main project directory):

make parse_OCR

Example: Parsing OCR data from a sample video

To parse OCR data from a sample video, follow these steps:

Place your video file in the Videos/ directory (e.g., sample_video.mov).
Run the following command from the main project directory:
```
make parse_OCR
```
The output will be saved in the test_frames/ directory.
Check the structured output in test_frames/sample_video.mov/.
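The steps above imply a simple layout: each video in Videos/ gets an output directory of the same name under test_frames/. The sketch below checks which videos already have output. It is a convenience script written for this README, not part of the repository; the helper names and the set of video extensions in `VIDEO_SUFFIXES` are assumptions.

```python
from __future__ import annotations

from pathlib import Path

VIDEO_DIR = Path("Videos")
OUTPUT_DIR = Path("test_frames")
# Assumption: the README only mentions .mov; the other extensions are guesses.
VIDEO_SUFFIXES = {".mov", ".mp4", ".avi"}


def expected_output_dir(video: Path, output_root: Path = OUTPUT_DIR) -> Path:
    # Per the example above, output for Videos/sample_video.mov
    # lands in test_frames/sample_video.mov/.
    return output_root / video.name


def find_videos(video_dir: Path = VIDEO_DIR) -> list[Path]:
    """Return the video files in video_dir, sorted by name."""
    if not video_dir.is_dir():
        return []
    return sorted(
        p for p in video_dir.iterdir()
        if p.is_file() and p.suffix.lower() in VIDEO_SUFFIXES
    )


def report(video_dir: Path = VIDEO_DIR, output_root: Path = OUTPUT_DIR) -> dict[str, bool]:
    """Map each video name to whether its output directory exists."""
    return {
        video.name: expected_output_dir(video, output_root).is_dir()
        for video in find_videos(video_dir)
    }


if __name__ == "__main__":
    for name, done in report().items():
        print(f"{name}: {'output found' if done else 'no output yet'}")
```

Run it from the main project directory after make parse_OCR to see which videos have produced output.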

Contributing

Contributions are welcome! Please follow these steps:

Fork the repository.
Create a new branch for your feature or bugfix.
Commit your changes and push the branch.
Submit a pull request.

License

This project is licensed under the Apache License 2.0. See the LICENSE file for details.
