ocrTESSERACT

This repository is a tutorial for using the Tesseract software. It consists of a simple shell script that you can run over any of your directories. Before you start, note that a config.json file is required to launch the script.

The following table explains which properties you have to set.

| Argument | config.json attribute | Description |
| --- | --- | --- |
| results | resultFiles | path to the folder containing the images to be recognized |
| createTXT | createTXT | `true` or `false`: whether a `.txt` file is created |
| createPDF | createPDF | `true` or `false`: whether a searchable `.pdf` file is created |
| deleteIMG | deleteIMG | `true` or `false`: whether the original image is deleted |
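
As a rough illustration of the attributes listed above, the following sketch writes an example configuration file. The file name `config.example.json` and the path `./images` are placeholders, not values taken from this repository, and writing the flags as JSON booleans (rather than strings) is an assumption. Only the four documented attributes are included.

```shell
#!/usr/bin/env bash
# Sketch: write an example configuration with the four documented attributes.
# The file name and the ./images path are hypothetical placeholders.
cat > config.example.json <<'EOF'
{
  "resultFiles": "./images",
  "createTXT": true,
  "createPDF": true,
  "deleteIMG": false
}
EOF
```

Copying such a file to config.json and adjusting the path should be enough to get started, provided the script expects the same value types.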

dir="$( cd "$( dirname "${BASH_SOURCE[0]}" )" && pwd )"   # directory containing this script

predictions=$(jq -r '.predictionFiles' config.json)   # -r prints raw strings without JSON quotes
results=$(jq -r '.resultFiles' config.json)
createTXT=$(jq -r '.createTXT' config.json)
createPDF=$(jq -r '.createPDF' config.json)
deleteIMG=$(jq -r '.deleteIMG' config.json)
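
The values read from config.json arrive in the shell as plain strings, so a typo such as `ture` would pass through unnoticed. Below is a minimal sketch of a check that could run after the assignments. The function name `require_bool` is hypothetical and not part of the script.

```shell
#!/usr/bin/env bash
# Sketch: check that a flag read from config.json is exactly "true" or "false".
# require_bool is a hypothetical helper, not part of setup.sh.
require_bool() {
  local name="$1" value="$2"
  case "$value" in
    true|false) return 0 ;;
    *)
      echo "config.json: $name must be true or false, got '$value'" >&2
      return 1
      ;;
  esac
}

# Example with literal values standing in for the variables read from config.json.
require_bool createTXT "true" && echo "createTXT ok"
require_bool deleteIMG "maybe" || echo "deleteIMG rejected"
```

A missing attribute would show up here as the string `null`, which the check rejects as well.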

Finally, set the `resultFiles` attribute in config.json to the directory containing the images to be recognized, then run the script with the following command:

bash setup.sh
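
For orientation, here is a sketch of how the three flags could translate into a Tesseract call for a single image. It does not reproduce setup.sh. The function name `ocr_command` and the file name `scan.png` are hypothetical. It assumes Tesseract's `txt` and `pdf` output configs, and it only prints the commands instead of running them.

```shell
#!/usr/bin/env bash
# Sketch: print the commands one image would need, given the three flags.
# ocr_command is a hypothetical helper; nothing is executed or deleted.
ocr_command() {
  local image="$1" create_txt="$2" create_pdf="$3" delete_img="$4"
  local base="${image%.*}"   # output base: the image path without its extension
  local outputs=()

  [ "$create_txt" = "true" ] && outputs+=("txt")
  [ "$create_pdf" = "true" ] && outputs+=("pdf")

  # tesseract <image> <outputbase> [configs...] writes <outputbase>.txt / .pdf
  if [ "${#outputs[@]}" -gt 0 ]; then
    echo "tesseract $image $base ${outputs[*]}"
  fi
  if [ "$delete_img" = "true" ]; then
    echo "rm -- $image"
  fi
}

ocr_command scan.png true true false
```

Printing the commands first makes it easy to check what would happen to a directory before any image is deleted.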

Used tools

jq - reading and writing JSON from shell scripts
ImageMagick - image processing
Tesseract - OCR tool



karhunenloeve/TesseRACT
