Tesseract

A simple Elixir wrapper for the Tesseract OCR.

Requirements

It's assumed that you have tesseract and all desired languages installed. For example, if you wanted to scan English (Tesseract's default language) and Japanese, you could get set up in a Debian-based system with:

$ sudo apt-get install tesseract-ocr tesseract-ocr-jpn

Installation

I didn't bother to put this in Hex since it's so simple. lib/tesseract.ex is really all there is.

$ curl https://raw.githubusercontent.com/bchase/tesseract-elixir/master/lib/tesseract.ex >> lib/tesseract.ex

Usage

We can scan the text from the image at test/support/reibun.png like so:

Tesseract.scan! "test/support/reibun.png" # defaults to English
# => "my: (ﬂLMSi/v) (n) model sentence: (P):\n\n"

Tesseract.scan! "test/support/reibun.png", lang: :jpn
# => "例文 (れいぶん) (n) m0deー SentenCe; (P);\n\n"

Tesseract.scan! "test/support/reibun.png", lang: [:jpn, :eng]
# => "例文 (れいぶん) (n) model sentence: (P):\n\n"

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
config		config
lib		lib
test		test
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
mix.exs		mix.exs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Tesseract

Requirements

Installation

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

bchase/tesseract-elixir

Folders and files

Latest commit

History

Repository files navigation

Tesseract

Requirements

Installation

Usage

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages