ocr++(an update to existing old ocrs)

The code is to convert the image to speech. An image is processed and segmented to identify the characters in the image. Then the characters are combined to form words and save it as a text file. This text file is converted to speech. We have divided the project into four sub parts : image is pre-processed, segmented to extract the images of characters, then characters are recognized and combined , then the text is translated then converted into speech.

For language translation(english to french) I achieved an accuracy of 94% while 96.3% for ocr character recognition

Technical terms:

Image to text

• Convolution 2d

• Max Pooling

• Activation Function (Tanh, Relu, Sigmoid, Leaky Relu)

• Flatten

• Dropout

Character Segmentation • C# (.exe)

Language Translation

• LSTM • GRU • Bi-directional RNN • Embedding layer • Encoder and Decoder

Text to Speech

• GTTs

• PYgame

We use two different datasets:

Language translation: https://machinelearningmastery.com/prepare-french-english-dataset-machine-translation/ Image to text: http://www.ee.surrey.ac.uk/CVSSP/demos/chars74k/

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
ocrs		ocrs
predict		predict
1412.1842.pdf		1412.1842.pdf
README.md		README.md
ai_ocr.docx		ai_ocr.docx
model_ocr.py		model_ocr.py
model_ocr_1.0.py		model_ocr_1.0.py
model_ocr_2.0.py		model_ocr_2.0.py
model_ocr_3.0.py		model_ocr_3.0.py
model_ocr_4.0.py		model_ocr_4.0.py
model_ocr_5.0.py		model_ocr_5.0.py
model_ocr_6.0.py		model_ocr_6.0.py
model_predict.py		model_predict.py
ocr_final.py		ocr_final.py
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ocr++(an update to existing old ocrs)

For language translation(english to french) I achieved an accuracy of 94% while 96.3% for ocr character recognition

Technical terms:

We use two different datasets:

About

Uh oh!

Releases

Packages

Languages

LeadingIndiaAI/-IMAGE-TO-SPEECH-CONVERTOR-

Folders and files

Latest commit

History

Repository files navigation

ocr++(an update to existing old ocrs)

For language translation(english to french) I achieved an accuracy of 94% while 96.3% for ocr character recognition

Technical terms:

We use two different datasets:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages