WO2015200110A3 - Techniques for machine language translation of text from an image based on non-textual context information from the image - Google Patents
- Publication number
- WO2015200110A3 (application PCT/US2015/036603)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- text
- image
- context information
- server
- technique
- Prior art date
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/58—Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/768—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using context analysis, e.g. recognition aided by known co-occurring patterns
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/20—Scenes; Scene-specific elements in augmented reality scenes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/60—Type of objects
- G06V20/62—Text, e.g. of license plates, overlay texts or captions on TV images
- G06V20/63—Scene text, e.g. street names
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/26—Techniques for post-processing, e.g. correcting the recognition result
- G06V30/262—Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Artificial Intelligence (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- Databases & Information Systems (AREA)
- Evolutionary Computation (AREA)
- Medical Informatics (AREA)
- Software Systems (AREA)
- Machine Translation (AREA)
- Character Discrimination (AREA)
Abstract
A computer-implemented technique can include receiving, at a server having one or more processors, an image from a mobile computing device, the image including a text. The technique can include obtaining, at the server, optical character recognition (OCR) text corresponding to the text, the OCR text having been obtained by performing OCR on the image. The technique can include identifying, at the server, non-textual context information from the image, the non-textual context information (i) representing context information other than the text itself and (ii) being indicative of a context of the image. The technique can include obtaining, at the server and based on the non-textual context information, a translation of the OCR text to a target language to obtain a translated OCR text. The technique can include outputting, from the server to the mobile computing device, the translated OCR text.
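The flow in the abstract (receive an image, obtain OCR text, identify non-textual context, translate based on that context, return the result) can be sketched roughly as follows. This is a minimal illustration, not the method claimed in the patent: the OCR and object-detection steps are stubbed as plain fields, French is assumed as the target language, and every name here (`ImageRequest`, `identify_context`, `translate`, `handle_request`, the toy lexicons) is hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ImageRequest:
    """Hypothetical stand-in for an image uploaded by a mobile device."""
    ocr_text: str  # what an OCR step would return (stubbed)
    detected_objects: List[str] = field(default_factory=list)  # detector labels (stubbed)


# Toy mapping from detected object labels to a coarse image context (assumption).
OBJECT_TO_CONTEXT = {"baseball": "sports", "glove": "sports", "cave": "wildlife"}

# Toy English-to-French lexicons (assumption): context-specific, then default.
CONTEXT_LEXICON = {
    ("bat", "sports"): "batte",
    ("bat", "wildlife"): "chauve-souris",
}
DEFAULT_LEXICON = {"bat": "chauve-souris"}


def identify_context(detected_objects: List[str]) -> Optional[str]:
    """Return the context of the first recognized object label, or None."""
    for label in detected_objects:
        if label in OBJECT_TO_CONTEXT:
            return OBJECT_TO_CONTEXT[label]
    return None


def translate(ocr_text: str, context: Optional[str]) -> str:
    """Translate word by word, preferring a context-specific entry when one exists."""
    translated = []
    for word in ocr_text.lower().split():
        fallback = DEFAULT_LEXICON.get(word, word)  # unknown words pass through
        translated.append(CONTEXT_LEXICON.get((word, context), fallback))
    return " ".join(translated)


def handle_request(request: ImageRequest) -> str:
    """Server-side flow: derive context from the image, then translate the OCR text."""
    context = identify_context(request.detected_objects)
    return translate(request.ocr_text, context)


if __name__ == "__main__":
    print(handle_request(ImageRequest("Bat", ["baseball"])))
    print(handle_request(ImageRequest("Bat", ["cave"])))
```

In this sketch the same OCR text yields a different translation depending on which objects accompany it in the image, which is the role the abstract assigns to non-textual context information. A real system would replace the lexicon lookup with a machine translation model and the label list with actual image analysis.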
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP15795248.2A EP3161667A2 (en) | 2014-06-24 | 2015-06-19 | Techniques for machine language translation of text from an image based on non-textual context information from the image |
CN201580033709.4A CN106462574B (en) | 2014-06-24 | 2015-06-19 | The method and server of machine language translation for the text from image |
KR1020167036222A KR101889052B1 (en) | 2014-06-24 | 2015-06-19 | Techniques for machine language translation of text from an image based on non-textual context information from the image |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/313,670 US9436682B2 (en) | 2014-06-24 | 2014-06-24 | Techniques for machine language translation of text from an image based on non-textual context information from the image |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2015200110A2 (en) | 2015-12-30 |
WO2015200110A3 (en) | 2016-02-25 |
Family
ID=54548239
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2015/036603 WO2015200110A2 (en) | 2014-06-24 | 2015-06-19 | Techniques for machine language translation of text from an image based on non-textual context information from the image |
Country Status (5)
Country | Link |
---|---|
US (2) | US9436682B2 (en) |
EP (1) | EP3161667A2 (en) |
KR (1) | KR101889052B1 (en) |
CN (1) | CN106462574B (en) |
WO (1) | WO2015200110A2 (en) |
Families Citing this family (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10140293B2 (en) * | 2015-05-18 | 2018-11-27 | Google Llc | Coordinated user word selection for translation and obtaining of contextual information for the selected word |
CN105159893A (en) * | 2015-08-31 | 2015-12-16 | 小米科技有限责任公司 | Character string saving method and device |
US20170286383A1 (en) * | 2016-03-30 | 2017-10-05 | Microsoft Technology Licensing, Llc | Augmented imaging assistance for visual impairment |
CN113407743B (en) * | 2016-04-08 | 2024-11-05 | 北京三星通信技术研究有限公司 | Object information translation and derived information acquisition methods and devices |
US10579741B2 (en) * | 2016-08-17 | 2020-03-03 | International Business Machines Corporation | Proactive input selection for improved machine translation |
US10311330B2 (en) * | 2016-08-17 | 2019-06-04 | International Business Machines Corporation | Proactive input selection for improved image analysis and/or processing workflows |
US10580213B2 (en) | 2016-09-13 | 2020-03-03 | Magic Leap, Inc. | Systems and methods for sign language recognition |
US10229113B1 (en) * | 2016-09-28 | 2019-03-12 | Amazon Technologies, Inc. | Leveraging content dimensions during the translation of human-readable languages |
US10235362B1 (en) | 2016-09-28 | 2019-03-19 | Amazon Technologies, Inc. | Continuous translation refinement with automated delivery of re-translated content |
US10223356B1 (en) | 2016-09-28 | 2019-03-05 | Amazon Technologies, Inc. | Abstraction of syntax in localization through pre-rendering |
US10275459B1 (en) | 2016-09-28 | 2019-04-30 | Amazon Technologies, Inc. | Source language content scoring for localizability |
US10261995B1 (en) | 2016-09-28 | 2019-04-16 | Amazon Technologies, Inc. | Semantic and natural language processing for content categorization and routing |
KR102478396B1 (en) * | 2017-11-29 | 2022-12-19 | 삼성전자주식회사 | The Electronic Device Recognizing the Text in the Image |
JP7024427B2 (en) * | 2018-01-17 | 2022-02-24 | トヨタ自動車株式会社 | Display device for vehicles |
KR102598104B1 (en) | 2018-02-23 | 2023-11-06 | 삼성전자주식회사 | Method for displaying text information on an object contained in an image by compensating for motion generated during time of receiving text information from an external electronic device and electronic device thereof |
CN109190130B (en) * | 2018-08-30 | 2022-04-12 | 昆明理工大学 | Research method based on POI similarity and translation machine matching recommendation algorithm |
CN109241900B (en) * | 2018-08-30 | 2021-04-09 | Oppo广东移动通信有限公司 | Wearable device control method and device, storage medium and wearable device |
CN110163121B (en) * | 2019-04-30 | 2023-09-05 | 腾讯科技(深圳)有限公司 | Image processing method, device, computer equipment and storage medium |
CN111914830B (en) * | 2019-05-07 | 2024-10-08 | 阿里巴巴集团控股有限公司 | Text line positioning method, device, equipment and system in image |
CN110569830B (en) * | 2019-08-01 | 2023-08-22 | 平安科技(深圳)有限公司 | Multilingual text recognition method, device, computer equipment and storage medium |
US20230124572A1 (en) * | 2020-01-08 | 2023-04-20 | Google Llc | Translation of text depicted in images |
KR102374281B1 (en) | 2020-02-27 | 2022-03-16 | 주식회사 와들 | Importance Determination System of Text Block Extracted from Image and Its Method |
KR102374280B1 (en) | 2020-02-27 | 2022-03-16 | 주식회사 와들 | Blocking System of Text Extracted from Image and Its Method |
CN111382748B (en) * | 2020-02-28 | 2024-03-19 | 北京小米松果电子有限公司 | Image translation method, device and storage medium |
CN113392653A (en) * | 2020-03-13 | 2021-09-14 | 华为技术有限公司 | Translation method, related device, equipment and computer readable storage medium |
KR20220056004A (en) * | 2020-10-27 | 2022-05-04 | 삼성전자주식회사 | Electronic device and Method for controlling the electronic device thereof |
WO2021081562A2 (en) * | 2021-01-20 | 2021-04-29 | Innopeak Technology, Inc. | Multi-head text recognition model for multi-lingual optical character recognition |
US20230140570A1 (en) * | 2021-11-03 | 2023-05-04 | International Business Machines Corporation | Scene recognition based natural language translation |
CN115019291B (en) * | 2021-11-22 | 2023-04-14 | 荣耀终端有限公司 | Character recognition method for image, electronic device and storage medium |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130108115A1 (en) * | 2011-08-29 | 2013-05-02 | Qualcomm Incorporated | Camera OCR with context information |
US20140081619A1 (en) * | 2012-09-18 | 2014-03-20 | Abbyy Software Ltd. | Photography Recognition Translation |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7751805B2 (en) | 2004-02-20 | 2010-07-06 | Google Inc. | Mobile image-based information retrieval system |
US7643985B2 (en) * | 2005-06-27 | 2010-01-05 | Microsoft Corporation | Context-sensitive communication and translation methods for enhanced interactions and understanding among speakers of different languages |
US20080221862A1 (en) | 2007-03-09 | 2008-09-11 | Yahoo! Inc. | Mobile language interpreter with localization |
US8144990B2 (en) | 2007-03-22 | 2012-03-27 | Sony Ericsson Mobile Communications Ab | Translation and display of text in picture |
US8725490B2 (en) | 2007-10-18 | 2014-05-13 | Yahoo! Inc. | Virtual universal translator for a mobile device with a camera |
EP2116739B1 (en) * | 2008-05-09 | 2020-02-26 | Fox Factory, Inc. | Methods and apparatus for position sensitive suspension dampening |
CN101667251B (en) * | 2008-09-05 | 2014-07-23 | 三星电子株式会社 | OCR recognition method and device with auxiliary positioning function |
CN101620595A (en) * | 2009-08-11 | 2010-01-06 | 上海合合信息科技发展有限公司 | Method and system for translating text of electronic equipment |
KR101263332B1 (en) * | 2009-09-11 | 2013-05-20 | 한국전자통신연구원 | Automatic translation apparatus by using user interaction in mobile device and its method |
KR101077788B1 (en) * | 2010-01-18 | 2011-10-28 | 한국과학기술원 | Method and apparatus for recognizing objects in images |
TW201222282A (en) | 2010-11-23 | 2012-06-01 | Inventec Corp | Real time translation method for mobile device |
US8758826B2 (en) * | 2011-07-05 | 2014-06-24 | Wet Inc. | Cannabinoid receptor binding agents, compositions, and methods |
JP5348198B2 (en) * | 2011-08-04 | 2013-11-20 | コニカミノルタ株式会社 | Image forming apparatus |
US9424255B2 (en) * | 2011-11-04 | 2016-08-23 | Microsoft Technology Licensing, Llc | Server-assisted object recognition and tracking for mobile devices |
US20140030683A1 (en) * | 2012-07-24 | 2014-01-30 | Rebecca Anna BALLARD | Sensory input devices, and sensory input methods |
2014
- 2014-06-24 US US14/313,670, patent US9436682B2 (en), status: Active
2015
- 2015-06-19 CN CN201580033709.4A, patent CN106462574B (en), status: Active
- 2015-06-19 EP EP15795248.2A, patent EP3161667A2 (en), status: Withdrawn
- 2015-06-19 KR KR1020167036222A, patent KR101889052B1 (en), status: IP Right Grant
- 2015-06-19 WO PCT/US2015/036603, patent WO2015200110A2 (en), status: Application Filing
2016
- 2016-08-31 US US15/252,309, patent US20160371256A1 (en), status: Abandoned
Also Published As
Publication number | Publication date |
---|---|
US20150370785A1 (en) | 2015-12-24 |
KR101889052B1 (en) | 2018-08-16 |
CN106462574A (en) | 2017-02-22 |
WO2015200110A2 (en) | 2015-12-30 |
US9436682B2 (en) | 2016-09-06 |
US20160371256A1 (en) | 2016-12-22 |
KR20170010843A (en) | 2017-02-01 |
CN106462574B (en) | 2019-07-12 |
EP3161667A2 (en) | 2017-05-03 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
WO2015200110A3 (en) | Techniques for machine language translation of text from an image based on non-textual context information from the image | |
MY193819A (en) | Electronic device and operating method thereof | |
EP3136257A3 (en) | Document-specific gazetteers for named entity recognition | |
WO2018208869A3 (en) | A learning based approach for aligning images acquired with different modalities | |
EP2857983A3 (en) | Analyzing font similarity for presentation | |
WO2014140903A3 (en) | Apparatus, method, and computer readable medium for recognizing text on a curved surface | |
EP3010219A3 (en) | Method and apparatus for managing images using a voice tag | |
EP2833294A3 (en) | Device to extract biometric feature vector, method to extract biometric feature vector and program to extract biometric feature vector | |
EP2919434A3 (en) | Method for determining data source | |
NZ744400A (en) | Eye image collection, selection, and combination | |
WO2014004536A3 (en) | Voice-based image tagging and searching | |
IL235565B (en) | Location based optical character recognition (ocr) | |
JP2015210683A5 (en) | ||
WO2015092588A3 (en) | Spectral image data processing | |
CL2016001036A1 (en) | Complex background-oriented optical character recognition method and device | |
EP2704061A3 (en) | Apparatus and method for recognizing a character in terminal equipment | |
WO2014108460A3 (en) | A label inspection system and method | |
MX364147B (en) | Area extracting method and apparatus. | |
WO2014110206A3 (en) | Advanced text editor | |
EP2713314A3 (en) | Image processing device and image processing method | |
EP2746989A3 (en) | Document processing device, image processing apparatus, document processing method and computer program product | |
MX2016017370A (en) | Instruction-generating method and device. | |
EP2816559A3 (en) | Translation system comprising display apparatus and server and control method thereof | |
EP3200048A3 (en) | Image display apparatus | |
EP2919115A3 (en) | Task migration method and apparatus |
Legal Events
Code | Title | Description |
---|---|---|
121 | EP: the EPO has been informed by WIPO that EP was designated in this application | Ref document number: 15795248; Country of ref document: EP; Kind code of ref document: A2 |
REEP | Request for entry into the European phase | Ref document number: 2015795248; Country of ref document: EP |
WWE | WIPO information: entry into national phase | Ref document number: 1020167036222; Country of ref document: KR. Ref document number: 2015795248; Country of ref document: EP |
NENP | Non-entry into the national phase | Ref country code: DE |