Feng et al., 2020 - Google Patents

Compression for text detection and recognition based on low bit-width quantization

Feng et al., 2020

Document ID: 15217911702312136300
Author: Feng S; Cao J; Luo Y; Dai Z; Zhang Y; Wang Y
Publication year: 2020
Publication venue: 2020 IEEE 5th International Conference on Signal and Image Processing (ICSIP)

External Links

Cited by

Snippet

In recent years, with the development of Neural Network, it has made a significant breakthrough in the field of text detection and recognition. However, large-scale deep Neural Network needs a large amount of storage space and computing resources, which …

Continue reading at ieeexplore.ieee.org (other versions)

238000001514 detection method 0 title abstract description 40

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6232—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods
- G06K9/6247—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods based on an approximation criterion, e.g. principal component analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G06K9/4642—Extraction of features or characteristics of the image by performing operations within image blocks or by using histograms
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/68—Methods or arrangements for recognition using electronic means using sequential comparisons of the image signals with a plurality of references in which the sequence of the image signals or the references is relevant, e.g. addressable memory
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00281—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/20—Image acquisition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding, e.g. from bit-mapped to non bit-mapped
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass

Similar Documents

Publication	Publication Date	Title
CN106650813B (en)	2019-11-15	A kind of image understanding method based on depth residual error network and LSTM
CN110188227B (en)	2022-11-18	Hash image retrieval method based on deep learning and low-rank matrix optimization
US20190180154A1 (en)	2019-06-13	Text recognition using artificial intelligence
CN110866471A (en)	2020-03-06	Face image quality evaluation method and device, computer readable medium and communication terminal
CN111738169A (en)	2020-10-02	Handwriting formula recognition method based on end-to-end network model
CN110929080A (en)	2020-03-27	Optical remote sensing image retrieval method based on attention and generation countermeasure network
CN113516152B (en)	2024-04-16	Image description method based on composite image semantics
WO2023173552A1 (en)	2023-09-21	Establishment method for target detection model, application method for target detection model, and device, apparatus and medium
CN108197707A (en)	2018-06-22	Compression method based on the convolutional neural networks that global error is rebuild
CN110599502A (en)	2019-12-20	Skin lesion segmentation method based on deep learning
US11568140B2 (en)	2023-01-31	Optical character recognition using a combination of neural network models
CN113920516A (en)	2022-01-11	Calligraphy character skeleton matching method and system based on twin neural network
CN111079374A (en)	2020-04-28	Font generation method, device and storage medium
CN111935487B (en)	2022-08-12	Image compression method and system based on video stream detection
CN109508640A (en)	2019-03-22	Crowd emotion analysis method and device and storage medium
CN110414516B (en)	2022-02-01	Single Chinese character recognition method based on deep learning
Feng et al.	2020	Compression for text detection and recognition based on low bit-width quantization
CN116343109A (en)	2023-06-27	Text pedestrian searching method based on self-supervision mask model and cross-mode codebook
CN118522039B (en)	2024-10-18	Frame extraction pedestrian retrieval method based on YOLOv s and stage type regular combined pedestrian re-recognition
Liu et al.	2021	Video action recognition with visual privacy protection based on compressed sensing
CN110033077A (en)	2019-07-19	Neural network training method and device
Zhang et al.	2013	Laplacian affine sparse coding with tilt and orientation consistency for image classification
WO2023185209A1 (en)	2023-10-05	Model pruning
Jiao et al.	2018	Realization and improvement of object recognition system on raspberry pi 3b+
Bui et al.	2019	Automatic synthetic document image generation using generative adversarial networks: application in mobile-captured document analysis