Sun et al., 2019 - Google Patents

Foodtracker: A real-time food detection mobile application by deep convolutional neural networks

Document ID: 3257882570996283693
Author: Sun J; Radecka K; Zilic Z
Publication year: 2019
Publication venue: arXiv preprint arXiv:1909.05994

Snippet

We present a mobile application that recognizes the food items in a multi-object meal from a single image in real time and then returns the nutrition facts, with components and approximate amounts. Our work is organized in two parts. First, we build a deep …

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/20—Image acquisition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00335—Recognising movements or behaviour, e.g. recognition of gestures, dynamic facial expressions; Lip-reading
- G06K9/00355—Recognition of hand or arm movements, e.g. recognition of deaf sign language
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR

Similar Documents

Publication	Publication Date	Title
Sun et al.	2019	Foodtracker: A real-time food detection mobile application by deep convolutional neural networks
Wang et al.	2022	A review on vision-based analysis for automatic dietary assessment
Zhen et al.	2020	Smap: Single-shot multi-person absolute 3d pose estimation
Chen et al.	2014	Real‐time hand gesture recognition using finger segmentation
Emeršič et al.	2018	Convolutional encoder–decoder networks for pixel‐wise ear detection and segmentation
Lu et al.	2021	[Retracted] Face Detection and Recognition Algorithm in Digital Image Based on Computer Vision Sensor
Shen et al.	2020	Defect detection of printed circuit board based on lightweight deep convolution network
Vishwakarma et al.	2015	Integrated approach for human action recognition using edge spatial distribution, direction pixel and ℜ-transform
Huu et al.	2021	Hand gesture recognition algorithm using SVM and HOG model for control of robotic system
Yi et al.	2018	Motion keypoint trajectory and covariance descriptor for human action recognition
Liu et al.	2017	Study of human action recognition based on improved spatio-temporal features
Yu	2021	Emotion monitoring for preschool children based on face recognition and emotion recognition algorithms
Zhang et al.	2015	Retargeting semantically-rich photos
Ji et al.	2014	Study of human action recognition based on improved spatio-temporal features
Xu et al.	2020	Hand segmentation pipeline from depth map: An integrated approach of histogram threshold selection and shallow CNN classification
Bekhet et al.	2021	A robust deep learning approach for glasses detection in non‐standard facial images
Yuan et al.	2024	FGNet: Fixation guidance network for salient object detection
Tang et al.	2022	Using a selective ensemble support vector machine to fuse multimodal features for human action recognition
Cui et al.	2019	Face recognition using total loss function on face database with ID photos
Chen et al.	2024	Real‐time ergonomic risk assessment in construction using a co‐learning‐powered 3D human pose estimation model
Tang et al.	2020	Using a multilearner to fuse multimodal features for human action recognition
Parashar et al.	2022	A robust covariate‐invariant gait recognition based on pose features
Yu et al.	2017	Recognition of human continuous action with 3D CNN
Wang et al.	2019	2D hand detection using multi-feature skin model supervised cascaded CNN
Li et al.	2017	Human interaction recognition fusing multiple features of depth sequences