DOI: 10.1145/3338286.3344396
research-article

Semantic 3D gaze mapping for estimating focused objects

Published: 01 October 2019

Abstract

Eye-trackers are expected to be used in portable daily-use devices. However, for human-computer interaction and quantitative analysis, object information must be registered and a unified coordinate system defined in advance. We therefore propose semantic 3D gaze mapping, which collects gaze information from multiple people on a unified map and detects focused objects automatically. The semantic 3D map is reconstructed using keyframe-based semantic segmentation and structure-from-motion, and the 3D point-of-gaze is computed on the same map. An experiment confirmed that the fixation time on a focused object can be calculated without prior information.
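The abstract compresses the method into a few clauses, so a small illustration may help. The sketch below shows one plausible reading of the final step: casting a gaze ray from a posed camera through the 2D point-of-gaze and picking the semantic label of the nearest reconstructed map point. Everything here is an assumption for illustration (the intrinsics K, the world-to-camera pose R, t, and per-point labels transferred from keyframe segmentation masks); it is not the authors' implementation.

```python
import numpy as np

def gaze_ray(K, R, t, gaze_px):
    """Back-project a 2D point-of-gaze (pixels) into a world-space ray.

    K: 3x3 camera intrinsics; R, t: world-to-camera rotation and translation.
    Returns (origin, direction) of the gaze ray in world coordinates.
    """
    origin = -R.T @ t                              # camera center: C = -R^T t
    uv1 = np.array([gaze_px[0], gaze_px[1], 1.0])  # homogeneous pixel
    d = R.T @ (np.linalg.inv(K) @ uv1)             # pixel -> world direction
    return origin, d / np.linalg.norm(d)

def focused_label(points_xyz, labels, origin, direction, max_dist=0.05):
    """Return the semantic label of the SfM point nearest to the gaze ray.

    points_xyz: (N, 3) reconstructed points; labels: (N,) semantic class per
    point, e.g., transferred from keyframe segmentation masks.
    """
    v = points_xyz - origin                      # origin -> point vectors
    t_along = np.clip(v @ direction, 0.0, None)  # ignore points behind camera
    closest = origin + np.outer(t_along, direction)
    dist = np.linalg.norm(points_xyz - closest, axis=1)
    i = int(np.argmin(dist))
    return labels[i] if dist[i] < max_dist else None
```

A mesh- or Delaunay-based intersection would be more robust than this nearest-point lookup; the nearest-point form is used only to keep the sketch short.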


Cited By

  • (2021) 4D Attention: Comprehensive Framework for Spatio-Temporal Gaze Mapping. IEEE Robotics and Automation Letters 6(4), 7240-7247. DOI: 10.1109/LRA.2021.3097274. Online publication date: Oct 2021.

Reviews

Steve Carson

Knowing where users focus their gaze in real, virtual, augmented, or mixed environments is essential for creating immersive applications in areas such as gaming, product design, psychology, and advertising. In multiuser environments, knowing the 3D point-of-gaze (PoG) of each user is a key enabler of immersive interaction. Building such systems is difficult when they require detailed modeling of part of the real world and association of its elements with a virtual world, which makes systems that construct their own models from sensory data highly desirable.

Previous research has used a wide variety of hardware (virtual reality headsets with eye trackers, head and body position trackers, video cameras) and software techniques (structure from motion, simultaneous localization and mapping, and keyframes) to determine the objects on which a user's 3D PoG falls, at the cost of complex and time-consuming setup. The authors cleverly leverage this prior work, combining multiple open-source software packages to derive semantic 3D gaze mapping, that is, a map of user attention to objects in the local environment, for estimating user focus without prior information.

The main contributions of this paper are its excellent descriptive figures, well-defined algorithms, logical expository flow, and presentation of the mathematical details of computing a 3D PoG from multiple 2D PoGs projected into keyframes by Delaunay triangulation. The authors also conduct an experiment showing that their method can compute approximate user fixation time on objects, making it suitable for marketing and advertising research.
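The review highlights two concrete technical pieces: recovering a 3D PoG from multiple 2D PoGs projected into keyframes (the paper reportedly uses Delaunay triangulation, which is not reproduced here), and computing per-object fixation time. As a generic stand-in for the first piece, the sketch below uses standard linear (DLT) multi-view triangulation; the second is a toy accumulator. Projection matrices, pixel coordinates, and per-frame labels are all assumed inputs, not the paper's interface.

```python
import numpy as np

def triangulate_pog(projections, gaze_points):
    """Linear (DLT) triangulation of one 3D point-of-gaze.

    projections: list of 3x4 matrices P_i = K_i [R_i | t_i], one per keyframe.
    gaze_points: matching list of 2D PoGs (u_i, v_i) seen in each keyframe.
    Solves A X = 0 for the homogeneous point X via SVD.
    """
    rows = []
    for P, (u, v) in zip(projections, gaze_points):
        rows.append(u * P[2] - P[0])  # each view adds two linear constraints
        rows.append(v * P[2] - P[1])
    _, _, vt = np.linalg.svd(np.stack(rows))
    X = vt[-1]                        # null vector of A
    return X[:3] / X[3]               # dehomogenize

def fixation_times(labels_per_frame, frame_dt):
    """Accumulate fixation time (seconds) per focused-object label.

    labels_per_frame: one semantic label per video frame (None = no hit).
    frame_dt: frame period in seconds, e.g., 1/30 for a 30 Hz tracker.
    """
    times = {}
    for lab in labels_per_frame:
        if lab is not None:
            times[lab] = times.get(lab, 0.0) + frame_dt
    return times
```

With poses from structure-from-motion and two or more keyframe observations, triangulate_pog yields a world-space PoG that can be labeled against the semantic map; feeding the per-frame labels to fixation_times gives the kind of approximate fixation durations the experiment reports.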




Published In

MobileHCI '19: Proceedings of the 21st International Conference on Human-Computer Interaction with Mobile Devices and Services
October 2019
646 pages
ISBN:9781450368254
DOI:10.1145/3338286
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. 3D point-of-gaze
  2. Semantic 3D map
  3. Structure from Motion

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

MobileHCI '19

Acceptance Rates

Overall Acceptance Rate 202 of 906 submissions, 22%
