
Generalized Feedback Loop for Joint Hand-Object Pose Estimation

Published: 01 August 2020, in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 42, no. 8 (IEEE Computer Society)

Abstract

We propose an approach for estimating the 3D pose of a hand, possibly manipulating an object, from a depth image. We show that the mistakes made by a Convolutional Neural Network trained to predict an initial estimate of the 3D pose can be corrected by a feedback loop. The components of this feedback loop are themselves deep networks, optimized on training data. The approach generalizes to a hand interacting with an object, allowing us to jointly estimate the 3D pose of the hand and the 3D pose of the object. Our approach performs on par with state-of-the-art methods for 3D hand pose estimation, and outperforms state-of-the-art methods for joint hand-object pose estimation when using depth images only. It is also efficient: our implementation runs in real time on a single GPU.
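
The predict-synthesize-update loop the abstract describes can be made concrete with a short sketch. The PyTorch version below is a minimal illustration, not the authors' implementation: the three stand-in networks (`Predictor`, `Synthesizer`, `Updater`), their architectures, the 21-joint pose parameterization, and the 64x64 input size are all our assumptions.

```python
# Illustrative sketch of a predict-synthesize-update feedback loop.
import torch
import torch.nn as nn

NUM_JOINTS = 21   # assumption: 21 hand joints, 3 coordinates each
IMG = 64          # assumption: 64x64 preprocessed depth crop


class Predictor(nn.Module):
    """Maps a depth image to an initial pose estimate (NUM_JOINTS * 3 values)."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1, 16, 5, stride=2), nn.ReLU(),
            nn.Conv2d(16, 32, 5, stride=2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(32, NUM_JOINTS * 3),
        )

    def forward(self, depth):
        return self.net(depth)


class Synthesizer(nn.Module):
    """Learned 'renderer': generates a synthetic depth image from a pose."""
    def __init__(self):
        super().__init__()
        self.net = nn.Linear(NUM_JOINTS * 3, IMG * IMG)

    def forward(self, pose):
        return self.net(pose).view(-1, 1, IMG, IMG)


class Updater(nn.Module):
    """Predicts a pose correction from the observed and synthesized images."""
    def __init__(self):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(2, 16, 5, stride=2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        self.fc = nn.Linear(16 + NUM_JOINTS * 3, NUM_JOINTS * 3)

    def forward(self, depth, synth, pose):
        feat = self.conv(torch.cat([depth, synth], dim=1))
        return self.fc(torch.cat([feat, pose], dim=1))


def refine(depth, predictor, synthesizer, updater, num_iters=2):
    """Iteratively improve the initial prediction with the feedback loop."""
    pose = predictor(depth)
    for _ in range(num_iters):
        synth = synthesizer(pose)                   # render current hypothesis
        pose = pose + updater(depth, synth, pose)   # apply predicted correction
    return pose


if __name__ == "__main__":
    depth = torch.randn(1, 1, IMG, IMG)  # stand-in for a real depth crop
    pose = refine(depth, Predictor(), Synthesizer(), Updater())
    print(pose.shape)  # torch.Size([1, 63])
```

Under this reading, generalizing to joint hand-object estimation amounts to extending the pose vector with the object's pose parameters and training the synthesizer and updater on hand-object depth images, so the same correction loop refines both poses together.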

