[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Enhanced 3D Shape Reconstruction With Knowledge Graph of Category Concept

Published: 04 March 2022 Publication History

Abstract

Reconstructing three-dimensional (3D) objects from images has attracted increasing attention due to its wide applications in computer vision and robotic tasks. Despite the promising progress of recent deep learning–based approaches, which directly reconstruct the full 3D shape without considering the conceptual knowledge of the object categories, existing models have limited usage and usually create unrealistic shapes. 3D objects have multiple forms of representation, such as 3D volume, conceptual knowledge, and so on. In this work, we show that the conceptual knowledge for a category of objects, which represents objects as prototype volumes and is structured by graph, can enhance the 3D reconstruction pipeline. We propose a novel multimodal framework that explicitly combines graph-based conceptual knowledge with deep neural networks for 3D shape reconstruction from a single RGB image. Our approach represents conceptual knowledge of a specific category as a structure-based knowledge graph. Specifically, conceptual knowledge acts as visual priors and spatial relationships to assist the 3D reconstruction framework to create realistic 3D shapes with enhanced details. Our 3D reconstruction framework takes an image as input. It first predicts the conceptual knowledge of the object in the image, then generates a 3D object based on the input image and the predicted conceptual knowledge. The generated 3D object satisfies the following requirements: (1) it is consistent with the predicted graph in concept, and (2) consistent with the input image in geometry. Extensive experiments on public datasets (i.e.,  ShapeNet, Pix3D, and Pascal3D+) with 13 object categories show that (1) our method outperforms the state-of-the-art methods, (2) our prototype volume-based conceptual knowledge representation is more effective, and (3) our pipeline-agnostic approach can enhance the reconstruction quality of various 3D shape reconstruction pipelines.

References

[1]
Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, et al. 2016. tensorflow: A system for large-scale machine learning. In USENIX Symposium on Operating Systems Design and Implementation. 265–283.
[2]
Hassan Afzal, Djamila Aouada, Bruno Mirbach, and Björn Ottersten. 2018. Full 3D reconstruction of non-rigidly deforming objects. ACM Transactions on Multimedia Computing, Communications, and Applications 14, 1s (2018), 1–23.
[3]
Amir Arsalan Soltani, Haibin Huang, Jiajun Wu, Tejas D. Kulkarni, and Joshua B. Tenenbaum. 2017. Synthesizing 3D shapes via modeling multi-view depth maps and silhouettes with deep generative networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Honolulu. IEEE. 1511–1519.
[4]
Cesar Cadena, Luca Carlone, Henry Carrillo, Yasir Latif, Davide Scaramuzza, José Neira, Ian Reid, and John J. Leonard. 2016. Past, present, and future of simultaneous localization and mapping: Toward the robust-perception age. IEEE Transactions on Robotics 32, 6 (2016), 1309–1332.
[5]
Angel X. Chang, Thomas Funkhouser, Leonidas Guibas, Pat Hanrahan, Qixing Huang, Zimo Li, Silvio Savarese, Manolis Savva, Shuran Song, Hao Su, et al. 2015. shapenet: An information-rich 3D model repository. arXiv:1512.03012.
[6]
Hao Chen, Kunyang Sun, Zhi Tian, Chunhua Shen, Yongming Huang, and Youliang Yan. 2020. Blendmask: Top-down meets bottom-up for instance segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Virtual. IEEE. 8573–8581.
[7]
Zhiqin Chen and Hao Zhang. 2019. Learning implicit fields for generative shape modeling. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Long Beach. IEEE. 5939–5948.
[8]
K. M. G. Cheung, Simon Baker, and Takeo Kanade. 2003. Shape-from-silhouette of articulated objects and its use for human body kinematics estimation and motion capture. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Vol. 1, Wisconsin. IEEE. 1–1.
[9]
Seonghwa Choi, Anh-Duc Nguyen, Jinwoo Kim, Sewoong Ahn, and Sanghoon Lee. 2019. Point cloud deformation for single image 3d reconstruction. In IEEE International Conference on Image Processing (ICIP’19). Taipei. IEEE. 2379–2383.
[10]
Christopher B. Choy, Danfei Xu, JunYoung Gwak, Kevin Chen, and Silvio Savarese. 2016. 3D-R2N2: A unified approach for single and multi-view 3D object reconstruction. In Proceedings of the European Conference on Computer Vision(Lecture Notes in Computer Science), Vol. 9912, Springer. Amsterdam. Springer. 628–644.
[11]
Angela Dai, Charles Ruizhongtai Qi, and Matthias Nießner. 2017. Shape completion using 3D-encoder-predictor CNNs and shape synthesis. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Honolulu. IEEE. 5868–5877.
[12]
Hugh Durrant-Whyte and Tim Bailey. 2006. Simultaneous localization and mapping. IEEE Robotics & Automation Magazine 13, 2 (2006), 99–110.
[13]
Haoqiang Fan, Hao Su, and Leonidas J. Guibas. 2017. A point set generation network for 3D object reconstruction from a single image. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Honolulu. IEEE. 605–613.
[14]
Georgia Gkioxari, Jitendra Malik, and Justin Johnson. 2019. Mesh R-CNN. In Proceedings of the IEEE International Conference on Computer Vision. Long Beach. IEEE. 9785–9795.
[15]
Thibault Groueix, Matthew Fisher, Vladimir G. Kim, Bryan C. Russell, and Mathieu Aubry. 2018. A papier-mâché approach to learning 3D surface generation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City. IEEE. 216–224.
[16]
Xiaoguang Han, Zhen Li, Haibin Huang, Evangelos Kalogerakis, and Yizhou Yu. 2017. High-resolution shape completion using deep neural networks for global structure and local geometry inference. In Proceedings of the IEEE International Conference on Computer Vision. Honolulu. IEEE. 85–93.
[17]
Xian-Feng Han, Hamid Laga, and Mohammed Bennamoun. 2019. Image-based 3D object reconstruction: State-of-the-art and trends in the deep learning era. IEEE Transactions on Pattern Analysis and Machine Intelligence 43, 5 (2019), 1578–1604.
[18]
Christian Häne, Shubham Tulsiani, and Jitendra Malik. 2017. Hierarchical surface prediction for 3D object reconstruction. In International Conference on 3D Vision. Qingdao. IEEE. 412–420.
[19]
Richard Hartley and Andrew Zisserman. 2003. Multiple View Geometry in Computer Vision. Cambridge University Press.
[20]
Eldar Insafutdinov and Alexey Dosovitskiy. 2018. Unsupervised learning of shape and pose with differentiable point clouds. In Advances in Neural Information Processing Systems. Montréal. MIT Press. 2802–2812.
[21]
Shahram Izadi, David Kim, Otmar Hilliges, David Molyneaux, Richard Newcombe, Pushmeet Kohli, Jamie Shotton, Steve Hodges, Dustin Freeman, Andrew Davison, et al. 2011. KinectFusion: Real-time 3D reconstruction and interaction using a moving depth camera. In Proceedings of the 24th Annual ACM Symposium on User Interface Software and Technology. Santa Barbara. ACM. 559–568.
[22]
Adrian Johnston, Ravi Garg, Gustavo Carneiro, Ian Reid, and Anton van den Hengel. 2017. Scaling CNNs for high resolution volumetric reconstruction from a single image. In Proceedings of the IEEE International Conference on Computer Vision Workshops. Venice. IEEE. 939–948.
[23]
Angjoo Kanazawa, Shubham Tulsiani, Alexei A. Efros, and Jitendra Malik. 2018. Learning category-specific mesh reconstruction from image collections. In Proceedings of the European Conference on Computer Vision (Lecture Notes in Computer Science), Vol. 11219, Munich. Springer. 386–402.
[24]
Abhishek Kar, Christian Häne, and Jitendra Malik. 2017. Learning a multi-view stereo machine. In Advances in Neural Information Processing Systems. Long Beach. MIT Press, 365–376.
[25]
Abhishek Kar, Shubham Tulsiani, Joao Carreira, and Jitendra Malik. 2015. Category-specific object reconstruction from a single image. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Boston. IEEE. 1966–1974.
[26]
Hiroharu Kato and Tatsuya Harada. 2019. Learning view priors for single-view 3D reconstruction. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Long Beach. IEEE. 9778–9787.
[27]
Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv:1412.6980 (2014).
[28]
Andrey Kurenkov, Jingwei Ji, Animesh Garg, Viraj Mehta, Junyoung Gwak, Christopher Choy, and Silvio Savarese. 2018. Deformnet: Free-form deformation network for 3D shape reconstruction from a single image. In Proceedings of the IEEE Winter Conference on Applications of Computer Vision. Lake Tahoe. IEEE. 858–866.
[29]
Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. 2015. Deep learning. Nature 521, 7553 (2015), 436–444.
[30]
Jie Li, Kai Han, Peng Wang, Yu Liu, and Xia Yuan. 2020. Anisotropic convolutional networks for 3D semantic scene completion. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Virtual. IEEE. 3351–3359.
[31]
Yi-Lun Liao, Yao-Cheng Yang, Yuan-Fang Lin, Pin-Jung Chen, Chia-Wen Kuo, Wei-Chen Chiu, and Yu-Chiang Frank Wang. 2019. Learning pose-aware 3D reconstruction via 2D-3D self-consistency. In IEEE International Conference on Acoustics, Speech and Signal Processing. Brighton. IEEE. 3857–3861.
[32]
Chen-Hsuan Lin, Chen Kong, and Simon Lucey. 2018. Learning efficient point cloud generation for dense 3D object reconstruction. In Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 32, New Orleans. AAAI, 1–1.
[33]
Donald J. R. Meagher. 1980. Octree Encoding: A New Technique for the Representation, Manipulation and Display of Arbitrary 3D Objects by Computer. Electrical and Systems Engineering Department, Rensseiaer Polytechnic.
[34]
Kaichun Mo, Shilin Zhu, Angel X. Chang, Li Yi, Subarna Tripathi, Leonidas J. Guibas, and Hao Su. 2019. partnet: A large-scale benchmark for fine-grained and hierarchical part-level 3D object understanding. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Long Beach. IEEE. 909–918.
[35]
K. L. Navaneet, Priyanka Mandikal, Mayank Agarwal, and R. Venkatesh Babu. 2019. capnet: Continuous approximation projection for 3D point cloud reconstruction using 2D supervision. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33, Honolulu. AAAI. 8819–8826.
[36]
Richard A. Newcombe, Shahram Izadi, Otmar Hilliges, David Molyneaux, David Kim, Andrew J. Davison, Pushmeet Kohi, Jamie Shotton, Steve Hodges, and Andrew Fitzgibbon. 2011. KinectFusion: Real-time dense surface mapping and tracking. In IEEE International Symposium on Mixed and Augmented Reality. Basel. IEEE. 127–136.
[37]
Anh-Duc Nguyen, Seonghwa Choi, Woojae Kim, and Sanghoon Lee. 2019. Graphx-convolution for point cloud deformation in 2D-to-3D conversion. In Proceedings of the IEEE/CVF International Conference on Computer Vision. Seoul. IEEE. 8628–8637.
[38]
Pietro Pala and Stefano Berretti. 2019. Reconstructing 3D face models by incremental aggregation and refinement of depth frames. ACM Transactions on Multimedia Computing, Communications, and Applications 15, 1 (2019), 1–24.
[39]
Junyi Pan, Xiaoguang Han, Weikai Chen, Jiapeng Tang, and Kui Jia. 2019. Deep mesh reconstruction from single RGB images via topology modification networks. In Proceedings of the IEEE International Conference on Computer Vision. Seoul. IEEE. 9964–9973.
[40]
Junyi Pan, Jun Li, Xiaoguang Han, and Kui Jia. 2018. Residual meshnet: Learning to deform meshes for single-view 3D reconstruction. In International Conference on 3D Vision. Verona. IEEE. 719–727.
[41]
Yun-he Pan. 2019. On visual knowledge. Frontiers of Information Technology & Electronic Engineering 20, 8 (2019), 1021–1025.
[42]
Pedro O. Pinheiro, Negar Rostamzadeh, and Sungjin Ahn. 2019. Domain-adaptive single-view 3D reconstruction. In Proceedings of the IEEE/CVF International Conference on Computer Vision. Seoul. IEEE. 7638–7647.
[43]
Stephan R. Richter and Stefan Roth. 2015. Discriminative shape from shading in uncalibrated illumination. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Boston. IEEE. 1128–1136.
[44]
Stephan R. Richter and Stefan Roth. 2018. Matryoshka networks: Predicting 3D geometry via nested shape layers. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City. IEEE. 1936–1944.
[45]
Dingfeng Shi, Yifan Zhao, and Jia Li. 2020. Reconstructing part-level 3D models from a single image. In 2020 IEEE International Conference on Multimedia and Expo. Virtual. IEEE. 1–6.
[46]
Edward Smith, Scott Fujimoto, Adriana Romero, and David Meger. 2019. GEOMetrics: Exploiting geometric structure for graph-encoded objects. In Proceedings of the International Conference on Machine Learning, Vol. 97, Long Beach. ACM. 5866–5876.
[47]
David Stutz and Andreas Geiger. 2020. Learning 3D shape completion under weak supervision. International Journal of Computer Vision 128, 5 (2020), 1162–1181.
[48]
Xingyuan Sun, Jiajun Wu, Xiuming Zhang, Zhoutong Zhang, Chengkai Zhang, Tianfan Xue, Joshua B. Tenenbaum, and William T. Freeman. 2018. Pix3D: Dataset and methods for single-image 3D shape modeling. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City. IEEE. 2974–2983.
[49]
Jiapeng Tang, Xiaoguang Han, Junyi Pan, Kui Jia, and Xin Tong. 2019. A skeleton-bridged deep learning approach for generating meshes of complex topologies from single RGB images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Long Beach. IEEE. 4541–4550.
[50]
Maxim Tatarchenko, Alexey Dosovitskiy, and Thomas Brox. 2017. Octree generating networks: Efficient convolutional architectures for high-resolution 3D outputs. In Proceedings of the IEEE International Conference on Computer Vision. Venice. IEEE. 2088–2096.
[51]
Shubham Tulsiani, Alexei A. Efros, and Jitendra Malik. 2018. Multi-view consistency as supervisory signal for learning shape and pose prediction. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City. IEEE. 2897–2905.
[52]
Shubham Tulsiani, Tinghui Zhou, Alexei A. Efros, and Jitendra Malik. 2017. Multi-view supervision for single-view reconstruction via differentiable ray consistency. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2626–2634.
[53]
Hanqing Wang, Jiaolong Yang, Wei Liang, and Xin Tong. 2019. Deep single-view 3D object reconstruction with visual hull embedding. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33, Honolulu. AAAI, 8941–8948.
[54]
Jianren Wang and Zhaoyuan Fang. 2020. GSIR: Generalizable 3D shape interpretation and reconstruction. In European Conference on Computer Vision. Virtual. Springer. 498–514.
[55]
Jinglu Wang, Bo Sun, and Yan Lu. 2019. MVPNet: Multi-view point regression networks for 3D object reconstruction from a single image. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33, Honolulu. AAAI, 8949–8956.
[56]
Meng Wang, Lingjing Wang, and Yi Fang. 2017. 3DensiNet: A robust neural network architecture towards 3D volumetric object prediction from 2D image. In Proceedings of the ACM International Conference on Multimedia. Mountain View. ACM. 961–969.
[57]
Nanyang Wang, Yinda Zhang, Zhuwen Li, Yanwei Fu, Hang Yu, Wei Liu, Xiangyang Xue, and Yu-Gang Jiang. 2020. Pixel2Mesh: 3D mesh model generation via image guided deformation. IEEE Transactions on Pattern Analysis and Machine Intelligence (2020).
[58]
Weiyue Wang, Qiangui Huang, Suya You, Chao Yang, and Ulrich Neumann. 2017. Shape inpainting using 3D generative adversarial network and recurrent convolutional networks. In Proceedings of the IEEE International Conference on Computer Vision. Venice. IEEE. 2298–2306.
[59]
Matthew J. Westoby, James Brasington, Niel F. Glasser, Michael J. Hambrey, and Jennifer M. Reynolds. 2012. ‘structure-from-motion’ photogrammetry: A low-cost, effective tool for geoscience applications. Geomorphology 179 (2012), 300–314.
[60]
Andrew P. Witkin. 1981. Recovering surface shape and orientation from texture. Artificial Intelligence 17, 1-3 (1981), 17–45.
[61]
Jiajun Wu, Yifan Wang, Tianfan Xue, Xingyuan Sun, Bill Freeman, and Josh Tenenbaum. 2017. MarrNet: 3D shape reconstruction via 2.5D sketches. In Advances in Neural Information Processing Systems, Long Beach. MIT Press, 540–550.
[62]
Jiajun Wu, Chengkai Zhang, Tianfan Xue, Bill Freeman, and Josh Tenenbaum. 2016. Learning a probabilistic latent space of object shapes via 3D generative-adversarial modeling. In Advances in Neural Information Processing Systems. 82–90.
[63]
Jiajun Wu, Chengkai Zhang, Xiuming Zhang, Zhoutong Zhang, William T. Freeman, and Joshua B. Tenenbaum. 2018. Learning shape priors for single-view 3D completion and reconstruction. In Proceedings of the European Conference on Computer Vision(Lecture Notes in Computer Science), Vol. 11215, Springer. 673–691.
[64]
Zhirong Wu, Shuran Song, Aditya Khosla, Fisher Yu, Linguang Zhang, Xiaoou Tang, and Jianxiong Xiao. 2015. 3D ShapeNets: A deep representation for volumetric shapes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Boston. IEEE. 1912–1920.
[65]
Nan Xiang, Li Wang, Tao Jiang, Yanran Li, Xiaosong Yang, and Jianjun Zhang. 2019. Single-image mesh reconstruction and pose estimation via generative normal map. In Proceedings of the International Conference on Computer Animation and Social Agents. Paris. ACM. 79–84.
[66]
Yu Xiang, Roozbeh Mottaghi, and Silvio Savarese. 2014. Beyond Pascal: A benchmark for 3D object detection in the wild. In IEEE Winter Conference on Applications of Computer Vision. Steamboat Springs. IEEE. 75–82.
[67]
Haozhe Xie, Hongxun Yao, Xiaoshuai Sun, Shangchen Zhou, and Shengping Zhang. 2019. Pix2Vox: Context-aware 3D reconstruction from single and multi-view images. In Proceedings of the IEEE International Conference on Computer Vision. Seoul. IEEE. 2690–2698.
[68]
Haozhe Xie, Hongxun Yao, Shengping Zhang, Shangchen Zhou, and Wenxiu Sun. 2020. Pix2Vox++: Multi-scale context-aware 3D object reconstruction from single and multiple images. International Journal of Computer Vision 128, 12 (2020), 2919–2935.
[69]
Xinchen Yan, Jimei Yang, Ersin Yumer, Yijie Guo, and Honglak Lee. 2016. Perspective transformer nets: Learning single-view 3D object reconstruction without 3D supervision. In Advances in Neural Information Processing Systems, Barcelona. MIT Press, 1696–1704.
[70]
Bo Yang, Stefano Rosa, Andrew Markham, Niki Trigoni, and Hongkai Wen. 2018. Dense 3D object reconstruction from a single depth view. IEEE Transactions on Pattern Analysis and Machine Intelligence 41, 12 (2018), 2820–2834.
[71]
Guandao Yang, Yin Cui, Serge Belongie, and Bharath Hariharan. 2018. Learning single-view 3D reconstruction with limited pose supervision. In Proceedings of the European Conference on Computer Vision(Lecture Notes in Computer Science), Vol. 11219, Munich. Springer, 90–105.
[72]
Shuo Yang, Min Xu, Haozhe Xie, Stuart Perry, and Jiahao Xia. 2021. Single-view 3D object reconstruction from shape priors in memory. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE. 3152–3161.
[73]
Chun-Han Yao, Wei-Chih Hung, Varun Jampani, and Ming-Hsuan Yang. 2021. Discovering 3D parts from image collections. In Proceedings of the IEEE International Conference on Computer Vision. IEEE. 12981–12990.
[74]
Yuan Yao, Nico Schertler, Enrique Rosales, Helge Rhodin, Leonid Sigal, and Alla Sheffer. 2020. start hereFront2Back: Single view 3D shape reconstruction via front to back prediction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Virtual. IEEE. 531–540.
[75]
Chen Zhang. 2019. CuFusion2: Accurate and denoised volumetric 3D object reconstruction using depth cameras. IEEE Access 7 (2019), 49882–49893.
[76]
Chen Zhang and Yu Hu. 2017. CuFusion: Accurate real-time camera tracking and volumetric scene reconstruction with a cuboid. Sensors 17, 10 (2017), 2260.
[77]
Qian-Yi Zhou and Vladlen Koltun. 2015. Depth camera tracking with contour cues. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Boston. IEEE. 632–638.

Cited By

View all
  • (2024)A Low-Density Parity-Check Coding Scheme for LoRa NetworkingACM Transactions on Sensor Networks10.1145/366592820:4(1-29)Online publication date: 8-Jul-2024
  • (2024)ChirpTransformer: Versatile LoRa Encoding for Low-power Wide-area IoTProceedings of the 22nd Annual International Conference on Mobile Systems, Applications and Services10.1145/3643832.3661861(479-491)Online publication date: 3-Jun-2024
  • (2024)Context-detail-aware United Network for Single Image DerainingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/363940720:5(1-18)Online publication date: 22-Jan-2024
  • Show More Cited By

Index Terms

  1. Enhanced 3D Shape Reconstruction With Knowledge Graph of Category Concept

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Multimedia Computing, Communications, and Applications
      ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 18, Issue 3
      August 2022
      478 pages
      ISSN:1551-6857
      EISSN:1551-6865
      DOI:10.1145/3505208
      Issue’s Table of Contents

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 04 March 2022
      Received: 01 December 2021
      Accepted: 01 September 2021
      Revised: 01 August 2021
      Published in TOMM Volume 18, Issue 3

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Deep learning
      2. 3D reconstruction
      3. conceptual knowledge

      Qualifiers

      • Research-article
      • Refereed

      Funding Sources

      • National Key Research and Development Program of China
      • National Natural Science Foundation of China

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)116
      • Downloads (Last 6 weeks)9
      Reflects downloads up to 13 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)A Low-Density Parity-Check Coding Scheme for LoRa NetworkingACM Transactions on Sensor Networks10.1145/366592820:4(1-29)Online publication date: 8-Jul-2024
      • (2024)ChirpTransformer: Versatile LoRa Encoding for Low-power Wide-area IoTProceedings of the 22nd Annual International Conference on Mobile Systems, Applications and Services10.1145/3643832.3661861(479-491)Online publication date: 3-Jun-2024
      • (2024)Context-detail-aware United Network for Single Image DerainingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/363940720:5(1-18)Online publication date: 22-Jan-2024
      • (2024)Knowledge-guided multi-perception attention network for image dehazingThe Visual Computer: International Journal of Computer Graphics10.1007/s00371-023-03177-240:9(6479-6492)Online publication date: 1-Sep-2024
      • (2023)AquaHelper: Underwater SOS Transmission and Detection in Swimming PoolsProceedings of the 21st ACM Conference on Embedded Networked Sensor Systems10.1145/3625687.3625816(294-307)Online publication date: 12-Nov-2023
      • (2023)An Iterative Semi-supervised Approach with Pixel-wise Contrastive Loss for Road Extraction in Aerial ImagesACM Transactions on Multimedia Computing, Communications, and Applications10.1145/360637420:3(1-21)Online publication date: 10-Nov-2023
      • (2023)LHNet: A Low-cost Hybrid Network for Single Image DehazingProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3612594(7706-7717)Online publication date: 26-Oct-2023
      • (2023)Pedestrian-specific Bipartite-aware Similarity Learning for Text-based Person RetrievalProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3612009(8922-8931)Online publication date: 26-Oct-2023
      • (2023)Exploring the Effect of High-frequency Components in GANs TrainingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/357858519:5(1-22)Online publication date: 16-Mar-2023
      • (2023)A Secure Anonymous Identity-Based Virtual-Space Agreement Method for Crowds-Based Anonymous Communicate SchemeIET Information Security10.1049/2023/82345372023Online publication date: 18-Dec-2023
      • Show More Cited By

      View Options

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Full Text

      View this article in Full Text.

      Full Text

      HTML Format

      View this article in HTML Format.

      HTML Format

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media