default search action
24th ISM 2022: Italy
- IEEE International Symposium on Multimedia, ISM 2022, Naples, Italy, December 5-7, 2022. IEEE 2022, ISBN 978-1-6654-7172-5
- Donghuo Zeng, Yanan Wang, Jianming Wu, Kazushi Ikeda:
Complete Cross-triplet Loss in Label Space for Audio-visual Cross-modal Retrieval. 1-9 - Furkan Kaynar, Adrian Michl, Eckehard G. Steinbach:
Interactive RGB Image Segmentation via Depth-modified Click Encoding and Estimated Depth. 10-17 - Valeri George, Jens Brandenburg, Gabriel Hege, Tobias Hinz, Adam Wieckowski, Benjamin Bross, Detlev Marpe:
Efficient Multi-Threading Strategies in VVenC, an Open and Optimized VVC Encoder Implementation. 18-25 - Antonio M. Rinaldi, Cristiano Russo, Cristian Tommasino:
Effects of Color Stain Normalization in Histopathology Image Retrieval using Deep Learning. 26-33 - Mengchen Xiong, Xiao Xu, Dong Yang, Eckehard G. Steinbach:
Robust Depth Estimation in Foggy Environments Combining RGB Images and mmWave Radar. 34-41 - Birk Torpmann-Hagen, Vajira Thambawita, Michael A. Riegler, Pål Halvorsen, Kyrre Glette:
Segmentation Consistency Training: Out-of-Distribution Generalization for Medical Image Segmentation. 42-49 - Syed Qasim Abbas, S. Jannat Shirazi, Yi-Ping Phoebe Chen:
Aggregated Bidirectional Local Binary Pattern for Robust Perceptual Image Hashing. 50-57 - Mariano Ntrougkas, Nikolaos Gkalelis, Vasileios Mezaris:
TAME: Attention Mechanism Based Feature Fusion for Generating Explanation Maps of Convolutional Neural Networks. 58-65 - Michele Baldassini, Francesco Pistolesi, Beatrice Lazzerini:
Detecting happiness from 14-channel binary-valued EEG charts via deep learning. 66-73 - Abdelrahman Seleem, André F. R. Guarda, Nuno M. M. Rodrigues, Fernando Pereira:
Impact of Conventional and Deep Learning-based Point Cloud Geometry Coding on Deep Learning-based Classification Performance. 74-81 - Florian Schniederjann, Darius Rausch, Jens Wiggenbrock, Robert Mertens:
Teardrop Magnification: A Hybrid Linear-Fisheye Magnifier for the Border and Corner of the Screen. 82-83 - Madjid Maidi, Samir Otmane:
Multimodal 2D/3D Registration for Open Augmented Reality Applications. 84-85 - Matthew Hamilton, Nicholas Wells, Amílcar Soares:
On Requirements for Field of Light Displays to Pass the Visual Turing Test. 86-87 - Sebastian Eger, Rastin Pries, Gábor Sörös, Michael G. Adam, Martin Piccolrovazzi, Eckehard G. Steinbach:
To Sparsify or not to Sparsify: Simplifying Visual Feature Maps for Mobile Agents. 88-91 - Mahmoud Fakhry, Abeer FathAllah Brery, Ascensión Gallardo-Antolín:
Analysis of Heart Sound Signals using Sparse Modeling with Gabor Dictionary. 92-96 - Christopher B. Kuhn, Markus Hofbauer, Bowen Ma, Goran Petrovic, Eckehard G. Steinbach:
Improving Multimodal Object Detection with Individual Sensor Monitoring. 97-104 - Martin Piccolrovazzi, Michael G. Adam, Sebastian Eger, Marsil Zakour, Eckehard G. Steinbach:
Self-Supervised Object Recognition Based on Repeated Re-Capturing of Dynamic Indoor Environments. 105-112 - Nikolaos Gkalelis, Dimitrios Daskalakis, Vasileios Mezaris:
Gated-ViGAT: Efficient Bottom-Up Event Recognition and Explanation Using a New Frame Selection Policy and Gating Mechanism. 113-120 - Shivi Vats, Jounsup Park, Klara Nahrstedt, Michael Zink, Ramesh K. Sitaraman, Hermann Hellwagner:
Semantic-Aware View Prediction for 360-Degree Videos at the 5G Edge. 121-128 - Driton Salihu, Adam Misik, Markus Hofbauer, Eckehard G. Steinbach:
S2CMAF: Multi-Method Assessment Fusion for Scan-to-CAD Methods. 129-136 - Ruiying Yang, María Santamaría, Francesco Cricri, Honglei Zhang, Jani Lainema, Ramin Ghaznavi Youvalari, Miska M. Hannuksela:
Low-precision post-filtering in video coding. 137-140 - Nannan Zou, Francesco Cricri, Honglei Zhang, Hamed R. Tavakoli, Miska M. Hannuksela, Esa Rahtu:
The Lottery Ticket Adaptation for Neural Video Coding. 141-145 - Evlampios Apostolidis, Georgios Balaouras, Vasileios Mezaris, Ioannis Patras:
Explaining video summarization based on the focus of attention. 146-150 - Markus Hofbauer, Christopher B. Kuhn, Goran Petrovic, Eckehard G. Steinbach:
Measuring the Influence of Image Preprocessing on the Rate-Distortion Performance of Video Encoding. 151-152 - Himanshu Gupta, Sowmya Vasuki Jallepalli, Pratik Mulchandani, Chirag Trasikar, Chetan Manjesh, Vishy Swaminathan, Stefano Petrangeli:
Towards Efficient Video Super Resolution for Faster Streaming. 153-154 - Tor-Arne S. Nordmo, Martine Mostervik Espeseth, Bjørn Aslak Juliussen, Michael A. Riegler, Dag Johansen:
Detection of Commercial Fishing-related Slipping Events using Multimodal Data. 155-156 - Ruowei Jiang, Brendan Duke, Frédéric Flament, Parham Aarabi:
Synthesizing ultraviolet skin images via GAN with Gaussian weighted patch blending. 157-158 - Francesco Pistolesi, Michele Baldassini, Beatrice Lazzerini:
A smartphone app to collect emotion-labeled signals in the wild using a body sensor network. 159-160 - Cise Midoglu, Andrea M. Storås, Saeed Shafiee Sabet, Malek Hammou, Steven Alexander Hicks, Inga Strümke, Michael Alexander Riegler, Carsten Griwodz, Pål Halvorsen:
Experiences and Lessons Learned from a Crowdsourced-Remote Hybrid User Survey Framework. 161-162 - Thanh Tran, Sebastian Bader, Jan Lundgren:
An artificial neural network-based system for detecting machine failures using a tiny sound dataset: A case study. 163-168 - Pooja Guhan, Saayan Mitra, Somdeb Sarkhel, Stefano Petrangeli, Ritwik Sinha, Viswanathan Swaminathan, Aniket Bera, Dinesh Manocha:
Contextualized Styling of Images for Web Interfaces using Reinforcement Learning. 169-172 - Burak Kara, Mehmet N. Akcay, Ali C. Begen, Saba Ahsan, Igor D. D. Curcio, Kashyap Kammachi Sreedhar, Emre Aksu:
Benchmarking the Second Edition of the Omnidirectional Media Format Standard. 173-176 - Kashyap Kammachi Sreedhar, Miska M. Hannuksela, Emre B. Aksu, Lauri Ilola, Lukasz Condrad:
Optimizing storage and delivery of Omnidirectional Videos in Viewport-dependent streaming. 177-180 - Xi Guo, Imran Mogra:
Using Web 3D and WebXR Game to Enhance Engagement in Primary School Learning. 181-184 - Michael Lilley, Kapotaksha Das, Kais Riani, Mohamed Abouelenien:
A Topological Approach for Facial Region Segmentation in Thermal Images. 189-193 - Syed Zohaib Hassan, Saeed Shafiee Sabet, Pegah Salehi, Hayley Ko, Ingvild Riiser, Miriam S. Johnson, Gunn Astrid Baugerud, Michael Alexander Riegler, Pål Halvorsen:
A Comparative Study of Interactive Environments for Investigative Interview of A Virtual Child Avatar. 194-201 - Jakub Peschel, Michal Batko, Pavel Zezula:
On Selection of Efficient Sequential Pattern Mining Algorithm Based on Characteristics of Data. 202-205 - Sophia Neamoniti, Vlasios Kasapakis:
Hand Tracking vs Motion Controllers: The effects on Immersive Virtual Reality Game Experience. 206-207 - Georgios-Fotios Angelis, Armando Domi, Alexandros Zamichos, Maria Tsourma, Anastasios Drosou, Dimitrios Tzovaras:
On The Exploration of Vision Transformers in Remote Sensing Building Extraction. 208-215 - Olcay Kursun, Semih Dinç, Oleg V. Favorov:
Contextually Guided Convolutional Neural Networks for Learning Most Transferable Representations. 210-213 - Na Wang, Haoliang Wang, Stefano Petrangeli, Viswanathan Swaminathan, Fei Li, Songqing Chen:
Towards Accurate Positioning in Multiuser Augmented Reality on Mobile Devices. 214-217 - Diogo Gonçalves Silva, Pedro Alexandre Simões dos Santos, João Dias:
Emotionally Expressive Motion Controller for Virtual Character Locomotion Animations. 218-219 - Xi Qi, Lihua Tian, Chen Li, Hui Song, Jiahui Yan:
Singing Melody Extraction Based on Combined Frequency-Temporal Attention and Attentional Feature Fusion with Self-Attention. 220-227 - Zeyu Xiong, Pei-Chun Lin, Amin Farjudian:
Retaining Semantics in Image to Music Conversion. 228-235 - Gurunath Reddy M., Zhe Zhang, Yi Yu, Florian Harscoët, Simon Canales, Suhua Tang:
Deep Attention-Based Alignment Network for Melody Generation from Incomplete Lyrics. 236-239 - Rahul Jaiswal:
Performance Analysis of Deep Learning Based Speech Quality Model with Mixture of Features. 240-244 - Xiao Fu, Xin Yuan, Jinglu Hu:
HSD: A hierarchical singing annotation dataset. 245-246 - Tristin Cory, Razib Iqbal:
Comparison of Multi-Scale Speaker Vectors and S-Vectors for Zero-Shot Speech Synthesis. 247-248 - Georg Wimmer, Rudolf Schraml, Andreas Uhl, Alexander Petutschnigg:
Roundwood Tracking from the Forest to the Sawmill using filter approaches to highlight the annual ring pattern. 249-256 - Weixin Jiang, Gang Wu, Viswanathan Swaminathan, Stefano Petrangeli, Haoliang Wang, Ryan A. Rossi, Nedim Lipka:
Task-Oriented Near-Lossless Burst Compression. 257-260 - Semih Dinç, Randy Russell, Luis Alberto Cueva Parra:
Cloud Region Segmentation from All Sky Images using Double K-Means Clustering. 261-262 - Jacob D. Hauenstein, Timothy S. Newman:
Toward Energy Efficient Curvature in Range Images. 263-264 - Khoa Pho, Han Lam, Tung Le, Huy Tien Nguyen, Atsuo Yoshitaka:
Attention-driven RetinaNet for Parasitic Egg Detection. 265-272 - Yi-Jie Chen, Yen-Chiao Wang, Bo-Hao Chen, Hsiang-Yin Cheng, Jia-Li Yin:
Actor-Critic Bilateral Filter for Noise-Robust Image Smoothing. 273-277 - Komei Hiruta, Ryusuke Saito, Taro Hatakeyama, Atsushi Hashimoto, Satoshi Kurihara:
Conditional GAN for Small Datasets. 278-281 - Shima Mohammadi, João Ascenso:
Evaluation of Sampling Algorithms for a Pairwise Subjective Assessment Methodology. 288-292 - Na Li, Yao Liu:
FFmpegSR: A General Framework Toward Real-Time 4K Super-Resolution. 293-296
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.