23rd MMSP 2021: Tampere, Finland
- 23rd International Workshop on Multimedia Signal Processing, MMSP 2021, Tampere, Finland, October 6-8, 2021. IEEE 2021, ISBN 978-1-6654-3288-7
- Jens Brandenburg, Adam Wieckowski, Anastasia Henkel, Benjamin Bross, Detlev Marpe:
Pareto-optimized coding configurations for VVenC, a fast and efficient VVC encoder. 1-6
- Shahab Pasha, Arman Arian, Jan Lundgren:
Machine-learnt Beamforming for Large Aperture 3D Microphone Arrays, An Industrial Application. 1-6
- Alper Koz, Baris Demirkiliç, Yunus Bilge Kurt, Ahmet Oguz Akyüz, Sinan Kalkan, A. Aydin Alatan, Alan Chalmers:
HDR Image Construction from Trifocal Multiexposure Images. 1-5
- Xiaoya Zhang, Yuanzhi Yao, Nenghai Yu:
Convolutional Neural Network-driven Optimal Prediction for Image Reversible Data Hiding. 1-6
- Viktoria Heimann, Andreas Spruck, André Kaup:
Frequency-Selective Mesh-to-Mesh Resampling for Color Upsampling of Point Clouds. 1-6
- Michael Buron Yuen, Carlos Vázquez:
Human Subject Distance Estimation Using the Pupillary Distance and Head Orientation. 1-6
- Wentao Yu, Steffen Zeiler, Dorothea Kolossa:
Large-vocabulary Audio-visual Speech Recognition in Noisy Environments. 1-6
- Negar Heidari, Alexandros Iosifidis:
Progressive Spatio-Temporal Bilinear Network with Monte Carlo Dropout for Landmark-based Facial Expression Recognition with Uncertainty Estimation. 1-6
- Lohic Fotio Tiotsop, Tomas Mizdos, Enrico Masala, Marcus Barkowsky, Peter Pocta:
How to Train No Reference Video Quality Measures for New Coding Standards using Existing Annotated Datasets? 1-6
- Ho Tan Nguyen, Chi Do-Kim Pham, Jinjia Zhou:
SpeedDeblur: A Framework to speed up CNN-based Deblurring for HEVC compressed video. 1-6
- Waqas Ellahi, Toinon Vigier, Patrick Le Callet:
A machine-learning framework to predict TMO preference based on image and visual attention features. 1-6
- Maxim Verwilst, Nina Zizakic, Lingchen Gu, Aleksandra Pizurica:
Deep image hashing based on twin-bottleneck hashing with variational autoencoders. 1-6
- María Santamaría, Vinod Kumar Malamal Vadakital, Lukasz Kondrad, Antti Hallapuro, Miska M. Hannuksela:
Coding of volumetric content with MIV using VVC subpictures. 1-6
- Hannes Fassold:
Detecting speaking persons in video. 1
- Anubhav Jain, Pavel Korshunov, Sébastien Marcel:
Improving Generalization of Deepfake Detection by Training for Attribution. 1-6
- Yingqi Tang, Xiang Zhang, Donghang Chen, Zhizhuo Zhang, Haifei Yu:
Motion-augmented Change Detection for Video Surveillance. 1-6
- Anthony Trioux, Giuseppe Valenzise, Marco Cagnazzo, Michel Kieffer, François-Xavier Coudoux, Patrick Corlay, Mohamed Gharbi:
A Perceptual Study of the Decoding Process of the SoftCast Wireless Video Broadcast Scheme. 1-6
- Simoni Panayi, Alessandro Artusi:
Hazing or Dehazing: the big dilemma for object detection. 1-9
- Bohan Li, Lauren Partin, Jingning Han, Yaowu Xu:
A Temporal Filtering Approach Based on Optical Flow Estimation for Video Coding. 1-6
- Gerasimos Arvanitis, Aris S. Lalos, Konstantinos Moustakas:
Fast Spatio-temporal Compression of Dynamic 3D Meshes. 1-6
- Shangyin Gao, Lev Markhasin, Bi Wang:
Spatial Cross-Attention RGB-D Fusion Module for Object Detection. 1-6
- Kelvin Chelli, Roopak R. Tamboli, Thorsten Herfet:
Deep Learning-based Semantic Analysis of Sparse Light Field Ray Sets. 1-6
- Mehryar Abbasi, Parvaneh Saeedi, Jason Au, Jon Havelock:
Timed Data Incrementation: A Data Regularization Method for IVF Implantation Outcome Prediction from Length Variant Time-lapse Image Sequences. 1-5
- Antonio Jesús Muñoz-Montoro, Julio J. Carabias-Orti, Pedro Vera-Candeas:
Ambisonics domain Singing Voice Separation combining Deep Neural Network and Direction Aware Multichannel NMF. 1-6
- Sardar Basiri, Kaiwen Zhang, Stéphane Coulombe:
An Action-Aware Combat Model for Efficient Video Compression of Massively Multiplayer Online Role-playing Games on Cloud Gaming Platforms. 1-6
- Toby Godwin, Georgios Rizos, Alice Baird, Najla D. Al Futaisi, Vincent Brisse, Björn W. Schuller:
Evaluating Deep Music Generation Methods Using Data Augmentation. 1-6
- Deeraj Nagothu, Ronghua Xu, Yu Chen, Erik Blasch, Alexander J. Aved:
DeFake: Decentralized ENF-Consensus Based DeepFake Detection in Video Conferencing. 1-6
- Yuki Sugimoto, Shoko Imaizumi:
A Lossless Image Processing Method with Contrast and Saturation Enhancement. 1-6
- Marc Górriz Blanch, Issa Khalifeh, Noel E. O'Connor, Marta Mrak:
Attention-based Stylisation for Exemplar Image Colourisation. 1-6
- Mateusz Guzik, Mieszko Fras, Konrad Kowalczyk:
Incorporation of Localization Information for Sound Source Separation in Spherical Harmonic Domain. 1-6
- Ilyass Abouelaziz, Aladine Chetouani, Mohammed El Hassouni, Hocine Cherifi:
No-Reference Mesh Visual Quality Assessment Using Graph-Based Deep Learning. 1-6
- Haruhisa Kato, Tatsuya Kobayashi, Masaru Sugano, Sei Naito:
Split Rendering of the Transparent Channel for Cloud AR. 1-6
- Wang Peng, Liping Yang, Xiaohua Gu:
Convolutional Receptive Field Dual Selection Mechanism for Acoustic Scene Classification. 1-6
- Vignesh V. Menon, Hadi Amirpour, Christian Timmerer, Mohammed Ghanbari:
INCEPT: Intra CU Depth Prediction for HEVC. 1-6
- Ismael Seidel, Vanio Rodrigues Filho, Mateus Grellert, Luciano Volcan Agostini, José Luís Güntzel:
SAD or SATD? How the Distortion Metric Impacts a Fractional Motion Estimation VLSI Architecture. 1-6
- Julitta Bartolewska, Konrad Kowalczyk:
Frame-based Maximum a Posteriori Estimation of Second-Order Statistics for Multichannel Speech Enhancement in Presence of Noise. 1-6
- Bilal Hassan, Ebroul Izquierdo:
ApparelNet: Person Verification Encompassing Auxiliary Attachments Variation. 1-6
- Yuzhuo Ren, Braeden Syrnyk, Niranjan Avadhanam:
Dual Attention Network for Heart Rate and Respiratory Rate Estimation. 1-6
- Andrey Makrushin, Mark Trebeljahr, Stefan Seidlitz, Jana Dittmann:
On feasibility of GAN-based fingerprint morphing. 1-6
- Erion-Vasilis M. Pikoulis, Christos Mavrokefalidis, Aris S. Lalos:
A data-aware dictionary-learning based technique for the acceleration of deep convolutional networks. 1-5
- Conggui Liu, Yoshinao Sato:
Enhancing Block-Online Speech Separation using Interblock Context Flow. 1-6
- Federico Simonetta, Stavros Ntalampiras, Federico Avanzini:
Audio-to-Score Alignment Using Deep Automatic Music Transcription. 1-6
- Srividya Tirunellai Rajamani, Kumar T. Rajamani, Björn W. Schuller:
Towards an Efficient Deep Learning Model for Emotion and Theme Recognition in Music. 1-5
- Mikko Parviainen, Pasi Pertilä:
Time Difference of Arrival Estimation of Multiple Simultaneous Speakers Using Deep Clustering Neural Networks. 1-6
- Da-Yoon Nam, Hae-Kwang Kim, Jong-Ki Han:
Efficient View Synthesis Algorithm Using View Selection for Generating 6DoF Images. 1-6
- Davi Lazzarotto, Evangelos Alexiou, Touradj Ebrahimi:
Benchmarking of objective quality metrics for point cloud compression. 1-6
- Simon Grosche, Fabian Brand, André Kaup:
A Novel End-To-End Network for Reconstruction of Non-Regularly Sampled Image Data Using Locally Fully Connected Layers. 1-6
- Farid Alijani, Esa Rahtu:
Supervised Fine-tuning Evaluation for Long-term Visual Place Recognition. 1-6
- Ugur Alican Alma, Pablo Alvarez Romeo, Mehmet Ercan Altinsoy:
Preliminary Study of Upper-Body Haptic Feedback Perception on Cinematic Experience. 1-6
- Dorsaf Sebai:
Multi-rate deep semantic image compression with quantized modulated autoencoder. 1-6
- David Heise, Helen L. Bear:
Visually Exploring Multi-Purpose Audio Data. 1-6
- Steve Göring, Alexander Raake:
Rule of Thirds and Simplicity for Image Aesthetics using Deep Neural Networks. 1-6
- Theyab A. Alotaibi, Farid Bourennani, Ishtiaq Rasool Khan:
Assessing the Performance of Image Quality Assessment Metrics. 1-6
- Sarah Fachada, Armand Losfeld, Takanori Senoh, Gauthier Lafruit, Mehrdad Teratani:
A Calibration Method for Subaperture Views of Plenoptic 2.0 Camera Arrays. 1-6
- Zeman Shao, Shaobo Fang, Runyu Mao, Jiangpeng He, Janine L. Wright, Deborah A. Kerr, Carol J. Boushey, Fengqing Zhu:
Towards Learning Food Portion From Monocular Images With Cross-Domain Feature Adaptation. 1-6
- Steve Göring, Rakesh Rao Ramachandra Rao, Stephan Fremerey, Alexander Raake:
AVrate Voyager: an open source online testing platform. 1-6
- Madhukar Bhat, Jean-Marc Thiesse, Patrick Le Callet:
VVC partitioning decision driven by machine learning for a comprehensive hardware encoder. 1-6
- Aladine Chetouani, Maurice Quach, Giuseppe Valenzise, Frédéric Dufaux:
Convolutional Neural Network for 3D Point Cloud Quality Assessment with Reference. 1-6
- Minghong Mo, Fan Liang, Jun Wang:
An Optimization Algorithm for Color Table Generation of Palette Mode for VVC. 1-5
- Çaglar Aytekin, Sakari Alenius, Dmytro Paliy, Juuso Gren:
A Sub-band Approach to Deep Denoising Wavelet Networks and a Frequency-adaptive Loss for Perceptual Quality. 1-6
- Alireza Zare, Alireza Aminlou, Miska M. Hannuksela:
VVC Adaptive Loop Filter Optimization for Subpicture-based Viewport-adaptive Streaming. 1-6
- Shoken Kaneko, Hannes Gamper:
A Fast Forest Reverberator Using Single Scattering Cylinders. 1-5
- Hans-Jürgen Zepernick, Kerstin Pieper, Robert P. Spang, Ulrich Engelke, Matthias Hirth, Babak Naderi:
On the Impact of COVID-19 on Subjective Digital Media Quality Assessment. 1-6
- Runyu Mao, Jiangpeng He, Luotao Lin, Zeman Shao, Heather A. Eicher-Miller, Fengqing Zhu:
Improving Dietary Assessment Via Integrated Hierarchy Food Classification. 1-6
- Jan Willem Kleinrouweler, Toni Dimitrovski, Sjors Braam, Rick Hindriks, Hans van den Berg, Lucia D'Acunto, Omar Niamut:
Dynamic Edge Offloading for Real-time Video Processing Pipelines. 1
- Ashish Alex, Lin Wang, Paolo Gastaldo, Andrea Cavallaro:
Mixup Augmentation for Generalizable Speech Separation. 1-6
- Vanio Rodrigues Filho, Marcio Monteiro, Ismael Seidel, Mateus Grellert, José Luís Güntzel:
Hardware-Friendly Search Patterns for the Versatile Video Coding Fractional Motion Estimation. 1-6
- Andreas Papandreou, Andreas Kloukiniotis, Aris S. Lalos, Konstantinos Moustakas:
Deep multi-modal data analysis and fusion for robust scene understanding in CAVs. 1-6
- Minh Nguyen, Ekrem Çetinkaya, Hermann Hellwagner, Christian Timmerer:
WISH: User-centric Bitrate Adaptation for HTTP Adaptive Streaming on Mobile Devices. 1-6
- Nesryne Mejri, Konstantinos Papadopoulos, Djamila Aouada:
Leveraging High-Frequency Components for Deepfake Detection. 1-6
- Weiyan Chen, Changjian Zhu, Shan Zhang:
Piecewise Segmentation Occlusion Model for Image-Based Plenoptic Spectral Analysis. 1-6
- Farid Alijani, Esa Rahtu:
Supervised Fine-tuning Evaluation for Long-term Visual Place Recognition. 1
- Hyunse Yoon, Seongmin Lee, Jiwoo Kang, Sanghoon Lee:
Deep Chessboard Corner Detection Using Multi-task Learning. 1-6
- Nayna Jain, Karthik Nandakumar, Nalini K. Ratha, Sharath Pankanti, Uttam Kumar:
Optimizing Homomorphic Encryption based Secure Image Analytics. 1-6
- Mert Seker, Anssi Männistö, Alexandros Iosifidis, Jenni Raitoharju:
Automatic Main Character Recognition for Photographic Studies. 1-6
- Xhenis Çoba, Fangchen Feng, Azeddine Beghdadi:
Blind image separation for document restoration using plug-and-play approach. 1-6
- Jian Cao, Yifan Jia, Fan Liang, Jun Wang:
Encounter CU Again: History-Based Complexity Reduction Strategy for VVC Intra-Frame Encoder. 1-6
- Chaofei Wang, Wenjie Zhu, Yingzhan Xu, Yiling Xu, Le Yang:
Point-Voting based Point Cloud Geometry Compression. 1-5
- Yana Nehmé, Patrick Le Callet, Florent Dupont, Jean-Philippe Farrugia, Guillaume Lavoué:
Exploring Crowdsourcing for Subjective Quality Assessment of 3D Graphics. 1-6
- Milan Stepanov, M. Umair Mukati, Giuseppe Valenzise, Søren Forchhammer, Frédéric Dufaux:
Learning-based lossless light field compression. 1-6
- Paritosh Parmar, Jaiden Reddy, Brendan Morris:
Piano Skills Assessment. 1-5
- Sotirios Papadopoulos, Charalampos Symeonidis, Ioannis Pitas:
Leader and breakaway detection in racing sports videos. 1-5
- Evgeny Belyaev:
Fast Decoding and Parameters Selection for CS-JPEG Video Codec. 1-6
- Yufei Zeng, Yanxiong Li, Zhenfeng Zhou, Ruiqi Wang, Difeng Lu:
Domestic Activities Classification from Audio Recordings Using Multi-scale Dilated Depthwise Separable Convolutional Network. 1-5
- Fang-Yi Chao, Cagri Ozcinar, Aljosa Smolic:
Transformer-based Long-Term Viewport Prediction in 360° Video: Scanpath is All You Need. 1-6
- Stephen Voran:
Optimal Frame Duration for Oracle Audio Signal Separation is Determined by Joint Minimization of Two Antagonistic Artifacts. 1-6
- Christoph Gerhardt, Florian Weidner, Wolfgang Broll:
OUTSIDE: Multi-Scale Semantic Segmentation of Universal Outdoor Scenes. 1-6
- Frank Sippel, Jürgen Seiler, André Kaup:
Hyperspectral Image Reconstruction from Multispectral Images Using Non-Local Filtering. 1-6
- Emre Can Kaya, Ioan Tabus:
Neural Network Modeling of Probabilities for Coding the Octree Representation of Point Clouds. 1-6
- Arnaud Soulier, Pauline Puteaux, Frédéric Comby, William Puech:
Lossless Satellite Data Compression for Real-Time Navigation of Autonomous Vehicles. 1-6
- Yoichi Matsuo, Kazuhisa Yamagishi, Shoko Takahashi:
Shapley-value-based Quality Degradation Analysis Method for Adaptive Bitrate Streaming Services. 1-6
- Bishwo Adhikari, Xingyang Ni, Esa Rahtu, Heikki Huttunen:
Towards a Real-Time Facial Analysis System. 1-6
- Teck Kai Chan, Cheng Siong Chin:
Detecting Sound Events Using Convolutional Macaron Net With Pseudo Strong Labels. 1-6
- Alireza Javaheri, Catarina Brites, Fernando Pereira, João Ascenso:
A Point-to-Distribution Joint Geometry and Color Metric for Point Cloud Quality Assessment. 1-6
- Olfa Haggui, Hamza Bayd, Baptiste Magnier, Arezki Aberkane:
Human Detection in Moving Fisheye Camera using an Improved YOLOv3 Framework. 1-6
- Randy Frans Fela, Nick Zacharov, Søren Forchhammer:
Perceptual Evaluation of 360 Audiovisual Quality and Machine Learning Predictions. 1-6
- Davide Berghi, Adrian Hilton, Philip J. B. Jackson:
Visually Supervised Speaker Detection and Localization via Microphone Array. 1-6
- Haoyu Chen, Edward J. Delp, Amy R. Reibman:
Estimating Image Quality for Person Re-Identification. 1-6
- Rita Fermanian, Mikael Le Pendu, Christine Guillemot:
Regularizing the Deep Image Prior with a Learned Denoiser for Linear Inverse Problems. 1-6
- Ailbhe Gill, Mikael Le Pendu, Martin Alain, Emin Zerman, Aljosa Smolic:
Light Field Visual Attention Prediction Using Fourier Disparity Layers. 1-6
- Pramit Mazumdar, Giuliano Arru, Marco Carli, Federica Battisti:
Analysis of the influence of human faces for the estimation of salience in omnidirectional images. 1-5
- Anastasios Vafeiadis, Ioannis Papadimitriou, Anastasis Papanagnou, Dimitrios Giakoumis, Konstantinos Votis, Dimitrios Tzovaras:
Evaluating Spectral Magnitude Representation and Spectral Energy for Audio-based Activity Detection. 1-6
- Zhenyu Lei, Yejing Xie, Suiyi Ling, Andreas Pastor, Junle Wang, Junyu Dong, Patrick Le Callet:
Multi-Modal Aesthetic Assessment for Mobile Gaming Image. 1-5
- Abhishek Goswami, Ali Ak, Wolf Hauser, Patrick Le Callet, Frédéric Dufaux:
Reliability of Crowdsourcing for Subjective Quality Evaluation of Tone Mapping Operators. 1-6
- Joakim Edlund, Christine Guillemot, Mårten Sjöström:
Analysis of Top-Down Connections in Multi-Layered Convolutional Sparse Coding. 1-6
- Stuart W. Perry, Luís Alberto da Silva Cruz, Emil Dumic, Nhung Hong Thi Nguyen, António M. G. Pinheiro, Evangelos Alexiou:
Comparison of Remote Subjective Assessment Strategies in the Context of the JPEG Pleno Point Cloud Activity. 1-6
- Meenakshi Meenakshi, Seshan Srirangarajan:
Low-Rank Double Relaxed Regression for Discriminative Projection Learning. 1-6
- Hui Yuan, Raouf Hamzaoui, Ferrante Neri, Shengxiang Yang, Tingting Wang:
Global Rate-distortion Optimization of Video-based Point Cloud Compression with Differential Evolution. 1-6