MMSP 2007: Chania, Crete, Greece
- IEEE 9th Workshop on Multimedia Signal Processing, MMSP 2007, Chania, Crete, Greece, October 1-3, 2007. IEEE 2007, ISBN 978-1-4244-1274-7
Plenary Talks
- Alan C. Bovik:
New Directions in Image and Video Quality Assessment Plenary Talk. 1
- Dimitris N. Metaxas:
Facial Features Tracking for Gross Head Movement analysis and Expression Recognition. 2
Invited Talks
- Patti Price:
Multimedia Technologies and Solutions for Educational Applications: Opportunities, Trends and Challenges. 3-8
- Xavier Serra:
State of the Art and Future Directions in Musical Sound Synthesis. 9-12
- Eric J. Pauwels, Albert Ali Salah, Romain Tavenard:
Sensor Networks for Ambient Intelligence. 13-16
- Touradj Ebrahimi:
Recent advances in brain-computer interfaces. 17
Multimedia Technologies for Children
- Nirit Bauminger, Dina Goren-Bar, Eynat Gal, Patrice L. Weiss, Judi Kupersmitt, Fabio Pianesi, Oliviero Stock, Massimo Zancanaro:
Enhancing Social Communication in High-Functioning Children with Autism through a Co-Located Interface. 18-21
- Alexandros Potamianos, Shrikanth S. Narayanan:
A review of the acoustic and linguistic properties of children's speech. 22-25
- Abeer Alwan, Yijian Bai, Matthew Black, Larry Casey, Matteo Gerosa, Margaret Heritage, Markus Iseli, Barbara Jones, Abe Kazemzadeh, Sungbok Lee, Shrikanth S. Narayanan, Patti Price, Joseph Tepperman, Shizhen Wang:
A System for Technology Based Assessment of Language and Literacy in Young Children: the Role of Multiple Information Sources. 26-30
Speech & Interfaces to Multimedia
- Te Li, Susanto Rahardja, Soo Ngee Koh:
Perceptual Enhancement for Fully Scalable Audio. 31-34
- Hiroaki Kokubo, Nobuo Hataoka, Akinobu Lee, Tatsuya Kawahara, Kiyohiro Shikano:
Real-Time Continuous Speech Recognition System on SH-4A Microprocessor. 35-38
- Zdenek Becvar, Lukas Novak, Jan Zelenka, Miloslav Brada, Pavel Slepicka:
Impact of Additional Noise on Subjective and Objective Quality Assessement in VoIP. 39-42
- Carlos Busso, Shrikanth S. Narayanan:
Joint Analysis of the Emotional Fingerprint in the Face and Speech: A single subject study. 43-47
- Samuel Kim, Panayiotis G. Georgiou, Sungbok Lee, Shrikanth S. Narayanan:
Real-time Emotion Detection System using Speech: Multi-modal Fusion of Different Timescale Features. 48-51
- Visar Berisha, Andreas Spanias:
Dual-Mode Wideband Speech Compression. 52-55
- Nobutaka Ono, Souichiro Fukamachi, Takuya Nishimoto, Shigeki Sagayama:
Sound Source Localization by Asymmetrically Arrayed 2ch Microphones on a Sphere. 56-59
- Viktor Rozgic, Carlos Busso, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Multimodal Meeting Monitoring: Improvements on Speaker Tracking and Segmentation through a Modified Mixture Particle Filter. 60-65
- Vassiliki Moschou, Margarita Kotti, Emmanouil Benetos, Constantine Kotropoulos:
Systematic comparison of BIC-based speaker segmentation systems. 66-69
- Oleksiy J. Koval, Sviatoslav Voloshynovskiy, Thierry Pun:
Analysis of multimodal binary detection systems based on dependent/independent modalities. 70-73
- Manjinder Singh Benning, Ajay Kapur, Bernie C. Till, George Tzanetakis:
Multimodal Sensor Analysis of Sitar Performance: Where is the Beat? 74-77
- Sanni Siltanen, Mika Hakkarainen, Otto Korkalo, Tapio Salonen, Juha Sääski, Charles Woodward, Theofanis Kannetis, Manolis Perakakis, Alexandros Potamianos:
Multimodal User Interface for Augmented Assembly. 78-81
- Ichiro Yuyama, Shigeki Takiura, Yasumasa Numata, Hiroshi Hasegawa, Yu Watanabe:
Usability Evaluation of Finger Pointer for Home-Use Display. 82-85
- Li Li, Quanzhi Li, Wu Chou, Feng Liu:
R-Flow: An Extensible XML Based Multimodal Dialog System Architecture. 86-89
Speech & Audio
- Theodoros Giannakopoulos, Aggelos Pikrakis, Sergios Theodoridis:
A Multi-Class Audio Classification Method With Respect To Violent Content In Movies Using Bayesian Networks. 90-93
- Amir Said, Ton Kalker, Ronald W. Schafer:
Phase-Domain Statistical Analysis for Audio Source Localization. 94-97
- Shiva Sundaram, Shrikanth S. Narayanan:
Experiments in Automatic Genre Classification of Full-length Music Tracks using Audio Activity Rate. 98-102
Multimedia Communication
- Ngai-Man Cheung, Antonio Ortega:
Flexible Video Decoding: A Distributed Source Coding Approach. 103-106
- Alexandros G. Dimakis, Jiajun Wang, Kannan Ramchandran:
Unequal Growth Codes: Intermediate Performance and Unequal Error Protection for Video Streaming. 107-110
- Marco Grangetto, Enrico Magli, Gabriella Olmo:
Symmetric Distributed Arithmetic Coding of Correlated Sources. 111-114
Multimedia Communication
- Yuan Lin, Anna N. Kim, Eren Gurses, Andrew Perkis:
Rate-Distortion Optimized I-Slice Selection for Low Delay Video Transmission. 115-118
- Szu-Wei Lee, C.-C. Jay Kuo:
Motion Compensation Complexity Model for Decoder-Friendly H.264 System Design. 119-122
- Maryse R. Stoufs, Adrian Munteanu, Jan Cornelis, Peter Schelkens:
Joint Source-Channel Coding for the Scalable Extension of H.264/MPEG-4 AVC. 123-126
- Juntao Ouyang, Lifeng Sun, Yuzhuo Zhong, Shiqiang Yang:
Power-Rate-Distortion Optimization for Multi-Source Video Streaming under Energy Constraints over Ad Hoc Networks. 127-130
- Sung Ho Jin, Cheon Seog Kim, Dong Jun Seo, Yong Man Ro:
Quality Measurement Modeling on Scalable Video Applications. 131-134
- Batu Sat, Benjamin W. Wah:
Evaluation of Conversational Voice Communication Quality of the Skype, Google-Talk, Windows Live, and Yahoo Messenger Voip Systems. 135-138
- Alexander Eichhorn:
Efficient Dependency Tracking in Packetised Media Streams. 139-142
- Xiaoyu Cheng, Lifeng Sun, Shiqiang Yang:
A multi-view video coding approach using Layered Depth Image. 143-146
- Wei-Chung Wen, Hsu-Feng Hsiao, Jen-Yu Yu:
Dynamic FEC-Distortion Optimization for H.264 Scalable Video Streaming. 147-150
- Antonios Argyriou:
Cross-Layer Adaptive ARQ for Uplink Video Streaming in Tandem Wireless/Wireline Networks. 151-154
- Christophe Beaugeant:
Smart Transcoding between CELP Speech Codecs through Voiced Oriented Pitch Mapping. 155-158
- Savvas Argyropoulos, Nikolaos Thomos, Nikolaos V. Boulgouris, Michael G. Strintzis:
Adaptive Frame Interpolation for Wyner-Ziv Video Coding. 159-162
- Dmytro Rusanovskyy, Moncef Gabbouj, Kemal Ugur:
Spatial and Temporal Adaptation of Interpolation Filter For Low Complexity Encoding/Decoding. 163-166
- Ivana Radulovic, Pascal Frossard:
Multiple description image coding with redundant expansions and optimal quantization. 167-170
- Shay Har-Noy, Òscar Divorra Escoda, Peng Yin, Cristina Gomila, Truong Q. Nguyen:
Adaptive In-Loop Prediction Refinement for Video Coding. 171-174
Image & Video I
- Adam Slater, Yu Hen Hu, Nigel Boston:
Multiscale Integral Invariants For Facial Landmark Detection in 2.5D Data. 175-178
- Thomas Drugman, Mihai Gurban, Jean-Philippe Thiran:
Relevant Feature Selection for Audio-Visual Speech Recognition. 179-182
- Denis Kubasov, Jayanth Nayak, Christine Guillemot:
Optimal Reconstruction in Wyner-Ziv Video Coding with Multiple Side Information. 183-186
Image & Video III
- Vasileios Chasanis, Aristidis Likas, Nikolas P. Galatsanos:
Scene Detection in Videos Using Shot Clustering and Symbolic Sequence Segmentation. 187-190
- Effrosini Kokiopoulou, Pascal Frossard:
Image alignment with rotation manifolds built on sparse geometric expansions. 191-194
- Marcus Barkowsky, Jens Bialkowski, Roland Bitto, André Kaup:
Temporal registration using 3D phase correlation and a maximum likelihood approach in the perceptual evaluation of video quality. 195-198
- Xiaohan Wang, Xiaolin Wu:
On Design of Linear Minimum-Entropy Predictor. 199-202
- Arie Hans Nasution, Sabu Emmanuel:
Intelligent Video Surveillance for Monitoring Elderly in Home Environments. 203-206
- Nicholas Vretos, Vassilios Solachidis, Ioannis Pitas:
A Face Tracker Trajectories Clustering Using Mutual Information. 207-210
- Huimin Chen, Henry L. Bart Jr., Shuqing Huang:
Integrated Feature Selection and Clustering from Multiple Views for a Taxonomic Problem. 211-214
- Tse-Wei Chen, Wei-Kai Chan, Shao-Yi Chien:
Efficient Face Detection with Segmentation and Feature-based Face Scoring in Surveillance Systems. 215-218
- Wei-Kai Chan, Shao-Yi Chien:
Real-Time Memory-Efficient Video Object Segmentation in Dynamic Background with Multi-Background Registration Technique. 219-222
- Jie Xu, Getian Ye, Jian Zhang:
Long-term Trajectory Extraction for Moving Vehicles. 223-226
- Daidi Zhong:
Fast Searching For The Optimal Area Of TFV Representation. 227-230
- Zhiyong Wang, Kelly Lam, Li Zhuo, David Dagan Feng:
Concept Constrained Image Region Annotation. 231-234
- Huang-Chia Shih, Chung-Lin Huang:
Semantics Interpretation of Superimposed Captions in Sports Videos. 235-238
- Shafiq ur Réhman, Li Liu, Haibo Li:
Manifold of Facial Expressions for Tactile Perception. 239-242
Image & Video II
- Jiangbo Lu, Sammy Rogmans, Gauthier Lafruit, Francky Catthoor:
High-Speed Stream-Centric Dense Stereo and View Synthesis on Graphics Hardware. 243-246
- Yang Liu, Gagan Rath, Christine Guillemot:
Improved Intra Prediction for H.264/AVC Scalable Extension. 247-250
- Denis Kubasov, Khaled Lajnef, Christine Guillemot:
A Hybrid Encoder/Decoder Rate Control for Wyner-Ziv Video Coding with a Feedback Channel. 251-254
Interfaces to Multimedia and Multimodal Interaction
- Vit Libal, Jonathan Connell, Gerasimos Potamianos, Etienne Marcheret:
An Embedded System for In-Vehicle Visual Speech Activity Detection. 255-258
- JongHo Shin, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Analyzing the Multimodal Behaviors of Users of a Speech-to-Speech Translation Device by using Concept Matching Scores. 259-263
- George Papandreou, Athanassios Katsamanis, Vassilis Pitsikalis, Petros Maragos:
Multimodal Fusion and Learning with Uncertain Features Applied to Audiovisual Speech Recognition. 264-267
Image & Video IV
- Sunday Nyamweno, Ramdas Satyan, Fabrice Labeau:
Exponential Decay of Transmission Distortion in H.264. 268-271
- Costas Panagiotakis, Ilias Grinias, Georgios Tziritas:
MINMAX Video Summarization under Equality Principle. 272-275
- Chan-Sik Park, Ju-Hee Kim:
Bus Bandwidth Aware H.264/AVC Motion Compensation Design for High Definition Video Encoding. 276-279
- Ying Li, Chitra Dorai:
Applying Image Analysis to Auto Insurance Triage: A Novel Application. 280-283
- Do-Kyoung Kwon, Yongjin Cho, C.-C. Jay Kuo:
A Simplified Rate Control Scheme for Non-Conversational H.264 Video. 284-287
- Yu-Ming Liang, Sheng-Wen Shih, Arthur Chun-Chieh Shih, Hong-Yuan Mark Liao, Cheng-Chung Lin:
A Language Modeling Approach to Atomic Human Action Recognition. 288-291
- Jung-Shiong Chang, Arthur Chun-Chieh Shih, Hong-Yuan Mark Liao, Wen-Hsien Fang:
Principal Component Analysis-based Mesh Decomposition. 292-295
- Paulo Vinicius Koerich Borges, Joceli Mayer, Ebroul Izquierdo:
Segmentation of Document Images Using Higher Order Statistics. 296-299
- Benjamin Le Guen, Stéphane Pateux, Jacques Weiss:
Motion-Geometry Compensation for Analysis-Synthesis Video Coder. 300-303
- Nicolas Tizon, Béatrice Pesquet-Popescu:
An adaptive synthesis filter bank for image decoding with fractional scalability. 304-307
- Kwang-deok Seo, Kyu-Chan Roh:
Advanced FGS Coding Scheme Based on MPEG-4 FGS Technology. 308-311
- Ismaël Daribo, Christophe Tillier, Béatrice Pesquet-Popescu:
Distance Dependent Depth Filtering in 3D Warping for 3DTV. 312-315
- Jun-Wei Hsieh, Cheng-Shuang Peng, Kuo-Chin Fan:
Grid-based Template Matching for People Counting. 316-319
- Konstantinos Rapantzikos, Georgios Evangelopoulos, Petros Maragos, Yannis Avrithis:
An Audio-Visual Saliency Model for Movie Summarization. 320-323
Multimedia Processing for Biomedical Applications
- Sokratis Makrogiannis, Jeremy Wellen, Yanjun Wu, Luke Bloy, Susanta K. Sarkar:
A Multimodal Image Registration and Fusion Methodology Applied to Drug Discovery Research. 324-327
- Mark Hasegawa-Johnson:
A Multi-Stream Approach to Audiovisual Automatic Speech Recognition. 328-331
Multimedia Indexing, Search and Security
- Regunathan Radhakrishnan, Claus Bauer:
Content-based Video Signatures based on Projections of Difference Images. 341-344
- Sviatoslav Voloshynovskiy, Oleksiy J. Koval, Fokko Beekhof, Thierry Pun:
Robust perceptual hashing as classification problem: decision-theoretic and practical considerations. 345-348
- Xinglei Zhu, Zhishou Zhang, Zhi Li, Qibin Sun:
Flexible Layered Authentication Graph for Multimedia Streaming. 349-352
- Antonis Mairgiotis, Giannis K. Chantas, Nikolaos Galatsanos, Konstantinos Blekas, Yongyi Yang:
New Detectors for Watermarks with Unknown Power Based on Student-t Image Priors. 353-356
- Soumyadip Rakshit, Donald M. Monro:
Medical Conditions: Effect on Iris Recognition. 357-360
- Jingjing Liu, Xian-Sheng Hua, Shipeng Li:
Object-Sensitive Query Analysis for Video Search. 361-364
- Danoush Hosseinzadeh, Sridhar Krishnan:
Combining Vocal Source and MFCC Features for Enhanced Speaker Recognition Performance Using GMMs. 365-368
- Jihane Bennour, Jean-Luc Dugelay:
Toward a 3D watermarking benchmark. 369-372
- Spyridon K. Kapotas, Eleni E. Varsaki, Athanassios N. Skodras:
Data Hiding in H.264 Encoded Video Sequences. 373-376
- Syed Ali Raza Jafri, Shahab Baqai:
Robust Digital Watermarking for Wavelet-based Compression. 377-380
- Yuhua Jiao, Bian Yang, Mingyu Li, Xiamu Niu:
MDCT-Based Perceptual Hashing for Compressed Audio Content Identification. 381-384
- Jun Zhang, Ingemar J. Cox, Gwenaël J. Doërr:
Steganalysis for LSB Matching in Images with High-frequency Noise. 385-388
- Yoshiaki Itoh, Akira Iwabuchi, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee:
Music Boundary Detection Using Similarity in a Music Selection. 389-392
Security and Multimedia
- Yao-Chung Lin, David P. Varodayan, Bernd Girod:
Image Authentication and Tampering Localization using Distributed Source Coding. 393-396
- Ashwin Swaminathan, Min Wu, K. J. Ray Liu:
A Component Estimation Framework for Information Forensics. 397-400
- Nitin Singhal, Young-Yoon Lee, Chang-Su Kim, Sang-Uk Lee:
Robust Image Watermarking Based on Local Zernike Moments. 401-404
Multimedia Indexing and Search
- Erdem Ünal, Panayiotis G. Georgiou, Shrikanth S. Narayanan, Elaine Chew:
Statistical Modeling and Retrieval of Polyphonic Music. 405-409
- Parvez Ahammad, Chuohao Yeo, Kannan Ramchandran, S. Shankar Sastry:
Unsupervised Discovery of Action Hierarchies in Large Collections of Activity Videos. 410-413
- Huang-Chia Shih, Chung-Lin Huang, Jenq-Neng Hwang:
Video Attention Ranking using Visual and Contextual Attention Model for Content-based Sports Videos Mining. 414-417
Multimedia Emerging Applications
- Aleksej Spenst, Thorsten Herfet:
A User-Centric QoS Management Approach for Digital Home. 418-421
- Jens-Uwe Garbas, André Kaup:
Wavelet-Based Multi-View Video Coding with Spatial Scalability. 422-425
- Wei-Yang Lin, Ming-Yang Chen, Kerry R. Widder, Yu Hen Hu, Nigel Boston:
Fusion of Multiple Facial Regions for Expression-Invariant Face Recognition. 426-429
- Francesca De Simone, Marco Carli, Alessandro Neri, Adrian Hornsby, Irek Defée:
Robust Data Hiding Technique for Video Error Concealment over DVB-H channel. 430-433
- Dimitrios Besiris, Nikolaos A. Laskaris, Fotini Fotopoulou, George Economou:
Key frame extraction in video sequences: a vantage points approach. 434-437
- Naoya Matsusue, Hiroshi Hasegawa, Ken-ichi Sato:
Suppression of Boundary Effect and Introduction of Scale Correlation for Wavelet based Traffic Prediction. 438-440
- Yi Pang, Lifeng Sun, Songliu Guo, Shiqiang Yang:
Spatial and Temporal Data Parallelization of Multi-view Video Encoding Algorithm. 441-444
- Athanassios Zagouras, George Economou, Andrew Macedonas, Spiros Fotopoulos:
An application study of manifold learning-ranking techniques in face recognition. 445-448
- Carman K. M. Yuk, Oscar C. Au, Richard Y. M. Li, Sui-Yuk Lam:
Soft-Decision Color Demosaicking with Direction Vector Selection. 449-452
- José Diogo Areia, João Ascenso, Catarina Brites, Fernando Pereira:
Wyner-Ziv Stereo Video Coding using a Side Information Fusion Approach. 453-456
- Athanassios Katsamanis, George Papandreou, Petros Maragos:
Audiovisual-to-Articulatory Speech Inversion Using HMMs. 457-460
- Mohammad Hosein Sedaaghi, Constantine Kotropoulos, Dimitrios Ververidis:
Using Adaptive Genetic Algorithms to Improve Speech Emotion Recognition. 461-464
- Hui-Yu Huang, Weir-Sheng Shih, Wen-Hsing Hsu:
A Film Classifier Based on Low-level Visual Features. 465-468