Abstract
We present SocialSense , a collaborative smartphone based speaker and mood identification and reporting system that uses a user’s voice to detect and log his/her speaking and mood episodes. SocialSense collaboratively works with other phones that are running the app present in the vicinity to periodically send/receive speaking and mood vectors to/from other users present in a social interaction setting, thus keeping track of the global speaking episodes of all users with their mood. In addition, it utilizes a novel event-adaptive dynamic classification scheme for speaker identification which updates the speaker classification model every time one or more users enter or leave the scenario, ensuring a most updated classifier based on user presence. Evaluation of using dynamic classifiers shows that SocialSense improves speaker identification accuracy by 30% compared to traditional static speaker identification systems, and a 10% to 43% performance boost under various noisy environments. SocialSense also improves the mood classification accuracy by 4% to 20% compared to the baseline approaches. Energy consumption experiments show that its device daily lifetime is between 10-14 hours.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Neumann, R., Strack, F.: Mood Contagion: The automatic transfer of mood between persons. Journal of Personality and Social Psychology 79(2), 211–223 (2000)
Reynolds, D.A.: Speaker identification and verification using gaussian mixture speaker models. Speech Communication 17(1-2), 91–108 (1995)
Reynolds, D.A., Rose, R.C.: Robust text-independent speaker identification using gaussian mixture speaker models. Transactions on Speech and Audio Processing 3(1), 72–83 (1995)
Luo, C., Chan, M.C.: SocialWeaver: collaborative inference of human conversation networks using smartphones. In: 11th ACM Conference on Embedded Networked Sensor Systems (SenSys), Roma, Italy (2013)
Nakakura, T., Sumi, Y., Nishida, T.: Neary: conversation field detection based on similarity of auditory situation. In: 10th Workshop on Mobile Computing Systems and Applications (HotMobile), Santa Cruz, California, USA (2009)
Lu, H., Brush, B., Priyantha, B., Karlson, A.K., Liu, J.: SpeakerSense: Energy efficient unobtrusive speaker identification on mobile phones. In: IEEE Pervasive Computing and Communication (PerCom), Seattle, Washington, USA (2011)
Rachuri, K., Musolesi, M., Mascolo, C., Rentfrow, P.J., Longworth, C., Aucinas, A.: EmotionSense: A mobile phone based adaptive platform for experimental social psychology research. In: ACM International Joint Conference on Pervasive and Ubiquitous Computing (UbiComp), Copenhagen, Denmark (2010)
Yu, C., Aoki, P.M., Woodruff, A.: Detecting user engagement in everyday conversations. In: 8th International Conference on Spoken Language Processing, South Korea (2004)
Casale, P., Pujol, O., Radeva, P.: Face-to-face social activity detection using data collected with a wearable device. In: 4th Iberian Conference on Pattern Recognition, Portugal (2009)
Miluzzo, E., Lane, N.D., Fodor, K., Peterson, R., Lu, H., Musolesi, M., Eisenman, S.B., Zheng, X., Campbell, A.T.: Sensing meets mobile social networks: the design, implementation and evaluation of the CenceMe application. In: 6th ACM Conference on Embedded Networked Sensor Systems (SenSys), Raleigh, North Carolina, USA (2008)
Choudhury, T.: Sensing and modeling human networks. Ph.D. Thesis, Program in Media Arts and Sciences, Massachusetts Institute of Technology (2004)
Chen, D., Yang, J., Malkin, R., Wactlar, H.D.: Detecting social interactions of the elderly in a nursing home environment. ACM Transactions on Multimedia Computing, Communications and Applications 3(1), 1–22 (2007)
Li, Q., Chen, S., Stankovic, J.A.: Multi-modal in-person interaction monitoring using smartphone and on-body sensors. In: IEEE International Conference on Body Sensor Networks, Cambridge, MA, USA (2013)
Audacity, http://audacity.sourceforge.net/
Kim, J., Lee, S., Narayan, S.S.: An exploratory study of manifolds of emotional speech. In: Acoustics Speech and Signal Processing, Dallas, TX, USA (2010)
Stefanacci, R.G.: How big an issue is depression in assisted living? Assisted Living Consult 4(4), 30–35 (2008)
Miluzzo, E., Cornelius, C.T., Ramaswamy, A., Choudhury, T., Liu, Z., Campbell, A.T.: Darwin phones: the evolution of sensing and inference on mobile phones. In: 8th International Conference on Mobile Systems, Applications, and Services (MobiSys), San Francisco, California, USA (2010)
Xu, C., et al.: Crowd++: unsupervised speaker count with smartphones. In: ACM International Joint Conference on Pervasive and Ubiquitous Computing, Zurich, Switzerland (2013)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Ahmed, M.Y., Kenkeremath, S., Stankovic, J. (2015). SocialSense: A Collaborative Mobile Platform for Speaker and Mood Identification. In: Abdelzaher, T., Pereira, N., Tovar, E. (eds) Wireless Sensor Networks. EWSN 2015. Lecture Notes in Computer Science, vol 8965. Springer, Cham. https://doi.org/10.1007/978-3-319-15582-1_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-15582-1_5
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-15581-4
Online ISBN: 978-3-319-15582-1
eBook Packages: Computer ScienceComputer Science (R0)