[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2063576.2064027acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
demonstration

PICASSO: automated soundtrack suggestion for multi-modal data

Published: 24 October 2011 Publication History

Abstract

We demonstrate PICASSO, a novel approach to soundtrack recommendation. Given text, video, or image documents, PICASSO selects the best fitting music pieces, out of a given set of files, for instance, a user's personal mp3 collection. This task, commonly referred to as soundtrack suggestion, is non-trivial as it requires a lot of human attention and a good deal of experience, with master pieces distinguished, e.g., with the Academy Award for Best Original Score. We put forward PICASSO to solve this task in a fully automated way. We address the problem by extracting the required information, in form of music/screenshot samples, from available contemporary movies, making the training set easily obtainable. The training set is further extended with information acquired from movie scripts and subtitles, giving us a richer description of the action and atmosphere expressed in a particular movie scene. Although the number of applications for this approach is very large, we focus on two selected applications. First, we consider recommendation of the soundtrack for the slide show generation based on the given set of images. Second, we consider recommending a soundtrack as the background music for given audio books.

References

[1]
Sihem Amer-Yahia, et al. Group recommendation: Semantics and efficiency. PVLDB, 2009.
[2]
Audiobooks online. http://www.audiobooks.org/.
[3]
B. S. Manjunath, et al. Color and texture descriptors. IEEE Trans. Circuits Syst. Video Techn., 2001.
[4]
Marco Cristani, et al. Toward an automatically generated soundtrack from low-level cross-modal correlations for automotive scenarios. ACM Multimedia, 2010.
[5]
Mayur Datar, et al. Locality-sensitive hashing scheme based on p-stable distributions. Symposium on Computational Geometry, 2004.
[6]
Jan Engelen. A rapidly growing electronic publishing trend: Audiobooks for leisure and education. ELPUB, 2008.
[7]
Ronald Fagin, et al. Comparing partial rankings. SIAM J. Discrete Math., 2006.
[8]
Cheng-Te Li and Man-Kwan Shan. Emotion-based impressionism slideshow with automatic music accompaniment. ACM Multimedia, 2007.
[9]
Lucene framework. http://lucene.apache.org.
[10]
Meinard Müller. Information Retrieval for Music and Motion. Springer, 2007.
[11]
Music2ten. http://music2ten.com.%
[12]
%Irina Rish.% An empirical study of the naive Bayes classifier.% In IJCAI-01 workshop on "Empirical Methods in AI".
[13]
Niklas Röber, et al. Interactive audiobooks: Combining narratives with game elements. TIDSE, 2006.
[14]
Aleksandar Stupar and Sebastian Michel. PICASSO - to sing you must close your eyes and draw. SIGIR, 2011.
[15]
George Tzanetakis. Music analysis, retrieval and synthesis of audio signals marsyas. ACM Multimedia, 2009.

Cited By

View all
  • (2017)Conclusion and Future WorkMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_8(235-260)Online publication date: 1-Sep-2017
  • (2017)Adaptive News Video UploadingMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_7(205-234)Online publication date: 1-Sep-2017
  • (2017)Lecture Video SegmentationMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_6(173-203)Online publication date: 1-Sep-2017
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge management
October 2011
2712 pages
ISBN:9781450307178
DOI:10.1145/2063576
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 October 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. audio books
  2. automatic music selection
  3. background music
  4. slide show
  5. soundtrack recommendation

Qualifiers

  • Demonstration

Conference

CIKM '11
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 17 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2017)Conclusion and Future WorkMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_8(235-260)Online publication date: 1-Sep-2017
  • (2017)Adaptive News Video UploadingMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_7(205-234)Online publication date: 1-Sep-2017
  • (2017)Lecture Video SegmentationMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_6(173-203)Online publication date: 1-Sep-2017
  • (2017)Soundtrack Recommendation for UGVsMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_5(139-171)Online publication date: 1-Sep-2017
  • (2017)Tag Recommendation and RankingMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_4(101-138)Online publication date: 1-Sep-2017
  • (2017)Event UnderstandingMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_3(59-99)Online publication date: 1-Sep-2017
  • (2017)Literature ReviewMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_2(31-57)Online publication date: 1-Sep-2017
  • (2017)IntroductionMultimodal Analysis of User-Generated Multimedia Content10.1007/978-3-319-61807-4_1(1-30)Online publication date: 1-Sep-2017
  • (2016)Multimodal-based Multimedia Analysis, Retrieval, and Services in Support of Social Media ApplicationsProceedings of the 24th ACM international conference on Multimedia10.1145/2964284.2971471(1425-1429)Online publication date: 1-Oct-2016
  • (2016)Multimodal Analysis of User-Generated Content in Support of Social Media ApplicationsProceedings of the 2016 ACM on International Conference on Multimedia Retrieval10.1145/2911996.2912032(423-426)Online publication date: 6-Jun-2016

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media