[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1631272.1631451acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
tutorial

Parallel algorithms for mining large-scale rich-media data

Published: 19 October 2009 Publication History

Abstract

The amount of online photos and videos is now at the scale of tens of billions. To organize, index, and retrieve these large-scale rich-media data, a system must employ scalable data management and mining algorithms. The research community needs to consider solving large scale problems rather than solving problems with small datasets that do not reflect real life scenarios. This tutorial introduces key challenges in large-scale rich-media data mining, and presents parallel algorithms for tackling such challenges. We present our parallel implementations of Spectral Clustering (PSC), FP-Growth (PFP), Latent Dirichlet Allocation (PLDA), and Support Vector Machines (PSVM).

References

[1]
Y. Song, W.-Y. Chen, H. Bai, C.-J. Lin, and E. Y. Chang. Parallel Spectral Clustering. In Proc. of ECML/PKDD, 2008.
[2]
H. Li, Y. Wang, D. Zhang, M. Zhang, and E. Y. Chang, PFP--Parallel FP-Growth Algorithm for Query Recommendation, ACM Recommendation Systems, 2008.
[3]
Y. Wang, H. Bai, M. Stanton, W.-Y. Chen, E. Y. Chang, PLDA--Parallel Latent Dirichlet Allocation for Large-scale Applications, AAIM, June 2009.
[4]
E.Y. Chang, K. Zhu, H. Wang, H. Bai, J. Li, Z. Qiu, H. Cui, PSVM: Parallelizing Support Vector Machines on Distributed Computers, NIPS, 2007
[5]
F. R. Bach and M. I. Jordan. Learning Spectral Clustering. In Proc. of NIPS, 2003.
[6]
W.-Y. Chen, J.C. Chu, J. Luan, H. Bai, Y. Wang, E. Y. Chang, Collaborative Filtering for Orkut Communities: Discovery of User Latent Behavior, 18th International WWW Conference, 2009.
[7]
D.M. Blei, A.Y. Ng, M.I. Jordan, Latent Dirichlet allocation. Journal of Machine Learning Research, 2003
[8]
L.-J. Li,.G.,Wang, and L. Fei-Fei, OPTIMOL, Automatic Online Picture Collection via Incremental Model Learning, CVPR, 2007.
[9]
PLDA Open Source http://code.google.com/p/plda/.
[10]
K.-S. Goh and E. Y. Chang, One, Two Class SVMs for Multiclass Image Annotation, IEEE Transactions on Knowledge and Data Engineering (TKDE), Volume 17, Number 10, October 2005.
[11]
K.-S. Goh, E. Y. Chang, and W.-C. Lai, Concept-dependent Multimodal Active Learning for Image Retrieval, ACM International Conference on Multimedia (MM), pp.564--571, New York, October 2004
[12]
PSVM Open Source http://code.google.com/p/psvm/.

Cited By

View all
  • (2023)Software engineering for internet of underwater things to analyze oceanic dataInternet of Things10.1016/j.iot.2023.10089324(100893)Online publication date: Dec-2023
  • (2020)A comparative study of Distributed Large Scale Data Mining AlgorithmsBSSS Journal of Computer10.51767/jc1102Online publication date: 25-May-2020
  • (2020)Healthcare informatics and analytics in big dataExpert Systems with Applications10.1016/j.eswa.2020.113388152(113388)Online publication date: Aug-2020
  • Show More Cited By

Index Terms

  1. Parallel algorithms for mining large-scale rich-media data

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '09: Proceedings of the 17th ACM international conference on Multimedia
    October 2009
    1202 pages
    ISBN:9781605586083
    DOI:10.1145/1631272

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 19 October 2009

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. PFP
    2. PLDA
    3. PSC
    4. PSVM
    5. distributed computing
    6. distributed optimization
    7. large-scale data mining
    8. large-scale machine learning

    Qualifiers

    • Tutorial

    Conference

    MM09
    Sponsor:
    MM09: ACM Multimedia Conference
    October 19 - 24, 2009
    Beijing, China

    Acceptance Rates

    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 07 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Software engineering for internet of underwater things to analyze oceanic dataInternet of Things10.1016/j.iot.2023.10089324(100893)Online publication date: Dec-2023
    • (2020)A comparative study of Distributed Large Scale Data Mining AlgorithmsBSSS Journal of Computer10.51767/jc1102Online publication date: 25-May-2020
    • (2020)Healthcare informatics and analytics in big dataExpert Systems with Applications10.1016/j.eswa.2020.113388152(113388)Online publication date: Aug-2020
    • (2019)A Comprehensive Survey on Cloud Data Mining (CDM) Frameworks and AlgorithmsACM Computing Surveys10.1145/334926552:5(1-62)Online publication date: 13-Sep-2019
    • (2019)Parallel Computing of Support Vector MachinesACM Computing Surveys10.1145/328098951:6(1-38)Online publication date: 28-Jan-2019
    • (2018)Challenges in Mining Big Data StreamsData and Communication Networks10.1007/978-981-13-2254-9_15(173-183)Online publication date: 30-Dec-2018
    • (2017)Big data challenges in ocean observationPersonal and Ubiquitous Computing10.1007/s00779-016-0980-221:1(55-65)Online publication date: 1-Feb-2017
    • (2017)Adding Big Value to Big Businesses: A Present State of the Art of Big Data, Frameworks and AlgorithmsICT Based Innovations10.1007/978-981-10-6602-3_17(171-184)Online publication date: 1-Oct-2017
    • (2016)Big Data MiningEffective Big Data Management and Opportunities for Implementation10.4018/978-1-5225-0182-4.ch003(53-59)Online publication date: 2016
    • (2016)Paths sharing based FP-growth data mining algorithms2016 8th International Conference on Wireless Communications & Signal Processing (WCSP)10.1109/WCSP.2016.7752497(1-4)Online publication date: Oct-2016
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media