[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1557019.1557123acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Exploring social tagging graph for web object classification

Published: 28 June 2009 Publication History

Abstract

This paper studies web object classification problem with the novel exploration of social tags. Automatically classifying web objects into manageable semantic categories has long been a fundamental preprocess for indexing, browsing, searching, and mining these objects. The explosive growth of heterogeneous web objects, especially non-textual objects such as products, pictures, and videos, has made the problem of web classification increasingly challenging. Such objects often suffer from a lack of easy-extractable features with semantic information, interconnections between each other, as well as training examples with category labels.
In this paper, we explore the social tagging data to bridge this gap. We cast web object classification problem as an optimization problem on a graph of objects and tags. We then propose an efficient algorithm which not only utilizes social tags as enriched semantic features for the objects, but also infers the categories of unlabeled objects from both homogeneous and heterogeneous labeled objects, through the implicit connection of social tags. Experiment results show that the exploration of social tags effectively boosts web object classification. Our algorithm significantly outperforms the state-of-the-art of general classification methods.

Supplementary Material

JPG File (p957-yin.jpg)
MP4 File (p957-yin.mp4)

References

[1]
S. Bao, G.-R. Xue, X. Wu, Y. Yu, B. Fei, and Z. Su. Optimizing web search using social annotations. In WWW, pages 501--510, 2007.
[2]
C. H. Brooks and N. Montanez. Improved annotation of the blogosphere via autotagging and hierarchical clustering. In WWW, pages 625--632, 2006.
[3]
S. Chakrabarti, B. Dom, and P. Indyk. Enhanced hypertext categorization using hyperlinks. In SIGMOD Conference, pages 307--318, 1998.
[4]
W. Dai, G.-R. Xue, Q. Yang, and Y. Yu. Transferring naive bayes classifiers for text classification. In AAAI, pages 540--545, 2007.
[5]
C. Do and A. Y. Ng. Transfer learning for text classification. In NIPS, 2005.
[6]
R. Ghani, S. Slattery, and Y. Yang. Hypertext categorization using hyperlink patterns and meta data. In ICML, pages 178--185, 2001.
[7]
P. Heymann, G. Koutrika, and H. Garcia-Molina. Can social bookmarking improve web search? In WSDM, pages 195--206, 2008.
[8]
T. Joachims. Text categorization with support vector machines: Learning with many relevant features. In ECML, pages 137--142, 1998.
[9]
P. R. Kalva, F. Enembreck, and A. L. Koerich. Web image classification based on the fusion of image and text classifiers. In ICDAR, pages 561--568, 2007.
[10]
X. Li, L. Guo, and Y. E. Zhao. Tag-based social interest discovery. In WWW, pages 675--684, 2008.
[11]
W.-H. Lin and A. G. Hauptmann. News video classification using svm-based multimodal classifiers and combination strategies. In ACM Multimedia, pages 323--326, 2002.
[12]
X. Qi and B. D. Davison. Web page classification: Features and algorithms. ACM Comput. Surv., 41(2), 2009.
[13]
R. Schenkel, T. Crecelius, M. Kacimi, S. Michel, T. Neumann, J. X. Parreira, and G. Weikum. Efficient top-k querying over social-tagging networks. In SIGIR, pages 523--530, 2008.
[14]
D. Shen, J.-T. Sun, Q. Yang, and Z. Chen. A comparison of implicit and explicit links for web page classification. In WWW, pages 643--650, 2006.
[15]
X. Wu, L. Zhang, and Y. Yu. Exploring social annotations for the semantic web. In WWW, pages 417--426, 2006.
[16]
G.-R. Xue, W. Dai, Q. Yang, and Y. Yu. Topic-bridged plsa for cross-domain text classification. In SIGIR, pages 627--634, 2008.
[17]
G.-R. Xue, D. Shen, Q. Yang, H.-J. Zeng, Z. Chen, Y. Yu, W. Xi, and W.-Y. Ma. IRC: An iterative reinforcement categorization algorithm for interrelated web objects. In ICDM, pages 273--280, 2004.
[18]
D. Zhou, J. Bian, S. Zheng, H. Zha, and C. L. Giles. Exploring social annotations for information retrieval. In WWW, pages 715--724, 2008.
[19]
D. Zhou, O. Bousquet, T. N. Lal, J. Weston, and B. Sch-olkopf. Learning with local and global consistency. In NIPS, 2003.
[20]
X. Zhu, Z. Ghahramani, and J. D. Lafferty. Semi-supervised learning using gaussian fields and harmonic functions. In ICML, pages 912--919, 2003.

Cited By

View all

Index Terms

  1. Exploring social tagging graph for web object classification

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '09: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
    June 2009
    1426 pages
    ISBN:9781605584959
    DOI:10.1145/1557019
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 28 June 2009

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. optimization
    2. social tagging
    3. web classification

    Qualifiers

    • Research-article

    Conference

    KDD09

    Acceptance Rates

    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Upcoming Conference

    KDD '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)10
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2020)Heterogeneous GraphletsACM Transactions on Knowledge Discovery from Data10.1145/341877315:1(1-43)Online publication date: 7-Dec-2020
    • (2019)Multi-level diversification approach of semantic-based image retrieval resultsProgress in Artificial Intelligence10.1007/s13748-019-00195-xOnline publication date: 22-Jun-2019
    • (2018)Joint Latent Dirichlet Allocation for Social TagsIEEE Transactions on Multimedia10.1109/TMM.2017.271682920:1(224-237)Online publication date: 1-Jan-2018
    • (2018)Collaborative text categorization via exploiting sparse coefficientsWorld Wide Web10.1007/s11280-017-0460-221:2(373-394)Online publication date: 1-Mar-2018
    • (2017)An Efficient Multiclass Classifier Using On-Page Positive Personality Features for Web Page Classification for the Next Generation Wireless Communication NetworksWireless Personal Communications: An International Journal10.1007/s11277-016-3173-493:2(503-522)Online publication date: 1-Mar-2017
    • (2016)Collaborative Tagging for Collective IntelligenceEncyclopedia of E-Commerce Development, Implementation, and Management10.4018/978-1-4666-9787-4.ch138(1957-1969)Online publication date: 2016
    • (2016)Visualization of High-Level Associations from Twitter DataBusiness Intelligence10.4018/978-1-4666-9562-7.ch036(692-712)Online publication date: 2016
    • (2016)On Transductive Classification in Heterogeneous Information NetworksProceedings of the 25th ACM International on Conference on Information and Knowledge Management10.1145/2983323.2983730(811-820)Online publication date: 24-Oct-2016
    • (2016)We Know Where You Are: Home Location Identification in Location-Based Social Networks2016 25th International Conference on Computer Communication and Networks (ICCCN)10.1109/ICCCN.2016.7568598(1-9)Online publication date: Aug-2016
    • (2016)Optimization and label propagation in bipartite heterogeneous networks to improve transductive classification of textsInformation Processing and Management: an International Journal10.1016/j.ipm.2015.07.00452:2(217-257)Online publication date: 1-Mar-2016
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media