[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/584931.584941acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Effects of maximum flow algorithm on identifying web community

Published: 08 November 2002 Publication History

Abstract

In this paper, we describe the effects of using maximum flow algorithm on extracting web community from the web. A web community is a set of web pages having a common topic. Since the web can be recognized as a graph that consists of nodes and edges that represent web pages and hyperlinks respectively, so far various graph theoretical approaches have been proposed to extract web communities from the web graph. The method of finding a web community using maximum flow algorithm was proposed by NEC Research Institute in Princeton two years ago. However the properties of web communities derived by this method have been seldom known. To examine the effects of this method, we selected 30 topics randomly and experimented using Japanese web archives crawled in 2000. Through these experiments, it became clear that the method has both advantages and disadvantages. We will describe some strategies to use this method effectively. Moreover, by using same topics, we examined another method that is based on complete bipartite graphs. We compared the web communities obtained by those methods and analyzed those characteristics.

References

[1]
J. M. Kleinberg, "Authoritative Sources in a Hyperlinked Environment," In Proc. ACM-SIAM Symposium on Discrete Algorithms, 1998.
[2]
D. Gibson, J. M. Kleinberg and P. Raghavan, "Inferring Web Communities from Link Topology," In Proc. HyperText98, 1998.
[3]
S. Chakrabarti, B. Dom, P. Raghavan, S. Rajagopalan, D. Gibson and J. Kleinberg, "Automatic resource compilation by analyzing hyperlink structure and associated text," In Proc. 7th WWW Conference, 1998.
[4]
M. Toyoda and M. Kitsuregawa, "Creating a Web Community Chart for Navigating Related Communities," In Proc. 8th WWW Conference, 1999.
[5]
S. Brin and L. Page, "The anatomy of a large-scale hypertextual Web search engine," In Proc. 7th WWW Conference, 1998.
[6]
R. Lempel and S. Moran, "The Stochastic Approach for Link-Structure Analysis (SALSA) and the TKC Effect," In Proc. 9th WWW Conference, 2000.
[7]
J. Dean and M. R. Henzinger, "Finding related pages in the World Wide Web," In Proc. 8th WWW Conference, 1999.
[8]
A. Borodin, G. O. Roberts, J. S. Rosenthal and P. Tsaparas, "Finding Authorities and Hubs From Link Structures on the World Wide Web," In Proc. 10th WWW Conference, 2000.
[9]
G. W. Flake, S. Lawrence, and C. L. Giles, "Efficient Identification of Web Communities," In Proc. KDD 2000, 2000.
[10]
G. W. Flake, S. Lawrence, C. L. Giles, and F. M. Coetzee. "Self-Organization and Identification of Web Communities," IEEE Computer,35(3);66--71, 2002.
[11]
R. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins, "Trawling the web for emerging cyber-communities," In Proc. 8th WWW Conference, 1999.
[12]
R. K. Ahuja, T. L. Magnanti, and J. B. Orlin, "Network Flows: Theory, Algorithms, and Applications," Prentice Hall, Englewood Cliffs, NJ, 1993.
[13]
A. V. Goldberg and R. E. Tarjan, "A new approach to the maximal flow problem," In Proc. 18th Ann. ACM Symposium on Theory of Computing, 1986.
[14]
L. R. Ford Jr. and D. R. Fulkerson, "Maximal flow through a network," Canadian J.Math.,8:399--404, 1956.

Cited By

View all
  • (2017)Finding k most influential edges on flow graphsInformation Systems10.5555/3050918.305096065:C(93-105)Online publication date: 1-Apr-2017
  • (2017)Finding k most influential edges on flow graphsInformation Systems10.1016/j.is.2016.12.00265(93-105)Online publication date: Apr-2017
  • (2013)Using social networks to enhance customer relationship managementProceedings of the Fifth International Conference on Management of Emergent Digital EcoSystems10.1145/2536146.2536163(169-176)Online publication date: 28-Oct-2013
  • Show More Cited By

Index Terms

  1. Effects of maximum flow algorithm on identifying web community

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WIDM '02: Proceedings of the 4th international workshop on Web information and data management
    November 2002
    116 pages
    ISBN:1581135939
    DOI:10.1145/584931
    • Program Chairs:
    • Roger Chiang,
    • Ee-Peng Lim
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 08 November 2002

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. maximum-flow algorithm
    2. web community
    3. web graph

    Qualifiers

    • Article

    Conference

    CIKM02

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 10 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2017)Finding k most influential edges on flow graphsInformation Systems10.5555/3050918.305096065:C(93-105)Online publication date: 1-Apr-2017
    • (2017)Finding k most influential edges on flow graphsInformation Systems10.1016/j.is.2016.12.00265(93-105)Online publication date: Apr-2017
    • (2013)Using social networks to enhance customer relationship managementProceedings of the Fifth International Conference on Management of Emergent Digital EcoSystems10.1145/2536146.2536163(169-176)Online publication date: 28-Oct-2013
    • (2012)Community DiscoverySocial Network Mining, Analysis, and Research Trends10.4018/978-1-61350-513-7.ch004(56-64)Online publication date: 2012
    • (2012)Extracting research communities from bibliographic dataInternational Journal of Knowledge-based and Intelligent Engineering Systems10.3233/KES-2012-023016:1(25-34)Online publication date: 1-Jan-2012
    • (2012)Subject-based extraction of a latent blog communityInformation Sciences: an International Journal10.1016/j.ins.2011.08.004184:1(215-229)Online publication date: 1-Feb-2012
    • (2010)An improved algorithm for extracting research communities from bibliographic dataProceedings of the 15th international conference on Database systems for advanced applications10.5555/1880853.1880892(338-345)Online publication date: 1-Apr-2010
    • (2010)An Improved Algorithm for Extracting Research Communities from Bibliographic DataDatabase Systems for Advanced Applications10.1007/978-3-642-14589-6_34(338-345)Online publication date: 2010
    • (2009)Extracting Research Communities by Improved Maximum Flow AlgorithmProceedings of the 13th International Conference on Knowledge-Based and Intelligent Information and Engineering Systems: Part II10.1007/978-3-642-04592-9_59(472-479)Online publication date: 1-Oct-2009
    • (2007)An algorithm for modularization of MAPK and calcium signaling pathwaysJournal of Biomedical Informatics10.1016/j.jbi.2007.05.00740:6(726-749)Online publication date: 1-Dec-2007
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media