[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2701126.2701157acmconferencesArticle/Chapter ViewAbstractPublication PagesicuimcConference Proceedingsconference-collections
research-article

Lossless graph summarization using dense subgraphs discovery

Published: 08 January 2015 Publication History

Abstract

Dense subgraph discovery, in a large graph, is useful to solve the community search problem. Motivated from this, we propose a graph summarization method where we search and aggregate dense subgraphs into super nodes. Since the dense subgraphs have high overlap of common neighbors, thus merging such subgraphs can produce a highly compact summary graph. Whereas the member nodes of each dense subgraph have many edges in common, they also have some edges not belonging to a given subgraph; which in turn reduce the compression ratio. To solve this problem, we propose the concept of AutoPruning that effectively filters the dense subgraphs having higher ratio of common to that of non-common neighbors. To summarize the dense subgraphs, we use the Minimum Description Length (MDL) principle to obtain a highly compact summary with least edge corrections for lossless compression. We propose two alternatives to trade-off the computation time and compression ratio while creating a super node from each dense subgraph. Through experiments on two publicly available real world graphs, we compare the proposed approach with the well known Minimum Degree Measure (MDM), for dense subgraph discovery, and observe very encouraging results.

References

[1]
Y. Asahiro, K. Iwama, H. Tamaki, and T. Tokuyama. Greedily finding a dense subgraph. Journal of Algorithms, 34(2): 203--221, 2000.
[2]
P. Boldi and S. Vigna. The webgraph framework i: compression techniques. In Proceedings of the 13th international conference on World Wide Web, pages 595--602. ACM, 2004.
[3]
G. Buehrer and K. Chellapilla. A scalable pattern mining approach to web graph compression with communities. In Proceedings of the 2008 International Conference on Web Search and Data Mining, pages 95--106. ACM, 2008.
[4]
M. Charikar. Greedy approximation algorithms for finding dense components in a graph. In Approximation Algorithms for Combinatorial Optimization, pages 84--95. Springer, 2000.
[5]
F. Chierichetti, R. Kumar, S. Lattanzi, M. Mitzenmacher, A. Panconesi, and P. Raghavan. On compressing social networks. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 219--228. ACM, 2009.
[6]
W. Cui, Y. Xiao, H. Wang, J. Hong, and W. Wang. Local search of communities in large graphs. In Proceedings of the 2014 ACM SIGMOD international conference on Management of data. ACM, 2014.
[7]
C. Hernández and G. Navarro. Compressed representations for web and social graphs. Knowledge and Information Systems, pages 1--35, 2013.
[8]
U. Kang and C. Faloutsos. Beyond'caveman communities': Hubs and spokes for graph compression and mining. In Data Mining (ICDM), 2011 IEEE 11th International Conference on, pages 300--309. IEEE, 2011.
[9]
K. U. Khan, N. Waqas, and Y.-K. Lee. Set-based approximate approach for lossless graph summarization. The Computing Journal- Under Review, 2014.
[10]
S. Khuller and B. Saha. On finding dense subgraphs. In Automata, Languages and Programming, pages 597--608. Springer, 2009.
[11]
S. Navlakha, R. Rastogi, and N. Shrivastava. Graph summarization with bounded error. In Proceedings of the 2008 ACM SIGMOD international conference on Management of data, pages 419--432. ACM, 2008.
[12]
J. Rissanen. Modeling by shortest data description. Automatica, 14(5): 465--471, 1978.
[13]
S. B. Seidman. Network structure and minimum degree. Social networks, 5(3): 269--287, 1983.
[14]
M. Sozio and A. Gionis. The community-search problem and how to plan a successful cocktail party. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 939--948. ACM, 2010.
[15]
Y. Tian, R. A. Hankins, and J. M. Patel. Efficient aggregation for graph summarization. In Proceedings of the 2008 ACM SIGMOD international conference on Management of data, pages 567--580. ACM, 2008.
[16]
H. Toivonen, F. Zhou, A. Hartikainen, and A. Hinkka. Compression of weighted graphs. In Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 965--973. ACM, 2011.

Cited By

View all
  • (2022)The minimum description length principle for pattern mining: a surveyData Mining and Knowledge Discovery10.1007/s10618-022-00846-z36:5(1679-1727)Online publication date: 4-Jul-2022
  • (2017)Graph Summarization Based on Attribute-Connected NetworkWeb and Big Data10.1007/978-3-319-69781-9_16(161-171)Online publication date: 8-Nov-2017
  • (2016)Extracting Semantics from Legacy Scientific Workflows2016 IEEE Tenth International Conference on Semantic Computing (ICSC)10.1109/ICSC.2016.102(9-16)Online publication date: Feb-2016
  • Show More Cited By

Index Terms

  1. Lossless graph summarization using dense subgraphs discovery

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      IMCOM '15: Proceedings of the 9th International Conference on Ubiquitous Information Management and Communication
      January 2015
      674 pages
      ISBN:9781450333771
      DOI:10.1145/2701126
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 08 January 2015

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. MDL
      2. dense subgraphs
      3. graph summarization
      4. minimum degree measure

      Qualifiers

      • Research-article

      Funding Sources

      • Korea Government (MEST)

      Conference

      IMCOM '15
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 213 of 621 submissions, 34%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)8
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 13 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2022)The minimum description length principle for pattern mining: a surveyData Mining and Knowledge Discovery10.1007/s10618-022-00846-z36:5(1679-1727)Online publication date: 4-Jul-2022
      • (2017)Graph Summarization Based on Attribute-Connected NetworkWeb and Big Data10.1007/978-3-319-69781-9_16(161-171)Online publication date: 8-Nov-2017
      • (2016)Extracting Semantics from Legacy Scientific Workflows2016 IEEE Tenth International Conference on Semantic Computing (ICSC)10.1109/ICSC.2016.102(9-16)Online publication date: Feb-2016
      • (2015)Analyzing Subgraph Isomorphism on Graphs with Diverse Structural PropertiesProceedings of the 2015 International Conference on Big Data Applications and Services10.1145/2837060.2837088(170-174)Online publication date: 20-Oct-2015

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media