[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1109/ICDM.2005.16guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

A Visual Data Mining Framework for Convenient Identification of Useful Knowledge

Published: 27 November 2005 Publication History

Abstract

Data mining algorithms usually generate a large number of rules, which may not always be useful to human users. In this project, we propose a novel visual data-mining framework, called Opportunity Map, to identify useful and actionable knowledge quickly and easily from the discovered rules. The framework is inspired by the House of Quality from Quality Function Deployment (QFD) in Quality Engineering. It associates discovered rules, related summarized data and data distributions with the application objective using an interactive matrix. Combined with drill down visualization, integrated visualization of data distribution bars and rules, visualization of trend behaviors, and comparative analysis, the Opportunity Map allows users to analyze rules and data at different levels of detail and quickly identify the actionable knowledge and opportunities. The proposed framework represents a systematic and flexible approach to rule analysis. Applications of the system to large-scale data sets from our industrial partner have yielded promising results.

References

[1]
Adomavicius G. and Tuzhilin, A. "Discovery of actionable patterns in databases: the action hierarchy approach". KDD-97, 1997.
[2]
Agrawal R. and Srikant R. "Fast algorithms for mining association rules". VLDB-1994.
[3]
Blanchard J, Guillet F, Briand H. "Exploratory Visualization for Association Rule Rummaging". KDD- 03 Workshop on Multimedia Data Mining, 2003.
[4]
Bruzzese D., Davino C. "Visual Post-Analysis of Association Rules" ECML/PKDD VDM Workshop 2002.
[5]
Chambers J.M, Cleveland W. S, Kleiner B, and Tukey P.A. Graphical Methods for Data Analysis. Chapman & Hall, 1983.
[6]
Cohen L. Quality Function Deployment. Prentice Hall, 1995.
[7]
Han J, Cercone N. "RuleViz: A Model for Visualizing Knowledge Discovery Process". KDD-00, 2000.
[8]
Han J., Fu, Y., Wang W., Koperski, K. and Zaiane, O. "DMQL: a data mining query language for relational databases". SIGMOD Workshop on DMKD, 1996.
[9]
Hofmann H, Siebes A., Wilhelm, A. "Visualizing association rules with interactive mosaic plots". KDD- 00, 2000.
[10]
Jorge A., Pocas J., Azevedo P. "Post-processing environment for browsing large sets of association rules". PKDD-02 VDM Workshop, 2002.
[11]
Keim D. "Information Visualization and Visual Data Mining". IEEE Trans. Vis. Comput. Graph, 2002.
[12]
Liu B., Hsu W., and Chen S. "Using general impressions to analyze discovered classification rules". KDD-97, 1997.
[13]
Liu B., Hsu W, Ma Y. "Integrating Classification and Association Rule Mining". KDD-98, 1998.
[14]
Liu B., Hsu W., and Ma Y. "Mining Association Rules with Multiple Minimum Support". KDD-99. 1999.
[15]
Liu, B., Hsu, H., Ma, Y. "Identifying Non-Actionable Association Rules". KDD-01, 2001.
[16]
Ma S., Hellerstein J. "Ordering Categorical Data to Improve Visualization". INFOVIS-99, 1999.
[17]
Ong K-H, Ong K-L, Ng W-K, Lim E-P. "CrystalClear: Active Visualization of Association Rules". ICDM-02 Workshop on Active Mining (AM-02), 2002.
[18]
Padmanabhan B. and Tuzhilin A. "A belief-driven method for discovering unexpected patterns". KDD-98.
[19]
Piatesky-Shapiro G., and Matheus C. "The interestingness of deviations". KDD-94, 1994.
[20]
Quinlan J. R. 1992. C4.5: program for machine learning. Morgan Kaufmann.
[21]
Silberschatz A., and Tuzhilin, A. "What makes patterns interesting in knowledge discovery systems." IEEE Trans. on Know. and Data Eng. 8(6), 1996.
[22]
Suzuki E., "Autonomous discovery of reliable exception rules". KDD-97, 1997.
[23]
Terninko J. "Step-by-Step QFD: Customer-Driven Product Design". Saint Lucie Press. 1997.
[24]
Tuzhilin A., and Liu, B. "Querying multiple sets of discovered rules". KDD, 2002.
[25]
Wong P-C, Whitney P, Thomas J. "Visualizing Association Rules for Text Mining". INFOVIS-99.
[26]
Zhao K. Liu B. "Visual Analysis of the Behavior of Discovered Rules". KDD-2001 Workshop on Visual Data Mining.
[27]
Zhao K. Liu, B., Tirpak, T., and Schaller, A. "V-Miner: Using Enhanced Parallel Coordinates to Mine Product Design and Test Data". KDD-2004.
[28]
Zhao K., Liu B., Tirpak T., and Xiao W. "Opportunity Map: A Visualization Framework for Fast Identification of Actionable Knowledge". CIKM 2005.

Cited By

View all
  • (2013)Visual Data Mining Methods for Kernel Smoothed Estimates of Cox ProcessesRevised Selected Papers of PAKDD 2013 International Workshops on Trends and Applications in Knowledge Discovery and Data Mining - Volume 786710.1007/978-3-642-40319-4_8(83-94)Online publication date: 14-Apr-2013
  • (2010)Centralized and Distributed Anonymization for High-Dimensional Healthcare DataACM Transactions on Knowledge Discovery from Data10.1145/1857947.18579504:4(1-33)Online publication date: 1-Oct-2010
  • (2006)Opportunity mapProceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining10.1145/1150402.1150524(892-901)Online publication date: 20-Aug-2006
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
ICDM '05: Proceedings of the Fifth IEEE International Conference on Data Mining
November 2005
837 pages
ISBN:0769522785

Publisher

IEEE Computer Society

United States

Publication History

Published: 27 November 2005

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2013)Visual Data Mining Methods for Kernel Smoothed Estimates of Cox ProcessesRevised Selected Papers of PAKDD 2013 International Workshops on Trends and Applications in Knowledge Discovery and Data Mining - Volume 786710.1007/978-3-642-40319-4_8(83-94)Online publication date: 14-Apr-2013
  • (2010)Centralized and Distributed Anonymization for High-Dimensional Healthcare DataACM Transactions on Knowledge Discovery from Data10.1145/1857947.18579504:4(1-33)Online publication date: 1-Oct-2010
  • (2006)Opportunity mapProceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining10.1145/1150402.1150524(892-901)Online publication date: 20-Aug-2006
  • (2006)Rule interestingness analysis using OLAP operationsProceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining10.1145/1150402.1150437(297-306)Online publication date: 20-Aug-2006

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media