[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/775047.775145acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

Discovery net: towards a grid of knowledge discovery

Published: 23 July 2002 Publication History

Abstract

This paper provides a blueprint for constructing collaborative and distributed knowledge discovery systems within Grid-based computing environments. The need for such systems is driven by the quest for sharing knowledge, information and computing resources within the boundaries of single large distributed organisations or within complex Virtual Organisations (VO) created to tackle specific projects. The proposed architecture is built on top of a resource federation management layer and is composed of a set of different resources. We show how this architecture will behave during a typical KDD process design and deployment, how it enables the execution of complex and distributed data mining tasks with high performance and how it provides a community of e-scientists with means to collaborate, retrieve and reuse both KDD algorithms, discovery processes and knowledge in a visual analytical environment.

References

[1]
P. Chapman, J. Clinton, T. Khabaza, T. Reinartz, and R. Wirth. The CRISP-DM process model, March 1999.]]
[2]
Jaturon Chattratichat, John Darlington, Yike Guo, Stefan Hedvall, Martin Kohler, and Jameel Syed. An architecture for distributed enterprise data mining. In Proceedings of the 7th Conference on High Performance Computing and Networking Europe, 1999.]]
[3]
The Data Mining Group. {http://www.dmg.org}.]]
[4]
Discovery link http://www.ibm.com/solutions/lifesciences/.]]
[5]
M. Eisen, P. Spellman, P. Brown, and D. Botstein. Cluster analysis and display of genomewide expression patterns. Proc. Natl. Acad. Sci., 95:14863--14868, 1998.]]
[6]
European datagrid project, http://www.eu-datagrid.org/.]]
[7]
Eurogrid, http://www.eurogrid.org/.]]
[8]
Usama Fayyad. Knowledge discovery in databases: An overview. In Nada Lavrač and Sašo Džeroski, editors, Proceedings of the 7th International Workshop on Inductive Logic Programming, volume 1297 of LNAI, pages 3--16, Berlin, September 17--20 1997. Springer.]]
[9]
Usama Fayyad, Gregory Piatetsky-Shapiro, and Padhraic Smyth. Knowledge discovery and data mining: Towards a unifying framework. In Proceedings of Second International Conference on Knowledge Discovery and Data Mining. AAAI Press, 1996.]]
[10]
Ian Foster and Carl Kesselman. The globus toolkit. In Ian Foster and Carl Kesselman, editors, The Grid: Blueprint for a New Computing Infrastructure, pages 259--278. Morgan Kaufmann, San Francisco, CA, 1999. Chap. 11.]]
[11]
Ian Foster, Carl kesselman, Jeffrey M. Nick, and Steven Tuecke. The physiology of the grid an open grid services architecture for distributed systems integration. Technical report, 2002.]]
[12]
Ian Foster, Carl Kesselman, and Steven Tuecke. The anatomy of the Grid: Enabling scalable virtual organization. The International Journal of High Performance Computing Applications, 15(3):200--222, Fall 2001.]]
[13]
Nathalie Furmento, Anthony Mayer, Stephen McGough, Steven Newhouse, and John Darlington. A component framework for HPC applications. Lecture Notes in Computer Science, 2150, 2001.]]
[14]
geneticxchange http://www.geneticxchange.com/.]]
[15]
Global grid forum, http://www.gridforum.org/.]]
[16]
Carole Goble. The low down on e-science and grids for biology. Comparative and Functional Genomics, pages 365--370, 2001.]]
[17]
Nasa power grid, http://www.ipg.nasa.gov/.]]
[18]
Sap http://www.sap.com/.]]
[19]
Seti institute, http://www.seti.org/.]]
[20]
Uddi http://www.uddi.org.]]
[21]
Web services technology http://www.w3.org/2002/ws/.]]
[22]
Web service description language http://www.w3.org/tr/wsdl.]]

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '02: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
July 2002
719 pages
ISBN:158113567X
DOI:10.1145/775047
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 July 2002

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

KDD02
Sponsor:

Acceptance Rates

KDD '02 Paper Acceptance Rate 44 of 307 submissions, 14%;
Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 18 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2017)Knowledge-Grid Modelling for Academic PurposesArtificial Intelligence for Knowledge Management10.1007/978-3-319-55970-4_1(1-14)Online publication date: 31-Mar-2017
  • (2015)LODFlowProceedings of the 11th International Conference on Semantic Systems10.1145/2814864.2814882(137-144)Online publication date: 16-Sep-2015
  • (2015)Parallel and Distributed Spatial Outlier Mining in GridJournal of Grid Computing10.1007/s10723-015-9326-y13:2(139-157)Online publication date: 1-Jun-2015
  • (2014)Evaluating Distributed Platforms for Protein-Guided Scientific WorkflowProceedings of the 2014 Annual Conference on Extreme Science and Engineering Discovery Environment10.1145/2616498.2616551(1-8)Online publication date: 13-Jul-2014
  • (2011)Constructing u‐City of Seoul by future foresight analysisConcurrency and Computation: Practice and Experience10.1002/cpe.169023:10(1114-1126)Online publication date: 14-Jan-2011
  • (2010)Performance study of distributed Apriori-like frequent itemsets miningKnowledge and Information Systems10.5555/3225669.322601623:1(55-72)Online publication date: 1-Apr-2010
  • (2010)Toward Distributed Knowledge Discovery on Grid SystemsEmergent Web Intelligence: Advanced Semantic Technologies10.1007/978-1-84996-077-9_9(213-243)Online publication date: 2010
  • (2010)Parallel and Grid-Based Data Mining – Algorithms, Models and Systems for High-Performance KDDData Mining and Knowledge Discovery Handbook10.1007/978-0-387-09823-4_53(1009-1028)Online publication date: 7-Jul-2010
  • (2009)High efficient scheduler for distributed data mining applicationsProceedings of the 3rd WSEAS international conference on Computer engineering and applications10.5555/1519432.1519446(87-92)Online publication date: 10-Jan-2009
  • (2009)Intelligent Agents in the Service-Oriented World - An Industrial Experience ReportProceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 0110.1109/WI-IAT.2009.365(693-696)Online publication date: 15-Sep-2009
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media