Abstract
The paper proposes an optimal distributed k Nearest Neighbor query processing algorithm based on Data Grid, called the opGkNN. Three steps are incorporated in the opGkNN. First when a user submits a query with a vector Vq and a number k, an iDistance[3]-based vector set reduction is first conducted at data node level in parallel. Then the candidate vectors are transferred to the executing nodes for the refinement process in which the answer set is obtained. Finally, the answer set is transferred to the query node. The experimental results show that the performance of the algorithm is efficient and effective in minimizing the response time by decreasing network transfer cost and increasing the parallelism of I/O and CPU.
This paper is partially supported by the Program of National Natural Science Foundation of China under Grant No. 60873022; The Program of Natural Science Foundation of Zhejiang Province under Grant No. Y1080148; The Key Program of Science and Technology of Zhejiang Province under Grant No. 2008C13082; The Open Project of Zhejiang Provincial Key Laboratory of Information Network Technology; The Key Project of Special Foundation for Young Scientists in Zhejiang Gongshang University.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Foster, I., Kesselman, C.: The Grid: Blueprint for a New Computing Infrastructure. Morgan Kaufmann, San Francisco (1998)
Böhm, C., Berchtold, S., Keim, D.: Searching in High-dimensional Spaces: Index Structures for Improving the Performance of Multimedia Databases. ACM Computing Surveys 33(3) (2001)
Jagadish, H.V., Ooi, B.C., Tan, K.L., Yu, C., Zhang, R.: iDistance: An Adaptive B+-tree Based Indexing Method for Nearest Neighbor Search. TODS 30(2), 364–397 (2005)
Berchtold, S., Bohm, C., Braunmuller, B., Keim, D.A., et al.: Fast Parallel Similarity Search in Multimedia Databases. In: SIGMOD, pp. 1–12
Zhuang, Y., Zhuang, Y.T., Li, Q., Wu, F.: Speeding Up Similarity Queries over Large Chinese Calligraphic Character Databases Using Data Grid. In: GCC, pp. 225–236 (2007)
UCI KDD Archive (2002), http://www.kdd.ics.uci.edu
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhuang, Y., Hu, H., Li, X., Xu, B., Hu, H. (2009). Optimal K-Nearest-Neighbor Query in Data Grid. In: Li, Q., Feng, L., Pei, J., Wang, S.X., Zhou, X., Zhu, QM. (eds) Advances in Data and Web Management. APWeb WAIM 2009 2009. Lecture Notes in Computer Science, vol 5446. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00672-2_53
Download citation
DOI: https://doi.org/10.1007/978-3-642-00672-2_53
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00671-5
Online ISBN: 978-3-642-00672-2
eBook Packages: Computer ScienceComputer Science (R0)