[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2093973.2094049acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
poster

Spatial categorical outlier detection: pair correlation function based approach

Published: 01 November 2011 Publication History

Abstract

Spatial Categorical Outlier Detection (SCOD) has attracted considerable attentions from the areas of spatial data mining and geological analysis. When encountering an SCOD problem, some researchers introduce to utilize Spatial Numerical Outlier Detection measures by mapping categorical attributes to continuous ones. However, such approaches fail to capture the special properties of spatial categorical data, which is prone to incur the masking and swamping issues. In this paper, we model spatial dependencies between spatial categorical observations and propose a Pair Correlation Function(PCF) based method to detect SCOs. First, a new metric, named Pair Correlation Ratio(PCR), is estimated for each pair of categorical combinations based on their co-occurrence frequency at different spatial distances. Then discrete PCRs are fitted in a continuous function of distances. The outlier score is computed using the average PCRs between referenced object and its spatial neighbors. Observations with the lowest PCRs are labeled as potential SCOs. Extensive experiments demonstrated that PCF based method outperformed existing approaches.

References

[1]
N. R. Adam, V. P. Janeja, and V. Atluri. Neighborhood based detection of anomalies in high dimensional spatio-temporal sensor datasets. In ACM SAC '04, pages 576--583, 2004.
[2]
L. Anselin. Local indicators of spatial association-lisa. Geographical Analysis, 27(2):93--115, 1995.
[3]
S. Boriah, V. Chandola, and V. Kumar. Similarity measures for categorical data: A comparative evaluation. In SDM, pages 243--254, 2008.
[4]
M. M. Breunig, H.-P. Kriegel, R. T. Ng, and J. Sander. Lof: identifying density-based local outliers. ACM SIGMOD '00, pages 93--104, 2000.
[5]
V. Chandola, S. Boriah, and V. Kuman. Understanding categorical similarity measures for outlier detection. Technical report, University of Minnesota, 2008.
[6]
K. Das and J. Schneider. Detecting anomalous records in categorical datasets. In Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '07, pages 220--229, New York, NY, USA, 2007. ACM.
[7]
P. Goovaerts. Geostatistics for natural resources evaluation. Applied geostatistics series. Oxford University Press, 1997.
[8]
http://www.iiasa.ac.at/Research/LUC/External-World-soil database/HTML/.
[9]
J. Illian, A. Penttinen, H. Stoyan, and D. Stoyan. Statistical analysis and modelling of spatial point patterns. 2008.
[10]
A. Koufakou, E. G. Ortiz, M. Georgiopoulos, G. C. Anagnostopoulos, and K. M. Reynolds. A scalable and efficient outlier detection strategy for categorical data. IEEE ICTAI '07, pages 210--217, 2007.
[11]
S. Ramaswamy, R. Rastogi, and K. Shim. Efficient algorithms for mining outliers from large data sets. ACM SIGMOD '00, pages 427--438, 2000.
[12]
T. Reed and K. Gubbins. Applied statistical mechanics: thermodynamic and transport properties of fluids. Butterworth-Heinemann reprint series in chemical engineering. Butterworth-Heinemann, 1973.
[13]
S. Shekhar, C.-T. Lu, and P. Zhang. Detecting graph-based spatial outliers: algorithms and applications (a summary of results). In KDD, pages 371--376, 2001.
[14]
W. R. Tobler. A computer movie simulating urban growth in the detroit region. Economic Geography, 46(2):234--240, 1970.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
GIS '11: Proceedings of the 19th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
November 2011
559 pages
ISBN:9781450310314
DOI:10.1145/2093973

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 November 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. outlier detection
  2. pair correlation
  3. spatial categorical data

Qualifiers

  • Poster

Conference

GIS '11
Sponsor:

Acceptance Rates

Overall Acceptance Rate 257 of 1,238 submissions, 21%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 182
    Total Downloads
  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 24 Dec 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media