[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
article

Learning to detect events with Markov-modulated poisson processes

Published: 01 December 2007 Publication History

Abstract

Time-series of count data occur in many different contexts, including Internet navigation logs, freeway traffic monitoring, and security logs associated with buildings. In this article we describe a framework for detecting anomalous events in such data using an unsupervised learning approach. Normal periodic behavior is modeled via a time-varying Poisson process model, which in turn is modulated by a hidden Markov process that accounts for bursty events. We outline a Bayesian framework for learning the parameters of this model from count time-series. Two large real-world datasets of time-series counts are used as testbeds to validate the approach, consisting of freeway traffic data and logs of people entering and exiting a building. We show that the proposed model is significantly more accurate at detecting known events than a more traditional threshold-based technique. We also describe how the model can be used to investigate different degrees of periodicity in the data, including systematic day-of-week and time-of-day effects, and to make inferences about different aspects of events such as number of vehicles or people involved. The results indicate that the Markov-modulated Poisson framework provides a robust and accurate framework for adaptively and autonomously learning how to separate unusual bursty events from traces of normal human activity.

References

[1]
Baum, L. E., Petrie, T., Soules, G., and Weiss, N. 1970. A maximization technique occurring in statistical analysis of probabilistic functions of Markov chains. Ann. Math. Statis. 41, 1 (Feb.), 164--171.
[2]
Buntine, W. 1994. Operations for learning with graphical models. J. Artif. Intell. Res. 2, 159--225.
[3]
Chen, C., Petty, K., Skabardonis, A., Varaiya, P., and Jia, Z. 2001. Freeway performance measurement system: Mining loop detector data. In the 80th Annual Meeting of the Transportation Research Board. Washington, D.C. http://pems.eecs.berkeley.edu/.
[4]
Chib, S. 1995. Marginal likelihood from the Gibbs output. J. Amer. Statis. Assoc. 90, 432 (Dec.), 1313--1321.
[5]
Gelfand, A. E. and Dey, D. K. 1990. Bayesian model choice: Asymptotics and exact calculations. J. Royal Statis. Soc. Series C 56, 3, 501--514.
[6]
Gelfand, A. E. and Smith, A. F. M. 1990. Sampling-Based approaches to calculating marginal densities. J. Amer. Statis. Assoc. 85, 398--409.
[7]
Geman, S. and Geman, D. 1984. Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans. Pattern Anal. Mach. Intell. 6, 6 (Nov.), 721--741.
[8]
Guralnik, V. and Srivastava, J. 1999. Event detection from time-series data. In Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD). ACM Press, New York, 33--42.
[9]
Heffes, H. and Lucantoni, D. M. 1984. A Markov-modulated characterization of packetized voice and data traffic and related statistical multiplexer performance. IEEE J. Selected Areas Commun. 4, 6, 856--868.
[10]
Ihler, A., Hutchins, J., and Smyth, P. 2006. Adaptive event detection with time--varying Poisson processes. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD). ACM Press, New York, 207--216.
[11]
Jordan, M. I., ed. 1998. Learning in Graphical Models. MIT Press, Cambridge, MA.
[12]
Keogh, E., Lonardi, S., and chi' Chiu, B. Y. 2002. Finding surprising patterns in a time-series database in linear time and space. In Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD). ACM Press, New York, 550--556.
[13]
Kleinberg, J. 2002. Bursty and hierarchical structure in streams. In Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD). ACM Press, New York, 91--101.
[14]
Papoulis, A. 1991. Probability, Random Variables, and Stochastic Processes, 3rd ed. McGraw-Hill, New York.
[15]
Salmenkivi, M. and Mannila, H. 2005. Using Markov chain Monte Carlo and dynamic programming for event sequence data. Knowl. Inf. Syst. 7, 3, 267--288.
[16]
Scott, S. 1998. Bayesian methods and extensions for the two state Markov modulated Poisson process. Ph.D. thesis, Harvard University, Department of Statistics.
[17]
Scott, S. 2002. Detecting network intrusion using a Markov modulated nonhomogeneous Poisson process. http://www-rcf.usc.edu/~sls/mmnhpp.ps.gz.
[18]
Scott, S. L. and Smyth, P. 2003. The Markov modulated Poisson process and Markov Poisson cascade with applications to Web traffic data. In Bayesian Statistics, vol. 7, M. J. Bayarri et al., eds. Oxford University Press, Oxford, UK, 671--680.
[19]
Svensson, A. 1981. On a goodness of fit test for multiplicative Poisson models. Ann. Statis. 9, 4, 697--704.

Cited By

View all
  • (2024)A comparison of three algorithms in the filtering of a Markov-modulated non-homogeneous Poisson processInternational Journal of Systems Science10.1080/00207721.2023.229474755:4(741-770)Online publication date: Jan-2024
  • (2024)Cross-organizational data exchange based on consortium blockchain with consistency guaranteeThe Journal of Supercomputing10.1007/s11227-024-06164-z80:12(18199-18236)Online publication date: 1-Aug-2024
  • (2023)Long-Term Risk Evaluation of Power System Considering Extreme Weather Events2023 5th Asia Energy and Electrical Engineering Symposium (AEEES)10.1109/AEEES56888.2023.10114287(753-757)Online publication date: 23-Mar-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Knowledge Discovery from Data
ACM Transactions on Knowledge Discovery from Data  Volume 1, Issue 3
December 2007
145 pages
ISSN:1556-4681
EISSN:1556-472X
DOI:10.1145/1297332
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 December 2007
Published in TKDD Volume 1, Issue 3

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Event detection
  2. Markov modulated
  3. Poisson

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)14
  • Downloads (Last 6 weeks)2
Reflects downloads up to 18 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)A comparison of three algorithms in the filtering of a Markov-modulated non-homogeneous Poisson processInternational Journal of Systems Science10.1080/00207721.2023.229474755:4(741-770)Online publication date: Jan-2024
  • (2024)Cross-organizational data exchange based on consortium blockchain with consistency guaranteeThe Journal of Supercomputing10.1007/s11227-024-06164-z80:12(18199-18236)Online publication date: 1-Aug-2024
  • (2023)Long-Term Risk Evaluation of Power System Considering Extreme Weather Events2023 5th Asia Energy and Electrical Engineering Symposium (AEEES)10.1109/AEEES56888.2023.10114287(753-757)Online publication date: 23-Mar-2023
  • (2020)Investigating Network Traffic and Selecting a Matching Mathematical ModelHerald of the Bauman Moscow State Technical University. Series Instrument Engineering10.18698/0236-3933-2020-3-84-99(84-99)Online publication date: Sep-2020
  • (2019)Reasoning with Probabilistic and Deterministic Graphical Models: Exact Algorithms, Second EditionSynthesis Lectures on Artificial Intelligence and Machine Learning10.2200/S00893ED2V01Y201901AIM04113:1(1-199)Online publication date: 14-Feb-2019
  • (2019)Measuring the size of a crowd using InstagramEnvironment and Planning B: Urban Analytics and City Science10.1177/239980831984161547:9(1690-1703)Online publication date: 22-Apr-2019
  • (2018)On Real-Time Monitoring on Data Stream for Traffic Flow Anomalies2018 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Ubiquitous Computing & Communications, Big Data & Cloud Computing, Social Computing & Networking, Sustainable Computing & Communications (ISPA/IUCC/BDCloud/SocialCom/SustainCom)10.1109/BDCloud.2018.00058(322-329)Online publication date: Dec-2018
  • (2018)Information Spreading During Emergencies and Anomalous EventsComplex Spreading Phenomena in Social Systems10.1007/978-3-319-77332-2_15(269-286)Online publication date: 22-Jun-2018
  • (2017)Forecasting Ad-Impressions on Online Retail Websites using Non-homogeneous Hawkes ProcessesProceedings of the 2017 ACM on Conference on Information and Knowledge Management10.1145/3132847.3133017(1089-1098)Online publication date: 6-Nov-2017
  • (2017)Copula Analysis of Temporal Dependence Structure in Markov Modulated Poisson Process and Its ApplicationsACM Transactions on Modeling and Performance Evaluation of Computing Systems10.1145/30892542:3(1-28)Online publication date: 29-Jun-2017
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media