Abstract
We are interested in processing video data for the purpose of solving a variety of problems in video search, analysis, indexing, browsing and compression. Instead of concentrating on a particular problem, in this paper we present a framework for developing video applications. Our basic thesis is that video data can be represented at a higher level of abstraction as a string generated by a grammar, termed motion picture grammar. The rules of that grammar relate different spatiotemporal representations of the video content and, in particular, representations of action.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Arman, F., Hsu, A., Chiu, M.: Feature management for large video databases. SPIE 1908, Storage and Retrieval for Image and Video Databases (1993) 2–12
Liu, H. C., Zick, G. L.: Scene decomposition of mpeg compressed video. SPIE 2419, Digital Video Compression: Algorithms and Technologies (1995) 26–37
Otsuji, K., Tonomura, Y., Ohba, Y.: Video browsing using brightness data. SPIE 1606, Visual Communications and Image Processing (1991) 980–989
Sethi, I. K., Patel, N.: A statistical approach to scene change detection. SPIE 2420, Storage and Retrieval for Image and Video Databases III (1995) 329–338
Shahraray, B.: Scene change detection and content-based sampling of video sequences. SPIE 2419, Digital Video Compression: Algorithms and Technologies (1995) 2–13
Swain, M. J., Ballard, D. H.: Color indexing. International Journal of Computer Vision 7 (1991) 11–32
Zhang, H., Kankanhalli, A., Smoliar, S. W.: Automatic partitioning of full motion video. Multimedia Systems 1 (1993) 10–28
Zhang, H. J., Low, C. Y., Smoliar, S. W.: Video parsing and browsing using compressed data. Multimedia Tools and Applications 1 (1995) 89–111
Mann, S., Picard, R. W.: Virtual bellows: Constructing high quality still from video. International Conference on Image Processing, volume 1 (1994) 363–367
Sawhney, H. S., Ayer, S., Gorkani, M.: Model based 2D & 3D dominant motion estimation for mosaicking and video representation. Technical report, IBM Almaden Research Laboratory (December 1994)
Szeliski, R.: Image mosaicking for telereality applications. Technical Report CRL 94/2, DEC Cambridge Research Laboratory (1994)
Teodosio, L., Bender, W.: Salient video stills: Content and context preserved. Proceedings, Multimedia '93, ACM (1993) 39–46
Tonomura, Y., Akutsu, A., Otsuji, K., Sadakata, T.: Video map and video space icon: Tools for anatomizing video content. INTERCHI '93 Conference on Human Factors in Computing Systems, ACM (1993) 131–136
Yeung, M. M., Yeo, B. L.: Data modelling of videos with temporal events and its applications. Technical Report TR-EE-ISS-YM9603, Princeton University (April 1996)
Yow, K. D., Yeo, B. L., Yeung, M. M., Liu, B.: Analysis and presentation of soccer highlights from digital video. Second Asian Conference on Computer Vision, volume 2 (1995) 499–503
Hibino, S., Steiner, E. A. R.: A visual query language for identifying temporal trends in video data. International Workshop on Multi-media Database Management Systems (1995) 74–81
Yeung, M., Yeo, B.: Time-constrained clustering for segmentation of video into story units. ICPR '96, volume 6 (August 1996) 375–380
Swanberg, D., Shu, C. F., Jain, R.: Knowledge-guided parsing in video databases. SPIE 1908, Storage and Retrieval for Image and Video Databases (1993) 13–25
Zhang, H. J., Gong, Y. H., Smoliar, S. W., Yan, S. Y.: Automatic parsing of news video. International Conference on Multimedia Computing and Systems (1994) 45–54
Dierckx, P.: Curve and Surface Fitting with Splines. Clarendon: Oxford (1993)
Tiller, W.: Rational b-splines for curve and surface representation. IEEE CGA 3 (1983) 61–69
Faugeras, O. D.: Three-Dimensional Computer Vision. Cambridge, MA: MIT Press (1992)
Shulman, D., Aloimonos, J. Y.: (non-)rigid motion interpretation: a regularized approach. Proc. Royal Society, London B 233 (1988) 217–234
Fu, K. S.: Syntactic Pattern Recognition and Applications. Englewood Cliffs, NJ: Prentice Hall (1982)
Lee, K. H., Eom, K. B., Kashyap, R. L.: Character recognition based on attribute-dependent programmed grammar. IEEE Transactions on Pattern Analysis and Machine Intelligence 14 (1992)
Zhao, M.: Two-dimensional extended attribute grammar method for the recognition of hand-printed chinese characters. Pattern Recognition 23 1990
Charniak, E.: Statistical Language Learning. MIT Press (1993)
Huang, X. D., Ariki, Y., Jack, M. A.: Hidden Markov models for Speech Recognition. Edinburgh University Press (1990)
Grenander, U.: Elements of pattern theory. Johns Hopkins, Baltimore (1996)
Abney, S.: Stochastic attribute valued grammars. Currently working at AT & T Labs Research in Florsham Park, NJ
Keller, B., Lutz, R.: Learning stochastic context-free grammars from corpora using a genetic algorithm. ICANNGA (1997)
Johnson, M.: Attribute valued logic and the theory of grammar. CSLI Lecture Notes, volume 16. CSLI (1988)
Torenvliet, L., Trautwein, M.: A note on the complexity of restricted attribute valued grammars. Computational Linguistics in the Netherlands, Meeting at Twente (1995)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bolle, R., Aloimonos, Y., Fermüller, C. (1997). Toward motion picture grammars. In: Chin, R., Pong, TC. (eds) Computer Vision — ACCV'98. ACCV 1998. Lecture Notes in Computer Science, vol 1352. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63931-4_228
Download citation
DOI: https://doi.org/10.1007/3-540-63931-4_228
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63931-2
Online ISBN: 978-3-540-69670-4
eBook Packages: Springer Book Archive