Abstract
We address the problem of efficiently estimating the rotation of a camera relative to the canonical 3D Cartesian frame of an urban scene, under the so-called “Manhattan World” assumption [1,2]. While the problem has received considerable attention in recent years, it is unclear how current methods stack up in terms of accuracy and efficiency, and how they might best be improved. It is often argued that it is best to base estimation on all pixels in the image [2]. However, in this paper, we argue that in a sense, less can be more: that basing estimation on sparse, accurately localized edges, rather than dense gradient maps, permits the derivation of more accurate statistical models and leads to more efficient estimation. We also introduce and compare several different search techniques that have advantages over prior approaches. A cornerstone of the paper is the establishment of a new public groundtruth database which we use to derive required statistics and to evaluate and compare algorithms.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Coughlan, J.M., Yuille, A.L.: Manhattan world: Compass direction from a single image by bayesian inference. In: Seventh International Conference on Computer Vision, vol. 2, pp. 941–947. IEEE, Los Alamitos (1999)
Coughlan, J.M., Yuille, A.L.: Manhattan world: Orientation and outlier detection by bayesian inference. Neural Computation 15(5), 1063–1088 (2003)
Deutscher, J., Isard, M., MacCormick, J.: Automatic camera calibration from a single manhattan image. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2353, pp. 175–188. Springer, Heidelberg (2002)
Schindler, G., Dellaert, F.: Atlanta world: An expectation maximization framework for simultaneous low-level edge grouping and camera calibration in complex man-made environments. In: IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, pp. I–203 – I–209. IEEE, Los Alamitos (2004)
Kos̆ecká, J., Zhang, W.: Video compass. In: Seventh European Conference on Computer Vision, pp. 476–490 (2002)
Wildenauer, H., Vincze, M.: Vanishing point detection in complex man-made worlds. In: 14th IEEE International Conference on Image Analysis and Processing, pp. 615–622. IEEE, Los Alamitos (2007)
Collins, R., Weiss, R.: Vanishing point calculation as a statistical inference on the unit sphere. In: Third International Conference on Computer Vision, pp. 400–403. IEEE, Los Alamitos (1990)
Kanatani, K.: Geometric Computation for Machine Vision. Oxford University Press, Inc., New York (1993)
Elder, J.H., Zucker, S.W.: Local scale control for edge detection and blur estimation. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(7), 699–716 (1998)
Avriel, M.: Nonlinear Programming: Analysis and Methods. Prentice-Hall Inc., Englewood Cliffs (1976)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Denis, P., Elder, J.H., Estrada, F.J. (2008). Efficient Edge-Based Methods for Estimating Manhattan Frames in Urban Imagery. In: Forsyth, D., Torr, P., Zisserman, A. (eds) Computer Vision – ECCV 2008. ECCV 2008. Lecture Notes in Computer Science, vol 5303. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88688-4_15
Download citation
DOI: https://doi.org/10.1007/978-3-540-88688-4_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88685-3
Online ISBN: 978-3-540-88688-4
eBook Packages: Computer ScienceComputer Science (R0)