Efficient Edge-Based Methods for Estimating Manhattan Frames in Urban Imagery

Patrick Denis,
James H. Elder &
Francisco J. Estrada

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 5303)

Included in the following conference series:

European Conference on Computer Vision

Abstract

We address the problem of efficiently estimating the rotation of a camera relative to the canonical 3D Cartesian frame of an urban scene, under the so-called “Manhattan World” assumption [1,2]. While the problem has received considerable attention in recent years, it is unclear how current methods stack up in terms of accuracy and efficiency, and how they might best be improved. It is often argued that it is best to base estimation on all pixels in the image [2]. However, in this paper, we argue that in a sense, less can be more: that basing estimation on sparse, accurately localized edges, rather than dense gradient maps, permits the derivation of more accurate statistical models and leads to more efficient estimation. We also introduce and compare several different search techniques that have advantages over prior approaches. A cornerstone of the paper is the establishment of a new public groundtruth database which we use to derive required statistics and to evaluate and compare algorithms.
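
The geometric relationship behind the problem statement can be illustrated with a short sketch. It is not the authors' estimator: it only shows, under standard pinhole-camera assumptions, how a candidate camera rotation predicts three vanishing points and how a single localized edge can be scored against them. The Euler-angle parametrization, the intrinsic matrix values, and all function names (`rotation_from_euler`, `vanishing_points`, `edge_angle_error`, `assign_edge_to_axis`) are hypothetical choices made for illustration.

```python
import numpy as np


def rotation_from_euler(alpha, beta, gamma):
    """Return R = Rz(gamma) @ Ry(beta) @ Rx(alpha); angles in radians.

    The Euler convention is an assumption made for this sketch.
    """
    ca, sa = np.cos(alpha), np.sin(alpha)
    cb, sb = np.cos(beta), np.sin(beta)
    cg, sg = np.cos(gamma), np.sin(gamma)
    rx = np.array([[1.0, 0.0, 0.0], [0.0, ca, -sa], [0.0, sa, ca]])
    ry = np.array([[cb, 0.0, sb], [0.0, 1.0, 0.0], [-sb, 0.0, cb]])
    rz = np.array([[cg, -sg, 0.0], [sg, cg, 0.0], [0.0, 0.0, 1.0]])
    return rz @ ry @ rx


def vanishing_points(K, R):
    """Homogeneous vanishing points of the three Manhattan axes.

    Column i of K @ R is the image of the point at infinity along axis i.
    """
    return K @ R


def edge_angle_error(edge_xy, edge_theta, vp_h):
    """Angle (radians, in [0, pi/2]) between an edge's orientation and the
    image line joining the edge location to a homogeneous vanishing point."""
    # Direction towards the vanishing point, up to sign. This form also
    # works when the vanishing point lies at infinity (vp_h[2] == 0).
    d = vp_h[:2] - vp_h[2] * np.asarray(edge_xy, dtype=float)
    phi = np.arctan2(d[1], d[0])
    diff = (edge_theta - phi) % np.pi  # orientations are defined modulo pi
    return min(diff, np.pi - diff)


def assign_edge_to_axis(edge_xy, edge_theta, vps):
    """Return (axis index, angular error) for the best-matching axis."""
    errors = [edge_angle_error(edge_xy, edge_theta, vps[:, i]) for i in range(3)]
    best = int(np.argmin(errors))
    return best, float(errors[best])


if __name__ == "__main__":
    # Assumed intrinsics: 800-pixel focal length, principal point (320, 240).
    K = np.array([[800.0, 0.0, 320.0], [0.0, 800.0, 240.0], [0.0, 0.0, 1.0]])
    R = rotation_from_euler(0.0, 0.0, 0.0)
    vps = vanishing_points(K, R)
    print(assign_edge_to_axis((100.0, 50.0), 0.0, vps))
```

A full estimator would search over rotations for the one that best explains all edges under a statistical error model, which is the part the paper addresses; the per-edge angular error above is only the simplest possible consistency measure.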

References

1. Coughlan, J.M., Yuille, A.L.: Manhattan world: Compass direction from a single image by Bayesian inference. In: Seventh International Conference on Computer Vision, vol. 2, pp. 941–947. IEEE, Los Alamitos (1999)
2. Coughlan, J.M., Yuille, A.L.: Manhattan world: Orientation and outlier detection by Bayesian inference. Neural Computation 15(5), 1063–1088 (2003)
3. Deutscher, J., Isard, M., MacCormick, J.: Automatic camera calibration from a single Manhattan image. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2353, pp. 175–188. Springer, Heidelberg (2002)
4. Schindler, G., Dellaert, F.: Atlanta world: An expectation maximization framework for simultaneous low-level edge grouping and camera calibration in complex man-made environments. In: IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, pp. I-203–I-209. IEEE, Los Alamitos (2004)
5. Košecká, J., Zhang, W.: Video compass. In: Seventh European Conference on Computer Vision, pp. 476–490 (2002)
6. Wildenauer, H., Vincze, M.: Vanishing point detection in complex man-made worlds. In: 14th IEEE International Conference on Image Analysis and Processing, pp. 615–622. IEEE, Los Alamitos (2007)
7. Collins, R., Weiss, R.: Vanishing point calculation as a statistical inference on the unit sphere. In: Third International Conference on Computer Vision, pp. 400–403. IEEE, Los Alamitos (1990)
8. Kanatani, K.: Geometric Computation for Machine Vision. Oxford University Press, Inc., New York (1993)
9. Elder, J.H., Zucker, S.W.: Local scale control for edge detection and blur estimation. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(7), 699–716 (1998)
10. Avriel, M.: Nonlinear Programming: Analysis and Methods. Prentice-Hall Inc., Englewood Cliffs (1976)

Author information

Authors and Affiliations

York University, Canada
Patrick Denis & James H. Elder
University of Toronto, Canada
Francisco J. Estrada

Editor information

Editors and Affiliations

Computer Science Department, University of Illinois at Urbana-Champaign, 3310 Siebel Hall, Urbana, IL 61801, USA
David Forsyth
Department of Computing, Oxford Brookes University, OX33 1HX, Wheatley, Oxford, UK
Philip Torr
Department of Engineering Science, University of Oxford, Parks Road, OX1 3PJ, Oxford, UK
Andrew Zisserman

About this paper

Cite this paper

Denis, P., Elder, J.H., Estrada, F.J. (2008). Efficient Edge-Based Methods for Estimating Manhattan Frames in Urban Imagery. In: Forsyth, D., Torr, P., Zisserman, A. (eds) Computer Vision – ECCV 2008. ECCV 2008. Lecture Notes in Computer Science, vol 5303. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88688-4_15

DOI: https://doi.org/10.1007/978-3-540-88688-4_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88685-3
Online ISBN: 978-3-540-88688-4
eBook Packages: Computer Science, Computer Science (R0)
