More Web Proxy on the site http://driver.im/

research-article

A cross-view geo-localization method guided by relation-aware global attention

Authors:

Fuming SunAuthors Info & Claims

Multimedia Systems, Volume 29, Issue 4

Pages 2205 - 2216

https://doi.org/10.1007/s00530-023-01101-1

Published: 09 May 2023 Publication History

Abstract

Cross-view geo-localization mainly exploits query images to match images from the same geographical location from different platforms. Most existing methods fail to adequately consider the effect of image structural information on cross-view geo-localization, resulting in the extracted features can not fully characterize the image, which affects the localization accuracy. Based on this, this paper proposes a cross-view geo-localization method guided by relation-aware global attention, which can capture the rich global structural information by perfectly integrating attention mechanism and feature extraction network, thus improving the representation ability of features. Meanwhile, considering the important role of semantic and context information in geo-localization, a joint training structure with parallel global branch and local branch is designed to fully mine multi-scale context features for image matching, which can further improve the accuracy of cross-view geo-localization. The quantitative and qualitative experimental results on University-1652, CVUSA, and CVACT datasets show that the algorithm in this paper outperforms other advanced methods in recall accuracy (Recall) and image retrieval average precision (AP).

References

[1]

Wang Z, Qin J, Xiang X, and Tan Y A privacy-preserving and traitor tracking content-based image retrieval scheme in cloud computing Multimedia Syst. 2021 27 3 403-415

[2]

Saritha RR, Paul V, and Kumar PG Content based image retrieval using deep learning process Cluster Comput. 2019 22 2 4187-4200

[3]

Outay F, Mengash HA, and Adnan M Applications of unmanned aerial vehicle (uav) in road safety, traffic and highway infrastructure management: recent advances and challenges Trans. Res. Part A 2020 141 116-129

[4]

Zhao X, Huang P, and Shu X Wavelet-attention CNN for image classification Multimedia Syst. 2022 28 3 915-924

[5]

Wang P, Fan E, and Wang P Comparative analysis of image classification algorithms based on traditional machine learning and deep learning Pattern Recogn. Lett. 2021 141 61-67

[6]

Wang H, Song Y, Huo L, Chen L, and He Q Multiscale object detection based on channel and data enhancement at construction sites Multimedia Syst. 2023 29 1 49-58

[7]

Tan, M., Pang, R., Le, Q.V.: Efficientdet: Scalable and efficient object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 10781–10790 (2020)

[8]

Yuan, Y., Chen, X., Wang, J.: Object-contextual representations for semantic segmentation. In: Proceedings of the European Conference on Computer Vision, pp. 173–190 (2020)

[9]

Hao S, Zhou Y, and Guo Y A brief survey on semantic segmentation with deep learning Neurocomputing 2020 406 302-321

[10]

Jaouedi N, Boujnah N, and Bouhlel MS A new hybrid deep learning model for human action recognition J. King Saud Univ. Comput. Inf. Sci. 2020 32 4 447-453

[11]

Yang, C., Xu, Y., Shi, J., Dai, B., Zhou, B.: Temporal pyramid network for action recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 588–597 (2020)

[12]

Shi, Y., Yu, X., Liu, L., Zhang, T., Li, H.: Optimal feature transport for cross-view image geo-localization. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 11990–11997 (2020)

[13]

Zheng, Z., Wei, Y., Yang, Y.: University-1652: A multi-view multi-source benchmark for drone-based geo-localization. In: Proceedings of the 28th ACM International Conference on Multimedia, pp. 1395–1403 (2020)

[14]

Wang T, Zheng Z, Yan C, Zhang J, Sun Y, Zheng B, and Yang Y Each part matters: local patterns facilitate cross-view geo-localization IEEE Trans. Circuits Syst. Video Technol. 2021 32 2 867-879

[15]

He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)

[16]

Zhang, Z., Lan, C., Zeng, W., Jin, X., Chen, Z.: Relation-aware global attention for person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3186–3195 (2020)

[17]

Yu, F., Koltun, V., Funkhouser, T.: Dilated residual networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 472–480 (2017)

[18]

Zheng Z, Zheng L, and Yang Y A discriminatively learned cnn embedding for person reidentification ACM Tran. Multimedia Comput. Commun. Appl. 2018 14 1 13-11320

[19]

Li X, Yu L, Chang D, Ma Z, and Cao J Dual cross-entropy loss for small-sample fine-grained vehicle classification IEEE Trans. Vehicular Technol. 2019 68 5 4204-4212

[20]

Workman, S., Jacobs, N.: On the location dependence of convolutional neural network features. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 70–78 (2015)

[21]

Workman, S., Souvenir, R., Jacobs, N.: Wide-area image geolocalization with aerial reference imagery. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3961–3969 (2015)

[22]

Lin, T.-Y., Cui, Y., Belongie, S., Hays, J.: Learning deep representations for ground-to-aerial geolocalization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5007–5015 (2015)

[23]

Vo, N.N., Hays, J.: Localizing and orienting street views using overhead imagery. In: Proceedings of the European Conference on Computer Vision, Springer. pp 494–509 (2016)

[24]

Tian, Y., Chen, C., Shah, M.: Cross-view image matching for geo-localization in urban environments. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 3608–3616 (2017)

[25]

Altwaijry, H., Trulls, E., Hays, J., Fua, P., Belongie, S.: Learning to match aerial images with deep attentive architectures. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 3539–3547 (2016)

[26]

Zhai, M., Bessinger, Z., Workman, S., Jacobs, N.: Predicting ground-level scene layout from aerial imagery. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 867–875 (2017)

[27]

Hu, S., Feng, M., Nguyen, R.M., Lee, G.H.: Cvm-net: Cross-view matching network for image-based ground-to-aerial geo-localization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 7258–7267 (2018)

[28]

Arandjelovic, R., Gronát, P., Torii, A., Pajdla, T., Sivic, J.: Netvlad: Cnn architecture for weakly supervised place recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 5297–5307 (2016)

[29]

Shi, Y., Liu, L., Yu, X., Li, H.: Spatial-aware feature aggregation for cross-view image based geo-localization. In: Proceedings of the 33rd International Conference on Neural Information Processing Systems. pp 10090–10100 (2019)

[30]

Shi, Y., Yu, X., Campbell, D., Li, H.: Where am i looking at? joint location and orientation estimation by cross-view matching. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 4064–4072 (2020)

[31]

Liu, L., Li, H.: Lending orientation to neural networks for cross-view geo-localization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5624–5633 (2019)

[32]

Rodrigues, R., Tani, M.: Are these from the same place? seeing the unseen in cross-view image geo-localization. In: Proceedings of the IEEE Winter Conference on Applications of Computer Vision. pp 3753–3761 (2021)

[33]

Regmi, K., Shah, M.: Bridging the domain gap for ground-to-aerial image matching. In: Proceedings of the IEEE International Conference on Computer Visio. pp 470–479 (2019)

[34]

Goodfellow IJ, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville AC, and Bengio Y Generative adversarial networks Commun. ACM 2020 63 11 139-144

[35]

Toker, A., Zhou, Q., Maximov, M., Leal-Taixé, L.: Coming down to earth: Satellite-to-street view synthesis for geo-localization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 6488–6497 (2021)

[36]

Zheng Z, Zheng L, Garrett M, Yang Y, Xu M, and Shen Y Dual-path convolutional image-text embeddings with instance loss ACM Trans. Multimedia Compu. Commun. Appl. 2020 16 2 1-23

[37]

Ding L, Zhou J, Meng L, and Long Z A practical cross-view image matching method between uav and satellite for uav-based geo-localization Remote Sens. 2020 13 1 47

[38]

Zhuang J, Dai M, Chen X, and Zheng E A faster and more effective cross-view matching method of uav and satellite images for uav geolocalization Remote Sens. 2021 13 19 3979

[39]

Lin J, Zheng Z, Zhong Z, Luo Z, Li S, Yang Y, and Sebe N Joint representation learning and keypoint detection for cross-view geo-localization IEEE Trans. Image Process. 2022 31 3780-3792

[40]

Dai M, Hu J, Zhuang J, and Zheng E A transformer-based feature segmentation and region alignment method for uav-view geo-localization IEEE Trans. Circuits. Syst. Video Technol. 2022 32 7 4376-4389

[41]

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L.u., Polosukhin, I.: Attention is all you need. In: Proceedings of the 31st International Conference on Neural Information Processing Systems, vol. 30, pp. 1–11 (2017)

[42]

Chechik G, Sharma V, Shalit U, and Bengio S Large scale online learning of image similarity through ranking J. Mach. Learning Res. 2010 11 3 1109-1135

[43]

Cai, S., Guo, Y., Khan, S., Hu, J., Wen, G.: Ground-to-aerial image geo-localization with a hard exemplar reweighting triplet loss. In: Proceedings of the IEEE International Conference on Computer Vision. pp 8391–8400 (2019)

[44]

Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 7132–7141 (2018)

Recommendations

Multi-weather Cross-view Geo-localization Using Denoising Diffusion Models
UAVM '24: Proceedings of the 2nd Workshop on UAVs in Multimedia: Capturing the World from a New Perspective

Cross-view geo-localization in GNSS-denied environments aims to determine an unknown location by matching drone-view images with the correct geo-tagged satellite-view images from a large gallery. Recent research shows that learning discriminative image ...
Image and Object Geo-Localization
Abstract
The concept of geo-localization broadly refers to the process of determining an entity’s geographical location, typically in the form of Global Positioning System (GPS) coordinates. The entity of interest may be an image, a sequence of images, a ...
Attention-based neural network with Generalized Mean Pooling for cross-view geo-localization between UAV and satellite
Abstract
Cross-view geo-localization is finding images containing the same geographic target in multi-views. For example, given a query image from UAV view, a proposed matching model can find an exact image of the same location in a gallery collected by ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Multimedia Systems

Multimedia Systems Volume 29, Issue 4

Aug 2023

584 pages

ISSN:0942-4962

Issue’s Table of Contents

© The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 09 May 2023

Accepted: 26 April 2023

Received: 20 February 2023

Author Tags

Qualifiers

Research-article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 21 Dec 2024

Other Metrics

View Author Metrics

Citations

View Options

View options

Media

Figures

Other

Tables

View Issue’s Table of Contents