[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

A Unified Geolocation Framework for Web Videos

Published: 17 July 2014 Publication History

Abstract

In this article, we propose a unified geolocation framework to automatically determine where on the earth a web video was shot. We analyze different social, visual, and textual relationships from a real-world dataset and find four relationships with apparent geography clues that can be used for web video geolocation. Then, the geolocation process is formulated as an optimization problem that simultaneously takes the social, visual, and textual relationships into consideration. The optimization problem is solved by an iterative procedure, which can be interpreted as a propagation of the geography information among the web video social network. Extensive experiments on a real-world dataset clearly demonstrate the effectiveness of our proposed framework, with the geolocation accuracy higher than state-of-the-art approaches.

References

[1]
S. Ahern, M. Naaman, R. Nair, and J. H.-I. Yang. 2007. World explorer: Visualizing aggregate data from unstructured text in geo-referenced collections. In JCDL. 1--10.
[2]
E. Amitay, N. Har’El, R. Sivan, and A. Soffer. 2004. Web-a-where: Geotagging web content. In SIGIR. 273--280.
[3]
L. Backstrom, J. M. Kleinberg, R. Kumar, and J. Novak. 2008. Spatial variation in search engine queries. In WWW. 357--366.
[4]
H. Bay, T. Tuytelaars, and L. J. V. Gool. 2006. Surf: Speeded up robust features. In ECCV (1). 404--417.
[5]
D. Brockmann, L. Hufnagel, and T. Geisel. 2006. The scaling laws of human travel. Nature 439, 7075, 462--5.
[6]
J. Cao, C.-W. Ngo, Y.-D. Zhang, and J.-T. Li. 2011. Tracking web video topics: Discovery, visualization, and monitoring. IEEE Transactions on Circuits and Systems for Video Technology 21, 12, 1835--1846.
[7]
J. Choi, H. Lei, and G. Friedland. 2011. The 2011 ICSI video location estimation system. In MediaEval 2011.
[8]
A. Clauset, M. E. J. Newman, and C. Moore. 2004. Finding community structure in very large networks. Physical Review E 70, 6, 066111+.
[9]
D. J. Crandall, L. Backstrom, D. P. Huttenlocher, and J. M. Kleinberg. 2009. Mapping the world’s photos. In WWW. 761--770.
[10]
J. Davidson, B. Liebald, J. Liu, P. Nandy, T. V. Vleet, U. Gargi, S. Gupta, Y. He, M. Lambert, B. Livingston, and D. Sampath. 2010. The YouTube video recommendation system. In RecSys. 293--296.
[11]
G. Friedland, O. Vinyals, and T. Darrell. 2010. Multimodal location estimation. In ACM Multimedia. 1245--1252.
[12]
J. Hays and A. A. Efros. 2008. Im2gps: estimating geographic information from a single image. In CVPR.
[13]
T. Hwang and R. Kuang. 2010. A heterogeneous label propagation algorithm for disease gene discovery. In SDM. 583--594.
[14]
F. Inc. 2013. Flickr. Retrieved from http://www.flickr.com/.
[15]
Y. Inc. 2011. YouTube. Retrieved from http://www.youtube.com/.
[16]
M. Ji, Y. Sun, M. Danilevsky, J. Han, and J. Gao. 2010. Graph regularized transductive classification on heterogeneous information networks. In ECML/PKDD (1). 570--586.
[17]
P. Kelm, S. Schmiedeke, and T. Sikora. 2011. Multi-modal, multi-resource methods for placing Flickr videos on the map. In ICMR. 52.
[18]
O. V. Laere, S. Schockaert, and B. Dhoedt. 2011. Finding locations of Flickr resources using language models and similarity search. In ICMR. 48.
[19]
M. Larson, M. Soleymani, P. Serdyukov, S. Rudinac, C. Wartena, V. Murdock, G. Friedland, R. Ordelman, and G. J. F. Jones. 2011. Automatic tagging and geotagging in video collections and communities. In ICMR. 51.
[20]
L. T. Li, J. Almeida, and R. da Silva Torres. 2011. Recod working notes for placing task MediaEval 2011. Retrieved from http://ceur-ws.org/Vol-807/Li_UNICAMP_Placing_me11wn.pdf.
[21]
L. T. Li, J. Almeida, D. C. G. Pedronette, O. A. B. Penatti, and R. da Silva Torres. 2012. A multimodal approach for video geocoding. Retrieved from http://ceur-ws.org/Vol-927/mediaeval2012_submission_19.pdf.
[22]
D. Liu, S. Yan, X.-S. Hua, and H.-J. Zhang. 2011. Image retagging using collaborative tag propagation. IEEE Transactions on Multimedia 13, 4, 702--712.
[23]
J. Luo, D. Joshi, J. Yu, and A. C. Gallagher. 2011. Geotagging in multimedia and computer vision—a survey. Multimedia Tools Appl. 51, 1, 187--211.
[24]
MediaEval. 2011. Placing task in MediaEval 2011. Retrieved from http://www.multimediaeval.org/mediaeval2011/placing2011/.
[25]
MediaEval. 2012. Placing task in MediaEval 2012. Retrieved from http://www.multimediaeval.org/mediaeval2012/placing2012/.
[26]
O. A. B. Penatti, L. T. Li, J. Almeida, and R. da Silva Torres. 2012. A visual approach for video geocoding using bag-of-scenes. In ICMR. 53.
[27]
A. Popescu and N. Ballas. 2012. Cea list’s participation at MediaEval 2012 placing task. Retrieved from http://ceur-ws.org/Vol-927/mediaeval2012_submission_32.pdf.
[28]
A. Rae and P. Kelm. 2012. Working notes for the placing task at MediaEval 2012. Retrieved from http://ceur-ws.org/Vol-927/mediaeval2012_submission_6.pdf.
[29]
T. Rattenbury, N. Good, and M. Naaman. 2007. Towards automatic extraction of event and place semantics from Flickr tags. In SIGIR. 103--110.
[30]
K. Sahr, D. White, and A. J. Kimerling. 2003. Geodesic discrete global grid systems. Cartography and Geographic Information Science 30, 2, 121--134.
[31]
R. L. Santos, B. P. Rocha, C. G. Rezende, and A. A. F. Loureiro. 2007. Characterizing the YouTube video-sharing community. Retrieved from http://www.mendeley.com/research/characterizing-youtube-qvideosharing-community-4/.
[32]
P. Serdyukov, V. Murdock, and R. van Zwol. 2009. Placing Flickr photos on a map. In SIGIR. 484--491.
[33]
Y. Song, J. Cao, Z. Chen, Y. Zhang, and J. Li. 2010. Tag transformer. In ACM Multimedia. 639--642.
[34]
Y. Song, Y.-D. Zhang, J. Cao, T. Xia, W. Liu, and J.-T. Li. 2012. Web video geolocation by geotagged social resources. IEEE Transactions on Multimedia 14, 2, 456--470.
[35]
J. Tang, R. Hong, S. Yan, T.-S. Chua, G.-J. Qi, and R. Jain. 2011. Image annotation by knn-sparse graph-based label propagation over noisily tagged web images. ACM Transactions on Intelligent System Technology 2, 2, 14:1--14:15.
[36]
J. Tang, S. Yan, R. Hong, G.-J. Qi, and T.-S. Chua. 2009. Inferring semantic concepts from community-contributed images and noisy tags. In Proceedings of the 17th ACM International Conference on Multimedia (MM’09). ACM, New York, 223--232.
[37]
K. Yanai, H. Kawakubo, and B. Qiu. 2009. A visual analysis of the relationship between word concepts and geographical locations. In CIVR.
[38]
W. Zhao, X. Wu, and C.-W. Ngo. 2010. On the annotation of web videos by efficient near-duplicate search. IEEE Transactions on Multimedia 12, 5, 448--461.
[39]
Y.-T. Zheng, Z.-J. Zha, and T.-S. Chua. 2011. Research and applications on georeferenced multimedia: A survey. Multimedia Tools and Applications 51, 77--98.
[40]
D. Zhou, O. Bousquet, T. N. Lal, J. Weston, and B. Schölkopf. 2003. Learning with local and global consistency. In NIPS.

Cited By

View all
  • (2021)A transfer approach with attention reptile method and long-term generation mechanism for few-shot traffic predictionNeurocomputing10.1016/j.neucom.2021.03.068452(15-27)Online publication date: Sep-2021
  • (2019)Co-saliency Detection with Graph MatchingACM Transactions on Intelligent Systems and Technology10.1145/331387410:3(1-22)Online publication date: 12-Apr-2019
  • (2018)A Review of Co-Saliency Detection AlgorithmsACM Transactions on Intelligent Systems and Technology10.1145/31586749:4(1-31)Online publication date: 30-Jan-2018
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Intelligent Systems and Technology
ACM Transactions on Intelligent Systems and Technology  Volume 5, Issue 3
Special Section on Urban Computing
September 2014
361 pages
ISSN:2157-6904
EISSN:2157-6912
DOI:10.1145/2648782
  • Editor:
  • Qiang Yang
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 July 2014
Accepted: 01 September 2013
Revised: 01 July 2013
Received: 01 March 2013
Published in TIST Volume 5, Issue 3

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Unified geolocation framework
  2. geotag
  3. web video

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)9
  • Downloads (Last 6 weeks)3
Reflects downloads up to 30 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2021)A transfer approach with attention reptile method and long-term generation mechanism for few-shot traffic predictionNeurocomputing10.1016/j.neucom.2021.03.068452(15-27)Online publication date: Sep-2021
  • (2019)Co-saliency Detection with Graph MatchingACM Transactions on Intelligent Systems and Technology10.1145/331387410:3(1-22)Online publication date: 12-Apr-2019
  • (2018)A Review of Co-Saliency Detection AlgorithmsACM Transactions on Intelligent Systems and Technology10.1145/31586749:4(1-31)Online publication date: 30-Jan-2018
  • (2016)Exploring the Use of Tags for Georeplicated Content Placement2016 IEEE International Conference on Cloud Engineering (IC2E)10.1109/IC2E.2016.37(172-181)Online publication date: Apr-2016
  • (2015)Exploiting Spatial Relationship between Scenes for Hierarchical Video GeotaggingProceedings of the 5th ACM on International Conference on Multimedia Retrieval10.1145/2671188.2749354(363-370)Online publication date: 22-Jun-2015

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media