DOI: 10.1145/2072298.2072317
research-article

News contextualization with geographic and visual information

Published: 28 November 2011

Abstract

In this paper, we investigate the contextualization of news documents with geographic and visual information. We propose a matrix factorization approach to analyze the location relevance of each news document, and a method to enrich each document with a set of web images. For location relevance analysis, we first perform toponym extraction and expansion to obtain a toponym list from the news documents. We then apply matrix factorization to estimate location-document relevance scores while simultaneously capturing the correlation of locations and documents. For image enrichment, we generate multiple queries from each news document for image search and then employ a fusion approach to collect a set of images from the search results. Based on the location relevance analysis and image enrichment, we introduce a news browsing system named NewsMap, which lets users read news by browsing a map and retrieve news with location queries. Each news document is presented with its enriched images to help users grasp the information quickly. Extensive experiments demonstrate the effectiveness of our approaches.
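To make the location relevance step more concrete, the following is a minimal sketch of plain regularized matrix factorization over a small location-by-document score matrix. It is not the paper's formulation: the abstract does not give the objective, so the squared-error loss, the SGD updates, the hyperparameters, the helper names (`factorize`, `observed_error`) and the toy matrix `R_DEMO` are all assumptions made for illustration, and the correlation terms mentioned in the abstract are not modeled here.

```python
import random


def dot(u, v):
    """Inner product of two equal-length latent vectors."""
    return sum(a * b for a, b in zip(u, v))


def factorize(R, rank=2, epochs=2000, lr=0.02, reg=0.02, seed=0):
    """Fit R[i][j] ~= dot(U[i], V[j]) on observed (non-None) entries with SGD.

    Rows of R are locations, columns are documents. All hyperparameters are
    illustrative assumptions, not values from the paper.
    """
    rng = random.Random(seed)
    U = [[rng.uniform(0.0, 0.1) for _ in range(rank)] for _ in R]
    V = [[rng.uniform(0.0, 0.1) for _ in range(rank)] for _ in R[0]]
    observed = [(i, j, r) for i, row in enumerate(R)
                for j, r in enumerate(row) if r is not None]
    for _ in range(epochs):
        for i, j, r in observed:
            err = r - dot(U[i], V[j])
            for k in range(rank):
                u, v = U[i][k], V[j][k]
                # Gradient step on squared error with L2 regularization.
                U[i][k] += lr * (err * v - reg * u)
                V[j][k] += lr * (err * u - reg * v)
    return U, V


def observed_error(R, U, V):
    """Sum of squared errors over the observed entries of R."""
    return sum((r - dot(U[i], V[j])) ** 2
               for i, row in enumerate(R)
               for j, r in enumerate(row) if r is not None)


# Hypothetical initial scores (e.g. normalized toponym counts) for
# 3 locations x 4 documents; None marks a pair with no direct evidence.
R_DEMO = [
    [1.0, 0.8, None, 0.1],
    [0.9, None, 0.2, 0.1],
    [0.1, 0.1, None, 1.0],
]

U, V = factorize(R_DEMO)
# Estimated relevance of location 0 to document 2, which had no observed score.
estimated_missing = dot(U[0], V[2])
```

The point of the sketch is only that a low-rank model can assign a relevance score to location-document pairs with no direct toponym evidence, by sharing latent factors across locations and documents.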

    Published In

    MM '11: Proceedings of the 19th ACM international conference on Multimedia
    November 2011
    944 pages
    ISBN: 9781450306164
    DOI: 10.1145/2072298

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. image enrichment
    2. news location relevance
    3. newsmap

    Qualifiers

    • Research-article

    Conference

    MM '11: ACM Multimedia Conference
    November 28 - December 1, 2011
    Scottsdale, Arizona, USA

    Acceptance Rates

    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Cited By

    • (2023) A semantic modular framework for events topic modeling in social media. Multimedia Tools and Applications 83:4 (10755-10778). DOI: 10.1007/s11042-023-15745-8. Online publication date: 24-Jun-2023
    • (2023) Text classification of Chinese news based on multi-scale CNN and LSTM hybrid model. Multimedia Tools and Applications 82:14 (20975-20988). DOI: 10.1007/s11042-023-14450-w. Online publication date: 6-Feb-2023
    • (2021) Adaptive Salp swarm optimization algorithms with inertia weights for novel fake news detection model in online social media. Multimedia Tools and Applications. DOI: 10.1007/s11042-021-11006-8. Online publication date: 13-May-2021
    • (2020) Multi-label text classification with latent word-wise label information. Applied Intelligence. DOI: 10.1007/s10489-020-01838-6. Online publication date: 10-Sep-2020
    • (2018) GeoHbbTV. Multimedia Tools and Applications 77:21 (28023-28048). DOI: 10.5555/3287850.3287900. Online publication date: 1-Nov-2018
    • (2018) VizByWiki. Proceedings of the 2018 World Wide Web Conference (873-882). DOI: 10.1145/3178876.3186135. Online publication date: 10-Apr-2018
    • (2018) BreakingNews: Article Annotation by Image and Text Processing. IEEE Transactions on Pattern Analysis and Machine Intelligence 40:5 (1072-1085). DOI: 10.1109/TPAMI.2017.2721945. Online publication date: 1-May-2018
    • (2018) GeoHbbTV: A framework for the development and evaluation of geographic interactive TV contents. Multimedia Tools and Applications 77:21 (28023-28048). DOI: 10.1007/s11042-018-6021-6. Online publication date: 26-Apr-2018
    • (2017) Understanding-Oriented Multimedia News Summarization. Understanding-Oriented Multimedia Content Analysis (131-153). DOI: 10.1007/978-981-10-3689-7_6. Online publication date: 27-May-2017
    • (2017) Understanding-Oriented Multimedia News Retrieval. Understanding-Oriented Multimedia Content Analysis (101-129). DOI: 10.1007/978-981-10-3689-7_5. Online publication date: 27-May-2017
