Cross-Modal Information Retrieval – A Case Study on Chinese Wikipedia

Yonghui Cong²²,
Zengchang Qin²²,
Jing Yu²² &
…
Tao Wan²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7713))

Included in the following conference series:

International Conference on Advanced Data Mining and Applications

Abstract

Probability models have been used in cross-modal multimedia information retrieval recently by building conjunctive models bridging the text and image components. Previous studies have shown that cross-modal information retrieval system using the topic correlation model (TCM) outperforms state-of-the-art models in English corpus. In this paper, we will focus on the Chinese language, which is different from western languages composed by alphabets. Words and characters will be chosen as the basic structural units of Chinese, respectively. We also set up a test database, named Ch-Wikipedia, in which documents with paired image and text are extracted from Chinese website of Wikipedia. We investigate the problems of retrieving texts (ranked by semantic closeness) given an image query, and vice versa. The capabilities of the TCM model is verified by experiments across the Ch-Wikipedia dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 35.99; Price includes VAT (United Kingdom)

Softcover Book: GBP 44.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Topic correlation model for cross-modal multimedia information retrieval

Article 05 May 2015

Fine-Grained Label Learning via Siamese Network for Cross-modal Information Retrieval

Multi-Lingual Retrieval of Pictures in ImageCLEF

References

Blei, D.M., Ng, A., Jordan, M.: Latent Dirichlet allocation. Journal of Machine Learning Research 3, 993–1022 (2003)
MATH Google Scholar
Blei, D.M., Lafferty, J.D.: Topic models. Chapman & Hall/CRC Data Mining and Knowledge Discovery Series (2009)
Google Scholar
Barnard, K., Duygulu, P., Forsyth, D., Freitas, N., Blei, D., Jordan, M.: Matching words and pictures. Journal of Machine Learning Research 3, 1107–1135 (2003)
MATH Google Scholar
Jeon, J., Lavreko, V., Manmatha, R.: Automatic image annotation and retrieval using cross-media relevance models. In: ACM SIGIR Conf. Research and Development in Information Retrieval, New York, pp. 119–126 (2003)
Google Scholar
Jiang, Y., Ngo, C., Yang, J.: Towards optimal Bag-of-features for object categorization and semantic video retrieval. In: CIVR, pp. 494–501 (2007)
Google Scholar
Metzler, D., Manmatha, R.: An inference network approach to image retrieval. In: Image and Video Retrieval, pp. 42–50 (2005)
Google Scholar
Qin, Z., Thint, M., Huang, Z.: Ranking Answers by Hierarchical Topic Models. In: Chien, B.-C., Hong, T.-P., Chen, S.-M., Ali, M. (eds.) IEA/AIE 2009. LNCS, vol. 5579, pp. 103–112. Springer, Heidelberg (2009)
Chapter Google Scholar
Rasiwasia, N., Pereira, J.C., Coviello, E., Doyle, G., Lanckriet, G.R.G., Levy, R., Vasconcelos, N.: A new approach to cross-modal multimedia retrieval. In: ACM Multimedia (MM), pp. 251–260 (2010)
Google Scholar
Rasiwasia, N., Moreno, P., Vasconcelos, N.: Bridging the gap: Query by semantic example. IEEE Transactions on Multimedia 9(5), 923–938 (2007)
Article Google Scholar
Schmid, C., Mikolajczyk, K.: A performance evaluation of local descriptors. In: ICPR, vol. 2, pp. 257–263 (2003)
Google Scholar
Snoek, C., Worring, M.: Multimodal video indexing: A review of the state-of-the-art. Multimedia Tools and Applications 25(1), 5–35 (2005)
Article Google Scholar
Westerveld, T.: Probabilistic multimedia retrieval. ACM 25, 438 (2002)
Google Scholar
Xu, T.Q.: Fundamental structural principles of Chinese semantic syntax in terms of Chinese Characters. Applied Linguistics 1, 3–13 (2001) (in Chinese)
Google Scholar
Yu, J., Cong, Y., Qin, Z., Wan, T.: Cross-modal topic correlations for multimedia retrieval. To appear in ICPR (2012)
Google Scholar
Yuan, X., Yu, J., Qin, Z., Wan, T.: A SIFT-LBP image retrieval model based on bag-of features. In: International Conference on Image Processing (ICIP), pp. 1061–1064 (2011)
Google Scholar
Zhang, Y., Qin, Z.: A topic model of observing Chinese characters. In: Proceedings of the International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC), vol. 2, pp. 7–10 (2010)
Google Scholar
Zhao, Q., Qin, Z., Wan, T.: What Is the Basic Semantic Unit of Chinese Language? A Computational Approach Based on Topic Models. In: Kanazawa, M., Kornai, A., Kracht, M., Seki, H. (eds.) MOL 12. LNCS, vol. 6878, pp. 143–157. Springer, Heidelberg (2011)
Chapter Google Scholar
Zhao, Q., Qin, Z., Wan, T.: Topic Modeling of Chinese Language Using Character-Word Relations. In: Lu, B.-L., Zhang, L., Kwok, J. (eds.) ICONIP 2011, Part III. LNCS, vol. 7064, pp. 139–147. Springer, Heidelberg (2011)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Intelligent Computing and Machine Learning Lab, School of ASEE, Beihang University, Beijing, China
Yonghui Cong, Zengchang Qin & Jing Yu
Department of Biomedical Engineering, Rutgers University, USA
Tao Wan

Authors

Yonghui Cong
View author publications
You can also search for this author in PubMed Google Scholar
Zengchang Qin
View author publications
You can also search for this author in PubMed Google Scholar
Jing Yu
View author publications
You can also search for this author in PubMed Google Scholar
Tao Wan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Science, Fudan University, Handan Road 220, 200433, Shanghai, China
Shuigeng Zhou
Chinese Academy of Sciences, Academy of Mathematics and Systems Science, Dongguancun East Road 55, 100190, Beijing, China
Songmao Zhang
Department of Computer Science and Engineering, University of Minnesota, Union Street SE 200, 55455, Minneapolis, MN, USA
George Karypis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cong, Y., Qin, Z., Yu, J., Wan, T. (2012). Cross-Modal Information Retrieval – A Case Study on Chinese Wikipedia. In: Zhou, S., Zhang, S., Karypis, G. (eds) Advanced Data Mining and Applications. ADMA 2012. Lecture Notes in Computer Science(), vol 7713. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35527-1_2

Download citation

DOI: https://doi.org/10.1007/978-3-642-35527-1_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35526-4
Online ISBN: 978-3-642-35527-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Cross-Modal Information Retrieval – A Case Study on Chinese Wikipedia

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Topic correlation model for cross-modal multimedia information retrieval

Fine-Grained Label Learning via Siamese Network for Cross-modal Information Retrieval

Multi-Lingual Retrieval of Pictures in ImageCLEF

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Cross-Modal Information Retrieval – A Case Study on Chinese Wikipedia

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Topic correlation model for cross-modal multimedia information retrieval

Fine-Grained Label Learning via Siamese Network for Cross-modal Information Retrieval

Multi-Lingual Retrieval of Pictures in ImageCLEF

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation