[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3308558.3313486acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article
Open access

Predicting ConceptNet Path Quality Using Crowdsourced Assessments of Naturalness

Published: 13 May 2019 Publication History

Abstract

In many applications, it is important to characterize the way in which two concepts are semantically related. Knowledge graphs such as ConceptNet provide a rich source of information for such characterizations by encoding relations between concepts as edges in a graph. When two concepts are not directly connected by an edge, their relationship can still be described in terms of the paths that connect them. Unfortunately, many of these paths are uninformative and noisy, which means that the success of applications that use such path features crucially relies on their ability to select high-quality paths. In existing applications, this path selection process is based on relatively simple heuristics. In this paper we instead propose to learn to predict path quality from crowdsourced human assessments. Since we are interested in a generic task-independent notion of quality, we simply ask human participants to rank paths according to their subjective assessment of the paths' naturalness, without attempting to define naturalness or steering the participants towards particular indicators of quality. We show that a neural network model trained on these assessments is able to predict human judgments on unseen paths with near optimal performance. Most notably, we find that the resulting path selection method is substantially better than the current heuristic approaches at identifying meaningful paths.

References

[1]
Lora Aroyo and Chris Welty. 2014. The Three Sides of CrowdTruth. Human Computation 1(2014), 31-34.
[2]
Sören Auer, Christian Bizer, Georgi Kobilarov, Jens Lehmann, Richard Cyganiak, and Zachary Ives. 2007. DBpedia: A Nucleus for a Web of Open Data. The Semantic Web (2007), 722-735.
[3]
Kurt Bollacker, Colin Evans, Praveen Paritosh, Tim Sturge, and Jamie Taylor. 2008. Freebase: A Collaboratively Created Graph Database for Structuring Human Knowledge. In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data. ACM, 1247-1250.
[4]
Adrian Boteanu and Sonia Chernova. 2015. Solving and Explaining Analogy Questions Using Semantic Networks. In AAAI Conference on Artificial Intelligence. AAAI, 1460-1466.
[5]
Zied Bouraoui, Shoaib Jameel, and Steven Schockaert. 2018. Relation Induction in Word Embeddings Revisited. In Proceedings of the 27th International Conference on Computational Linguistics (COLING). 1627-1637.
[6]
Andrew Carlson, Justin Betteridge, Bryan Kisiel, Burr Settles, Estevam R Hruschka Jr, and Tom M Mitchell. 2010. Toward an Architecture for Never-Ending Language Learning. In AAAI Conference on Artificial Intelligence. 1306-1313.
[7]
Junpeng Chen and Juan Liu. 2011. Combining ConceptNet and WordNet for Word Sense Disambiguation. In International Joint Conference on Natural Language Processing (IJCNLP). 686-694.
[8]
Rajarshi Das, Arvind Neelakantan, David Belanger, and Andrew McCallum. 2017. Chains of Reasoning over Entities, Relations, and Text Using Recurrent Neural Networks. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Vol. 1. 132-141.
[9]
Georgiana Dinu, Angeliki Lazaridou, and Marco Baroni. 2015. Improving Zero-Shot Learning by Mitigating the Hubness Problem. In Proceedings of the International Conference on Learning Representations (ICLR).
[10]
Manaal Faruqui, Jesse Dodge, Sujay Kumar Jauhar, Chris Dyer, Eduard Hovy, and Noah A. Smith. 2015. Retrofitting Word Vectors to Semantic Lexicons. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL:HLT). Association for Computational Linguistics, 1606-1615.
[11]
Matt Gardner, Partha Talukdar, Jayant Krishnamurthy, and Tom Mitchell. 2014. Incorporating Vector Space Similarity in Random Walk Inference over Knowledge Bases. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). 397-406.
[12]
Kelvin Guu, John Miller, and Percy Liang. 2015. Traversing Knowledge Graphs in Vector Space. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP).
[13]
Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long Short-Term Memory. Neural Computation 9, 8 (1997), 1735-1780.
[14]
Diederik Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In Proceedings of the International Conference Learning Representations (ICLR).
[15]
Alexander Kotov and Chengiang Zhai. 2012. Tapping into Knowledge Base for Concept Feedback: Leveraging ConceptNet to Improve Search Results for Difficult Queries. In Proceedings of the 5th ACM International Conference on Web Search and Data Mining (WSDM). ACM, 403-412.
[16]
Alicia Krebs, Alessandro Lenci, and Denis Paperno. 2018. SemEval-2018 Task 10: Capturing Discriminative Attributes. In Proceedings of the 12th International Workshop on Semantic Evaluation (SemEval). 732-740.
[17]
Ni Lao and William W Cohen. 2010. Relational Retrieval Using a Combination of Path-Constrained Random Walks. Machine Learning 81, 1 (2010), 53-67.
[18]
Douglas B Lenat. 1995. CYC: A large-scale investment in knowledge infrastructure. Communications of the ACM (CACM) 38, 11 (1995), 33-38.
[19]
Yankai Lin, Zhiyuan Liu, Huanbo Luan, Maosong Sun, Siwei Rao, and Song Liu. 2015. Modeling Relation Paths for Representation Learning of Knowledge Bases. In Proceedings of the Conference Empirical Methods in Natural Language Processing (EMNLP). 705-714.
[20]
Hugo Liu and Push Singh. 2004. ConceptNet-a Practical Commonsense Reasoning Tool-Kit. BT Technology Journal 22, 4 (2004), 211-226.
[21]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013. Distributed Representations of Words and Phrases and Their Compositionality. In Advances in Neural Information Processing Systems (NIPS). 3111-3119.
[22]
George A Miller, Richard Beckwith, Christiane Fellbaum, Derek Gross, and Katherine J Miller. 1990. Introduction to WordNet: An On-line Lexical Database. International Journal Lexicography 3, 4 (1990), 235-244.
[23]
Arvind Neelakantan, Benjamin Roth, and Andrew McCallum. 2015. Compositional Vector Space Models for Knowledge Base Completion. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (ACL & IJCNLP). Association for Computational Linguistics, 156-166.
[24]
Jeffrey Pennington, Richard Socher, and Christopher Manning. 2014. GloVe: Global Vectors for Word Representation. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). 1532-1543.
[25]
Miloš Radovanovic, Alexandros Nanopoulos, and Mirjana Ivanovic. 2010. Hubs in Space: Popular Nearest Neighbors in High-Dimensional Data. Journal of Machine Learning Research (JMLR) 11 (2010), 2487-2531.
[26]
Push Singh, Thomas Lin, Erik T Mueller, Grace Lim, Travell Perkins, and Wan Li Zhu. 2002. Open Mind Common Sense: Knowledge Acquisition from the General Public. In OTM Confederated International Conferences “On the Move to Meaningful Internet Systems”. Springer, 1223-1237.
[27]
Robert Speer, Joshua Chin, and Catherine Havasi. 2017. ConceptNet 5.5: An Open Multilingual Graph of General Knowledge. In AAAI Conference on Artificial Intelligence. AAAI, 4444-4451.
[28]
Kristina Toutanova, Victoria Lin, Wen-tau Yih, Hoifung Poon, and Chris Quirk. 2016. Compositional Learning of Embeddings for Relation Paths in Knowledge Base and Text. In Procedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL), Vol. 1. 1434-1444.
[29]
Luis Von Ahn, Mihir Kedia, and Manuel Blum. 2006. Verbosity: A Game for Collecting Common-Sense Facts. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI). ACM, 75-78.
[30]
Ellen M Voorhees. 2004. Overview of the TREC 2004 Robust Retrieval Track. In TREC.
[31]
Denny Vrandecic and Markus Krötzsch. 2014. Wikidata: A Free Collaborative Knowledgebase. Communications of the ACM (CACM) 57, 10 (2014), 78-85.
[32]
Wenhan Xiong, Thien Hoang, and William Yang Wang. 2017. DeepPath: A Reinforcement Learning Method for Knowledge Graph Reasoning. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP). 564-573.
[33]
Chang Xu, Yalong Bai, Jiang Bian, Bin Gao, Gang Wang, Xiaoguang Liu, and Tie-Yan Liu. 2014. RC-NET: A General Framework for Incorporating Knowledge into Word Representations. In Procedings of the 23rd ACM International Conference Information and Knowledge Management (CIKM). ACM, 1219-1228.
[34]
Jinxi Xu and W. Bruce Croft. 1996. Query Expansion Using Local and Global Document Analysis. In Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, USA, 4-11.
[35]
George Kingsley Zipf. 1935. The Psycho-Biology of Language. (1935).

Cited By

View all
  • (2024)A novel customizing knowledge graph evaluation method for incorporating user needsScientific Reports10.1038/s41598-024-60004-x14:1Online publication date: 26-Apr-2024
  • (2024)Extraction of object-action and object-state associations from Knowledge GraphsJournal of Web Semantics10.1016/j.websem.2024.10081681(100816)Online publication date: Jul-2024
  • (2024)Detection Model of News Distortion Based on Chinese-German Bilingual Knowledge GraphsWeb Information Systems and Applications10.1007/978-981-97-7707-5_42(512-523)Online publication date: 11-Sep-2024

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
WWW '19: The World Wide Web Conference
May 2019
3620 pages
ISBN:9781450366748
DOI:10.1145/3308558
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

  • IW3C2: International World Wide Web Conference Committee

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 May 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Commonsense Knowledge
  2. ConceptNet
  3. Crowdsourcing
  4. Feature Selection
  5. Knowledge Graph

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

WWW '19
WWW '19: The Web Conference
May 13 - 17, 2019
CA, San Francisco, USA

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)132
  • Downloads (Last 6 weeks)11
Reflects downloads up to 21 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)A novel customizing knowledge graph evaluation method for incorporating user needsScientific Reports10.1038/s41598-024-60004-x14:1Online publication date: 26-Apr-2024
  • (2024)Extraction of object-action and object-state associations from Knowledge GraphsJournal of Web Semantics10.1016/j.websem.2024.10081681(100816)Online publication date: Jul-2024
  • (2024)Detection Model of News Distortion Based on Chinese-German Bilingual Knowledge GraphsWeb Information Systems and Applications10.1007/978-981-97-7707-5_42(512-523)Online publication date: 11-Sep-2024
  • (2022)ECCKG: An Eventuality-Centric Commonsense Knowledge GraphKnowledge Science, Engineering and Management10.1007/978-3-031-10983-6_44(568-584)Online publication date: 6-Aug-2022

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media