[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.24963/ijcai.2023/401guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
research-article

DeepPSL: end-to-end perception and reasoning

Published: 19 August 2023 Publication History

Abstract

We introduce DeepPSL a variant of probabilistic soft logic (PSL) to produce an end-to-end trainable system that integrates reasoning and perception. PSL represents first-order logic in terms of a convex graphical model -- hinge-loss Markov random fields (HL-MRFs). PSL stands out among probabilistic logic frameworks due to its tractability having been applied to systems of more than 1 billion ground rules. The key to our approach is to represent predicates in first-order logic using deep neural networks and then to approximately back-propagate through the HL-MRF and thus train every aspect of the first-order system being represented. We believe that this approach represents an interesting direction for the integration of deep learning and reasoning techniques with applications to knowledge base learning, multi-task learning, and explainability. Evaluation on three different tasks demonstrates that DeepPSL significantly outperforms state-of-the-art neuro-symbolic methods on scalability while achieving comparable or better accuracy.

References

[1]
Akshay Agrawal, Brandon Amos, Shane Barratt, Stephen Boyd, Steven Diamond, and J. Zico Kolter. Differentiable Convex Optimization Layers. In Advances in Neural Information Processing Systems, volume 32. Curran Associates Inc., 2019.
[2]
Akshay Agrawal, Shane Barratt, Stephen Boyd, Enzo Busseti, and Walaa M. Moursi. Differentiating Through a Cone Program. arXiv; 1904.09043, 2020.
[3]
Brandon Amos and J Zico Kolter. Optnet: Differentiable Optimization as a Layer in Neural Networks. In International Conference on Machine Learning, pages 136-145. PMLR, 2017.
[4]
E. Augustine and L. Getoor. A Comparison of Bottom-Up Approaches to Grounding for Templated Markov Random Fields. In SysML, 2018.
[5]
Stephen H Bach, Matthias Broecheler, Bert Huang, and Lise Getoor. Hinge-Loss Markov Random Fields and Probabilistic Soft Logic. Journal of Machine Learning Research, 18:1-67, 2017.
[6]
Mikhail Belkin, Partha Niyogi, and Vikas Sindhwani. Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. Journal of Machine Learning Research, 7(11), 2006.
[7]
Indrajit Bhattacharya and Lise Getoor. Collective Entity Resolution In Relational Data. ACM Transactions on Knowledge Discovery from Data, pages 1-36, 2007.
[8]
Michelle Cheatham and Pascal Hitzler. String Similarity Metrics for Ontology Alignment. In The Semantic Web - ISWC 2013, pages 294-309, Berlin, Heidelberg, 2013. Springer Berlin Heidelberg.
[9]
W. Cohen, P. Ravikumar, and S. Fienberg. A Comparison of String Distance Metrics for Name-Matching Tasks. In The IJCAI Workshop on Information Integration on the Web (IIWeb), 2003.
[10]
William W. Cohen, Fan Yang, and Kathryn Mazaitis. TensorLog: A Probabilistic Database Implemented Using Deep-Learning Infrastructure. Journal of Artificial Intelligence Research, 67:285-325, 2020.
[11]
Stephan Dempe. Bilevel Optimization: Theory, Algorithms and Applications. TU Bergakademie Freiberg, Fakultät für Mathematik und Informatik, 2018.
[12]
Michelangelo Diligenti, Marco Gori, and Claudio Saccà. Semantic-based regularization for learning and inference. Artificial Intelligence, 244:143- 165, 2017.
[13]
Ivan Donadello, Luciano Serafini, and Artur D'Avila Garcez. Logic Tensor Networks for Semantic Image Interpretation. In Proceedings of the 26th International Joint Conference on Artificial Intelligence, page 1596-1602. AAAI Press, 2017.
[14]
Honghua Dong, Jiayuan Mao, Tian Lin, Chong Wang, Lihong Li, and Denny Zhou. Neural Logic Machines. In International Conference on Learning Representations, 2019.
[15]
Najlah Gali, Radu Mariescu-Istodor, and Pasi Franti. Similarity Measures for Title Matching. In International Conference on Pattern Recognition (ICPR), 2016.
[16]
Marouene Sfar Gandoura, Zografoula Vagena, and Nikolaos Vasiloglou. Human in the Loop Enrichment of Product Graphs with Probabilistic Soft Logic. In Proceedings of Knowledge Graphs and E-commerce, KDD 20, 2020.
[17]
Saeed Ghadimi and Mengdi Wang. Approximation Methods for Bilevel Programming. arXiv preprint arXiv:1802.02246, 2018.
[18]
Stephen Gould, Basura Fernando, Anoop Cherian, Peter Anderson, Rodrigo Santa Cruz, and Edison Guo. On Differentiating Parameterized Argmin and Argmax Problems with Application to Bi-level Optimization. arXiv; 1607.05447, 2016.
[19]
Mourad Gridach. A framework based on (probabilistic) soft logic and neural network for NLP. Applied Soft Computing, 93:106232, 2020.
[20]
Zhiting Hu, Xuezhe Ma, Zhengzhong Liu, Eduard Hovy, and Eric Xing. Harnessing Deep Neural Networks with Logic Rules. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2410-2420. Association for Computational Linguistics, August 2016.
[21]
Matt Jordan and Alexandros G Dimakis. Exactly Computing the Local Lipschitz Constant of ReLU Networks. In H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, and H. Lin, editors, Advances in Neural Information Processing Systems, volume 33, pages 7344-7353, 2020.
[22]
Thomas N. Kipf and Max Welling. Semi-Supervised Classification with Graph Convolutional Networks. In 5th International Conference on Learning Representations, ICLR 2017. OpenReview.net, 2017.
[23]
Gregory Koch, Richard Zemel, and Ruslan Salakhutdinov. Siamese Neural Networks for One-shot Image Recognition. In ICML 2015 Deep Learning Worshop, 2015.
[24]
Daphne Koller, Nir Friedman, Lise Getoor, and Ben Taskar. Graphical Models in a Nutshell. Introduction to statistical relational learning, 43, 2007.
[25]
Pigi Kouki, Jay Pujara, Christopher Marcum, Laura Koehly, and Lise Getoor. Collective entity resolution in familial networks. In 2017 IEEE International Conference on Data Mining (ICDM), pages 227- 236. IEEE, 2017.
[26]
Kwonjoon Lee, Subhransu Maji, Avinash Ravichandran, and Stefano Soatto. Meta-Learning with Differentiable Convex Optimization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 10657-10665, 2019.
[27]
Qing Lu and Lise Getoor. Link-based classification. In Proceedings of the 20th International Conference on Machine Learning, pages 496-503. AAAI Press, 2003.
[28]
Robin Manhaeve, Sebastijan Dumancic, Angelika Kimmig, Thomas Demeester, and Luc De Raedt. DeepProbLog: Neural Probabilistic Logic Programming. In Advances in Neural Information Processing Systems, volume 31. Curran Associates, Inc., 2018.
[29]
Giuseppe Marra, Francesco Giannini, Michelangelo Diligenti, and Marco Gori. Integrating Learning and Reasoning with Deep Logic Models. In Machine Learning and Knowledge Discovery in Databases - European Conference, 2019, Proceedings, Part II, volume 11907, pages 517-532. Springer, 2019.
[30]
Giuseppe Marra, Michelangelo Diligenti, Francesco Giannini, Marco Gori, and Marco Maggini. Relational Neural Machines. In 24th European Conference on Artificial Intelligence, 2020.
[31]
Ian Perera, Jena Hwang, Kevin Bayas, Bonnie Dorr, and Yorick Wilks. Cyberattack Prediction Through Public Text Analysis and Mini-Theories. In 2018 IEEE International Conference on Big Data (Big Data), pages 3001-3010. IEEE, 2018.
[32]
Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 701-710, 2014.
[33]
Connor Pryor, Charles Dickens, Eriq Augustine, Alon Albalak, William Wang, and Lise Getoor. NeuPSL: Neural Probabilistic Soft Logic. arXiv preprint arXiv:2205.14268, 2022.
[34]
Luc De Raedt, Angelika Kimmig, and Hannu Toivonen. ProbLog: A Probabilistic Prolog and Its Application in Link Discovery. In 30th International Joint Conference on Artificial Intelligence, pages 2462- 2467, 2007.
[35]
Luc de Raedt, Sebastijan Dumančić, Robin Manhaeve, and Giuseppe Marra. From Statistical Relational to Neuro-Symbolic Artificial Intelligence. In 29th International Joint Conference on Artificial Intelligence, pages 4943-4950, 2020.
[36]
Matthew Richardson and Pedro Domingos. Markov Logic Networks. Machine Learning, 62(1-2):107-136, February 2006.
[37]
RT Rockafellar. Favorable Classes of Lipschitz Continuous Functions in Subgradient Optimization. 1981.
[38]
Tim Rocktäschel and Sebastian Riedel. Learning Knowledge Base Inference with Neural Theorem Provers. In Proceedings of the 5th Workshop on Automated Knowledge Base Construction, pages 45-50. Association for Computational Linguistics, June 2016.
[39]
Aaron Rodden, Tarun Salh, Eriq Augustine, and Lise Getoor. VMI-PSL: Visual Model Inspector for Probabilistic Soft Logic. In Fourteenth ACM Conference on Recommender Systems, pages 604-606, 2020.
[40]
Marco Rospocher. An Ontology-Driven Probabilistic Soft Logic Approach to Improve NLP Entity Annotations. In International Semantic Web Conference, pages 144-161. Springer, 2018.
[41]
Kevin Scaman and Aladin Virmaux. Lipschitz regularity of deep neural networks: analysis and efficient estimation. In Proceedings of the 32nd International Conference on Neural Information Processing Systems, pages 3839-3848, 2018.
[42]
Ankur Sinha, Pekka Malo, and Kalyanmoy Deb. A Review on Bilevel Optimization: From Classical to Evolutionary Approaches and Applications. IEEE Transactions on Evolutionary Computation, 22(2):276- 295, 2017.
[43]
Sriram Srinivasan, Eriq Augustine, and Lise Getoor. Tandem Inference: An Out-of-Core Streaming Algorithm for Very Large-Scale Relational Inference. Proceedings of the AAAI Conference on Artificial Intelligence, 34(06):10259-10266, 2020.
[44]
Jason Weston, Frédéric Ratle, Hossein Mobahi, and Ronan Collobert. Deep learning via semi-supervised embedding. In Neural Networks: Tricks of the Trade, pages 639-655, 2012.
[45]
Thomas Winters, Giuseppe Marra, Robin Manhaeve, and Luc De Raedt. Deepstochlog: Neural stochastic logic programming. Proceedings of the AAAI Conference on Artificial Intelligence, 36(9):10090-10100, Jun. 2022.
[46]
Zhilin Yang, William Cohen, and Ruslan Salakhudinov. Revisiting semi-supervised learning with graph embeddings. In International Conference on Machine Learning, pages 40-48. PMLR, 2016.
[47]
Zhun Yang, Adam Ishay, and Joohyung Lee. NeurASP: Embracing Neural networks into Answer Set Programming. In Proceedings of the 29th International Joint Conference on Artificial Intelligence, pages 1755-1762. International Joint Conferences on Artificial Intelligence Organization, 7 2020.
[48]
Wen-Tau Yih and Christopher Meek. Improving Similarity Measures for Short Segments of Text. In Proceedings of the 22nd National Conference on Artificial Intelligence - Volume 2, AAAI'07, page 1489-1494. AAAI Press, 2007.
[49]
Xiaojin Zhu, Zoubin Ghahramani, and John D Lafferty. Semi-supervised learning using gaussian fields and harmonic functions. In Proceedings of the 20th International Conference on Machine Learning, pages 912-919. AAAI Press, 2003.

Cited By

View all
  • (2024)Convex and bilevel optimization for neural-symbolic inference and learningProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692502(10865-10896)Online publication date: 21-Jul-2024
  • (2024)Embed2Rule Scalable Neuro-Symbolic Learning via Latent Space Weak-LabellingNeural-Symbolic Learning and Reasoning10.1007/978-3-031-71167-1_11(195-218)Online publication date: 9-Sep-2024

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
IJCAI '23: Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence
August 2023
7242 pages
ISBN:978-1-956792-03-4

Sponsors

  • International Joint Conferences on Artifical Intelligence (IJCAI)

Publisher

Unknown publishers

Publication History

Published: 19 August 2023

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Convex and bilevel optimization for neural-symbolic inference and learningProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692502(10865-10896)Online publication date: 21-Jul-2024
  • (2024)Embed2Rule Scalable Neuro-Symbolic Learning via Latent Space Weak-LabellingNeural-Symbolic Learning and Reasoning10.1007/978-3-031-71167-1_11(195-218)Online publication date: 9-Sep-2024

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media