[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3340531.3416020acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Expert-in-the-loop AI for Polymer Discovery

Published: 19 October 2020 Publication History

Abstract

The use of AI in knowledge dense domains, e.g., chemistry, medicine, biology, etc. - is extremely promising, but often suffers from slow deployment and adaptation to different tasks. We propose a methodology to quickly capture the intent and expertise of a domain expert in order to train personalized AI models for specific tasks. Specifically we focus on the domain of polymer materials design and discovery: it often takes 10 years or more to design, synthesize, test, and introduce a new polymer material into the market. One way to accelerate up the design of polymer materials is through the use of computational methods to design the material, such as combinatorial screening, generative models, inverse design, etc. The drawback of these methods is that they generate a large number of candidates for new molecules, which then need to be manually reviewed by subject matter experts who select only a dozen for further investigation. Our solution is a human-in-the-loop methodology where we rank the candidates according to a utility function that is learned via the continued interaction with the subject matter experts, but which is also constrained by specific chemical knowledge. We prove the viability of our proposed methodology in a polymer production lab and we (i) evaluate against datasets of polymers previously produced in the lab as well as (ii) producing several novel materials that are undergoing experimental development, and (iii) quantitatively show that standard synthetic accessibility scores do not inform about patterns of SME decisions.

References

[1]
M. Batool, B. Ahmad, and S. Choi. 2019. A Structure-Based Drug Discovery Paradigm. Int J Mol Sci, Vol. 20, 11 (Jun 2019).
[2]
Yoshua Bengio, Patrice Simard, and Paolo Frasconi. 1994. Learning long-term dependencies with gradient descent is difficult. IEEE Trans Neural Netw, Vol. 5, 2 (1994), 157--166.
[3]
Dean G. Brown and Jonas Bostram. 2018. Where Do Recent Small Molecule Clinical Development Candidates Come From? Journal of Medicinal Chemistry, Vol. 61, 21 (2018), 9442--9468.
[4]
Fiorella Cravero, Santiago A. Schustik, Marí a Jimena Martí nez, Carlos D. Barranco, Mó nica F. Dí az, and Ignacio Ponzoni. 2019. Computer-aided design of polymeric materials: Computational study for characterization of databases for prediction of mechanical properties under polydispersity. Chemometrics and Intelligent Laboratory Systems, Vol. 191, March (2019), 65--72.
[5]
Nicola De Cao and Thomas Kipf. 2018. MolGAN: An implicit generative model for small molecular graphs. arXiv preprint arXiv:1805.11973 (2018).
[6]
Jörg Degen, Christof Wegscheid-Gerlach, Andrea Zaliani, and Matthias Rarey. 2008. On the Art of Compiling and Using'Drug-Like'Chemical Fragment Spaces. ChemMedChem: Chemistry Enabling Drug Discovery, Vol. 3, 10 (2008), 1503--1507.
[7]
Daniel C Elton, Zois Boukouvalas, Mark D Fuge, and Peter W Chung. 2019. Deep learning for molecular generation and optimization-a review of the state of the art. arXiv preprint arXiv:1903.04388 (2019).
[8]
Peter Ertl and Ansgar Schuffenhauer. 2009. Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions. Journal of cheminformatics, Vol. 1, 1 (2009), 8.
[9]
Palash Goyal and Emilio Ferrara. 2018. Graph embedding techniques, applications, and performance: A survey. Knowledge-Based Systems, Vol. 151 (2018), 78--94.
[10]
Gabriel Lima Guimaraes, Benjamin Sanchez-Lengeling, Carlos Outeiral, Pedro Luis Cunha Farias, and Alán Aspuru-Guzik. 2017. Objective-reinforced generative adversarial networks (ORGAN) for sequence generation models. arXiv preprint arXiv:1705.10843 (2017).
[11]
Sepp Hochreiter, Yoshua Bengio, Paolo Frasconi, Jürgen Schmidhuber, et almbox. 2001. Gradient flow in recurrent nets: the difficulty of learning long-term dependencies.
[12]
Andreas Holzinger. 2016. Interactive machine learning for health informatics: when do we need the human-in-the-loop? Brain Informatics, Vol. 3, 2 (2016), 119--131.
[13]
Tran Doan Huan, Arun Mannodi-Kanakkithodi, Chiho Kim, Vinit Sharma, Ghanshyam Pilania, and Rampi Ramprasad. 2016. A polymer dataset for accelerated property prediction and design. Scientific Data, Vol. 3 (2016), 1--10. https://doi.org/10.1038/sdata.2016.12
[14]
Masashi Ishii, Taro Takemura, and Mikiko Tanifuji. [n.d.]. PoLyInfo RDF: A Semantically Reinforced Polymer Database for Materials Informatics. In Proceedings of the ISWC 2019 Satellite Tracks, Auckland, New Zealand, October 26-30, 2019. 69--72.
[15]
Kevin Lin, Dianqi Li, Xiaodong He, Zhengyou Zhang, and Ming-ting Sun. 2017. Adversarial Ranking for Language Generation. In Advances in Neural Information Processing Systems 30. 3155--3165.
[16]
Yue Liu, Tianlu Zhao, Wangwei Ju, and Siqi Shi. 2017. Materials discovery and design using machine learning. J Materiomics, Vol. 3, 3 (2017), 159--177.
[17]
Arun Mannodi-kanakkithodi, Ghanshyam Pilania, Tran Doan Huan, and Turab Lookman. 2016. Machine Learning Strategy for Accelerated Design of Polymer Dielectrics. Nature Publishing Group January (2016), 1--10. https://doi.org/10.1038/srep20952
[18]
Robert Munro. April 2020 (estimated). Human-in-the-Loop Machine Learning. Vol. 11. MEAP.
[19]
Tarak K. Patra, Venkatesh Meenakshisundaram, Jui Hsiang Hung, and David S. Simmons. 2017. Neural-Network-Biased Genetic Algorithms for Materials Design: Evolutionary Algorithms That Learn. ACS Combinatorial Science, Vol. 19, 2 (2017), 96--107.
[20]
Sai Rajeswar, Sandeep Subramanian, Francis Dutil, Christopher Pal, and Aaron Courville. 2017. Adversarial generation of natural language. arXiv preprint arXiv:1705.10929 (2017).
[21]
David Rogers and Mathew Hahn. 2010. Extended-connectivity fingerprints. Journal of chemical information and modeling, Vol. 50, 5 (2010), 742--754.
[22]
Miha Skalic, José Jiménez, Davide Sabbadin, and Gianni De Fabritiis. 2019. Shape-Based Generative Modeling for de Novo Drug Design. Journal of chemical information and modeling, Vol. 59, 3 (2019), 1205--1214.
[23]
Justin S. Smith, Adrian E. Roitberg, and Olexandr Isayev. 2018. Transforming Computational Drug Discovery with Machine Learning and AI. ACS Medicinal Chemistry Letters, Vol. 9, 11 (2018).
[24]
Roberto Todeschini and Viviana Consonni. 2008. Handbook of molecular descriptors. Vol. 11. John Wiley & Sons.
[25]
C C Wang, G Pilania, S A Boggs, S Kumar, C Breneman, and R Ramprasad. 2014. Computational strategies for polymer dielectrics design. Polymer, Vol. 55, 4 (2014), 979--988. https://doi.org/10.1016/j.polymer.2013.12.069
[26]
David Weininger. 1988. SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. Journal of chemical information and computer sciences, Vol. 28, 1 (1988), 31--36.
[27]
Stephen Wu, Yukiko Kondo, Masa aki Kakimoto, Bin Yang, Hironao Yamada, Isao Kuwajima, Guillaume Lambard, Kenta Hongo, Yibin Xu, Junichiro Shiomi, Christoph Schick, Junko Morikawa, and Ryo Yoshida. 2019. Machine-learning-assisted discovery of polymers with high thermal conductivity using a molecular design algorithm. npj Computational Materials, Vol. 5, 1 (2019). http://dx.doi.org/10.1038/s41524-019-0203-2
[28]
Lantao Yu, Weinan Zhang, Jun Wang, and Yong Yu. 2017. Seqgan: Sequence generative adversarial nets with policy gradient. In Thirty-First AAAI Conference on Artificial Intelligence.
[29]
Yizhe Zhang, Zhe Gan, Kai Fan, Zhi Chen, Ricardo Henao, Dinghan Shen, and Lawrence Carin. 2017. Adversarial feature matching for text generation. In Proceedings of the 34th International Conference on Machine Learning-Volume 70. JMLR. org, 4006--4015.

Cited By

View all
  • (2023)Artificial intelligence driven design of catalysts and materials for ring opening polymerization using a domain-specific languageNature Communications10.1038/s41467-023-39396-314:1Online publication date: 21-Jun-2023
  • (2023)Emerging Trends in Machine Learning: A Polymer PerspectiveACS Polymers Au10.1021/acspolymersau.2c000533:3(239-258)Online publication date: 18-Jan-2023
  • (2023)Predicting the complex stress-strain curves of polymeric solids by classification-embedded dual neural networkMaterials & Design10.1016/j.matdes.2023.111773227(111773)Online publication date: Mar-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management
October 2020
3619 pages
ISBN:9781450368599
DOI:10.1145/3340531
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 October 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. artificial intelligence
  2. expert-in-the-loop
  3. polymer discovery

Qualifiers

  • Research-article

Conference

CIKM '20
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)70
  • Downloads (Last 6 weeks)12
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Artificial intelligence driven design of catalysts and materials for ring opening polymerization using a domain-specific languageNature Communications10.1038/s41467-023-39396-314:1Online publication date: 21-Jun-2023
  • (2023)Emerging Trends in Machine Learning: A Polymer PerspectiveACS Polymers Au10.1021/acspolymersau.2c000533:3(239-258)Online publication date: 18-Jan-2023
  • (2023)Predicting the complex stress-strain curves of polymeric solids by classification-embedded dual neural networkMaterials & Design10.1016/j.matdes.2023.111773227(111773)Online publication date: Mar-2023
  • (2022)Accelerating materials discovery using artificial intelligence, high performance computing and roboticsnpj Computational Materials10.1038/s41524-022-00765-z8:1Online publication date: 26-Apr-2022

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media