Reinforcement control with fuzzy-rules emulated network for robust-optimal drug-dosing of cancer dynamics

Chidentree Treesatayapun¹ &
Aldo Jonathan Muñoz-Vázquez²

285 Accesses
1 Altmetric
Explore all metrics

Abstract

In this article, a nonlinear mathematical model of the biological phenomena in chemotherapy cancer treatment is considered as a class of unknown discrete-time systems when the input data and the measured output are only available. The input data are the drug administration represented as the control effort and the output is the tumor cells population. As a result, the actor-critic architecture is constructed without the full-state observer. Two sets of IF-THEN rules are utilized for fuzzy rules emulated networks by human knowledge according to the pharmacokinetic and pharmacodynamic details. The learning laws are derived from the concept of the incoherent reward function. Thus, the convergence of the internal signals and the robustness are accomplished by the theoretical and numerical results. Furthermore, the comparative results are given to demonstrate the effectiveness of the proposed scheme.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price includes VAT (United Kingdom)

Instant access to the full article PDF.

Institutional subscriptions

Reinforcement learning optimal control with semi-continuous reward function and fuzzy-rules networks for drug administration of cancer treatment

Article 15 April 2023

Knowledge-based reinforcement learning controller with fuzzy-rule network: experimental validation

Article 03 October 2019

How an Adaptive Learning Rate Benefits Neuro-Fuzzy Reinforcement Learning Systems

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

References

Sharifi M, Moradi H (2019) Nonlinear composite adaptive control of cancer chemotherapy with online identification of uncertain parameters. Biomed Signal Process Control 49:360–374
Article Google Scholar
Selvanambi R, Natarajan J, Karuppiah M, Islam SH, Hassan MM, Fortino G (2020) RETRACTED ARTICLE: lung cancer prediction using higher-order recurrent neural network based on glowworm swarm optimization. Neural Comput Appl 32:4373–4386
Article Google Scholar
Robertson-Tessi M, El-Kareh A, Goriely A (2011) A mathematical model of tumor-immune interactions. J Theor Biol 294:56–73
Article MathSciNet MATH Google Scholar
Rihan FA, Velmurugan G (2020) Dynamics of fractional-order delay differential model for tumor-immune system. Chaos Solitons Fractals 132:109592
Article MathSciNet MATH Google Scholar
Parthasakha D, Samhita D, Pritha D, Rihan FA, Uzuntarla M (2021) Dibakar Ghosh, Optimal control strategy for cancer remission using combinatorial therapy: a mathematical model-based approach, Chaos. Solitons Fractals 145:110789
Article MATH Google Scholar
Manavalan R, Priya S (2021) Genetic interactions effects for cancer disease identification using computational models: a review. Med Biol Eng Comput 59:733–758
Article Google Scholar
Sweilam NH, Al-Mekhlafi SM, Albalawi AO, Tenreiro-Machado JA (2021) Optimal control of variable-order fractional model for delay cancer treatments. Appl Math Modell 89:1557–1574
Article MathSciNet MATH Google Scholar
Algoul S, Alam MS, Hossain MA, Majumder MAA (2011) Multi-objective optimal chemotherapy control model for cancer treatment. Med Biol Eng Comput 49:51–65
Article Google Scholar
Chen T, Kirkby NF, Jena R (2012) Optimal dosing of cancer chemotherapy using model predictive control and moving horizon state/parameter estimation. Comput Methods Progr Biomed 108(3):973–983
Article Google Scholar
Noble SL, Sherer E, Hannemann RE, Ramkrishna D, Vik T, Rundell AE (2010) Using adaptive model predictive control to customize maintenance therapy chemotherapeutic dosing for childhood acute lymphoblastic leukemia. J Theor Biol 264(3):990–1002
Article MathSciNet MATH Google Scholar
Yu G, Wu J (2022) Efficacy prediction based on attribute and multi-source data collaborative for auxiliary medical system in developing countries. Neural Comput Appl 34:5497–5512
Article Google Scholar
Liu J, Wang XS (2019) Numerical optimal control of a size-structured PDE model for metastatic cancer treatment. Math Biosci 314:28–42
Article MathSciNet MATH Google Scholar
Bermudez-Contreras E (2021) Deep reinforcement learning to study spatial navigation, learning and memory in artificial and biological agents. Biol Cybern 115:131–134
Article Google Scholar
Nowakowski K, Carvalho P, Six JB, Maillet Y, Nguyen AT, Seghiri I, Pemba LM, Marcille T, Ngo ST, Dao TT (2021) Human locomotion with reinforcement learning using bioinspired reward reshaping strategies. Med Biol Eng Comput 59:243–256
Article Google Scholar
Padmanabhan R, Meskin N, Haddad WM (2017) Reinforcement learning-based control of drug dosing for cancer chemotherapy treatment. Math Biosci 293:11–20
Article MathSciNet MATH Google Scholar
Yazdjerdi P, Meskin N, Al-Naemi M, Moustafa AE, Kovacs L (2019) Reinforcement learning-based control of tumor growth under anti-angiogenic therapy. Comput Methods Programs Biomed 173:15–26
Article Google Scholar
Batmani Y, Khaloozadeh H (2013) Optimal chemotherapy in cancer treatment: state dependent Riccati equation control and extended Kalman filter. Optim Control Appl Methods 34(5):562–577
Article MathSciNet MATH Google Scholar
Babaei N, Salamci MU (2015) Personalized drug administration for cancer treatment using Model Reference Adaptive Control. J Theor Biol 371:24–44
Article MathSciNet MATH Google Scholar
Friston K, Samothrakis S, Montague R (2012) Active inference and agency: optimal control without cost functions. Biol Cybern 106:523–541
Article MathSciNet MATH Google Scholar
Azar AT, El-Said SA (2013) Superior neuro-fuzzy classification systems. Neural Comput Appl 23:55–72
Article Google Scholar
Hou Z, Chi R, Gao H (2017) An overview of dynamic-linearization-based data-driven control and applications. IEEE Trans Ind Electron 64(5):4076–4090
Article Google Scholar
Hyunseong L, Hyung JL, Chattopadhyay A (2021) A data-driven time-series fault prediction framework for dynamically evolving large-scale data streaming systems. Neural Comput Appl 33:3235–3250
Google Scholar
Sharma PJ, Patel PL, Jothiprakash V (2021) Data-driven modelling framework for streamflow prediction in a physio-climatically heterogeneous river basin. Soft Comput 25:5951–5978
Article Google Scholar
Zhang M, Gan MG (2019) Data-driven adaptive optimal control for linear systems with structured time-varying uncertainty. IEEE Access 7:9215–9224
Article Google Scholar
Yuliang C, Huaguang Z, Zhang K, Chong L (2020) Fuzzy adaptive dynamic programming-based optimal leader-following consensus for heterogeneous nonlinear multi-agent systems. Neural Comput Appl 32:8763–8781
Article Google Scholar
Qiu R, Sun Y, Fan Z, Sun M (2020) Robust multi-product inventory optimization under support vector clustering-based data-driven demand uncertainty set. Soft Comput 24:6259–6275
Article MATH Google Scholar
Shang C, Chen WH, Stroock AD, You F (2020) Robust model predictive control of irrigation systems with active uncertainty learning and data analytics. IEEE Trans Control Syst Technol 28(4):1493–1504
Article Google Scholar
Hamza MF, Yap HJ, Choudhury IA (2017) Recent advances on the use of meta-heuristic optimization algorithms to optimize the type-2 fuzzy logic systems in intelligent control. Neural Comput Appl 28:979–999
Article Google Scholar
Wieser E, Cheng G (2020) EO-MTRNN: evolutionary optimization of hyperparameters for a neuro-inspired computational model of spatiotemporal learning. Biol Cybern 114:363–387
Article MATH Google Scholar
Cetin O, Temurtas F (2021) A comparative study on classification of magnetoencephalography signals using probabilistic neural network and multilayer neural network. Soft Comput 25:2267–2275
Article Google Scholar
Treesatayapun C (2020) Prescribed performance of discrete-time controller based on the dynamic equivalent data model. Appl Math Model 78:366–382
Article MathSciNet MATH Google Scholar
Danial E, Mehrdad GS, Zhang WJ (2021) A semiempirical model for rate of penetration with application to an offshore gas field. SPE Drill Complet 36:29–46
Article Google Scholar
Cai M, Lin Y, Han B, Liu C, Zhang W (2017) On a simple and efficient approach to probability distribution function aggregation. IEEE Trans Syst Man Cybern Syst 47(9):2444–2453
Google Scholar
Nithilasaravanan K, Thakwani N, Mishra P, Kumar V, Rana KPS (2019) Adaptive fuzzy variable structure control of fractional-order nonlinear systems with input nonlinearities. Neural Comput Appl 31:4137–4155
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Engineering and Technology, Walailak University, 222 Thaiburi, Thasala District, Nakhonsrithammarat, 80161, Thailand
Chidentree Treesatayapun
Department of Multidisciplinary Engineering, Texas A &M University, Higher Education Center at McAllen, 6200 Tres Lagos Blvd., McAllen, TX, 78504, United States
Aldo Jonathan Muñoz-Vázquez

Authors

Chidentree Treesatayapun
View author publications
You can also search for this author in PubMed Google Scholar
Aldo Jonathan Muñoz-Vázquez
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

CT: Conceptualization, Formal analysis, Research, MiFREN methodology, Validation results, Writing, Review Editing. AJM-V: Conceptualization, Formal analysis, Research, Controller design, Simulations, Writing, Editing.

Corresponding author

Correspondence to Chidentree Treesatayapun.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethics approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Informed consent

Written informed consent for publication was obtained from all participants.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Treesatayapun, C., Muñoz-Vázquez, A.J. Reinforcement control with fuzzy-rules emulated network for robust-optimal drug-dosing of cancer dynamics. Neural Comput & Applic 35, 11701–11711 (2023). https://doi.org/10.1007/s00521-023-08312-7

Download citation

Received: 09 August 2022
Accepted: 16 January 2023
Published: 02 February 2023
Issue Date: June 2023
DOI: https://doi.org/10.1007/s00521-023-08312-7

Reinforcement control with fuzzy-rules emulated network for robust-optimal drug-dosing of cancer dynamics

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Reinforcement learning optimal control with semi-continuous reward function and fuzzy-rules networks for drug administration of cancer treatment

Knowledge-based reinforcement learning controller with fuzzy-rule network: experimental validation

How an Adaptive Learning Rate Benefits Neuro-Fuzzy Reinforcement Learning Systems

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethics approval

Informed consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Reinforcement control with fuzzy-rules emulated network for robust-optimal drug-dosing of cancer dynamics

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Reinforcement learning optimal control with semi-continuous reward function and fuzzy-rules networks for drug administration of cancer treatment

Knowledge-based reinforcement learning controller with fuzzy-rule network: experimental validation

How an Adaptive Learning Rate Benefits Neuro-Fuzzy Reinforcement Learning Systems

Explore related subjects

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethics approval

Informed consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation