Abstract
In this article, a nonlinear mathematical model of the biological phenomena in chemotherapy cancer treatment is considered as a class of unknown discrete-time systems when the input data and the measured output are only available. The input data are the drug administration represented as the control effort and the output is the tumor cells population. As a result, the actor-critic architecture is constructed without the full-state observer. Two sets of IF-THEN rules are utilized for fuzzy rules emulated networks by human knowledge according to the pharmacokinetic and pharmacodynamic details. The learning laws are derived from the concept of the incoherent reward function. Thus, the convergence of the internal signals and the robustness are accomplished by the theoretical and numerical results. Furthermore, the comparative results are given to demonstrate the effectiveness of the proposed scheme.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Sharifi M, Moradi H (2019) Nonlinear composite adaptive control of cancer chemotherapy with online identification of uncertain parameters. Biomed Signal Process Control 49:360–374
Selvanambi R, Natarajan J, Karuppiah M, Islam SH, Hassan MM, Fortino G (2020) RETRACTED ARTICLE: lung cancer prediction using higher-order recurrent neural network based on glowworm swarm optimization. Neural Comput Appl 32:4373–4386
Robertson-Tessi M, El-Kareh A, Goriely A (2011) A mathematical model of tumor-immune interactions. J Theor Biol 294:56–73
Rihan FA, Velmurugan G (2020) Dynamics of fractional-order delay differential model for tumor-immune system. Chaos Solitons Fractals 132:109592
Parthasakha D, Samhita D, Pritha D, Rihan FA, Uzuntarla M (2021) Dibakar Ghosh, Optimal control strategy for cancer remission using combinatorial therapy: a mathematical model-based approach, Chaos. Solitons Fractals 145:110789
Manavalan R, Priya S (2021) Genetic interactions effects for cancer disease identification using computational models: a review. Med Biol Eng Comput 59:733–758
Sweilam NH, Al-Mekhlafi SM, Albalawi AO, Tenreiro-Machado JA (2021) Optimal control of variable-order fractional model for delay cancer treatments. Appl Math Modell 89:1557–1574
Algoul S, Alam MS, Hossain MA, Majumder MAA (2011) Multi-objective optimal chemotherapy control model for cancer treatment. Med Biol Eng Comput 49:51–65
Chen T, Kirkby NF, Jena R (2012) Optimal dosing of cancer chemotherapy using model predictive control and moving horizon state/parameter estimation. Comput Methods Progr Biomed 108(3):973–983
Noble SL, Sherer E, Hannemann RE, Ramkrishna D, Vik T, Rundell AE (2010) Using adaptive model predictive control to customize maintenance therapy chemotherapeutic dosing for childhood acute lymphoblastic leukemia. J Theor Biol 264(3):990–1002
Yu G, Wu J (2022) Efficacy prediction based on attribute and multi-source data collaborative for auxiliary medical system in developing countries. Neural Comput Appl 34:5497–5512
Liu J, Wang XS (2019) Numerical optimal control of a size-structured PDE model for metastatic cancer treatment. Math Biosci 314:28–42
Bermudez-Contreras E (2021) Deep reinforcement learning to study spatial navigation, learning and memory in artificial and biological agents. Biol Cybern 115:131–134
Nowakowski K, Carvalho P, Six JB, Maillet Y, Nguyen AT, Seghiri I, Pemba LM, Marcille T, Ngo ST, Dao TT (2021) Human locomotion with reinforcement learning using bioinspired reward reshaping strategies. Med Biol Eng Comput 59:243–256
Padmanabhan R, Meskin N, Haddad WM (2017) Reinforcement learning-based control of drug dosing for cancer chemotherapy treatment. Math Biosci 293:11–20
Yazdjerdi P, Meskin N, Al-Naemi M, Moustafa AE, Kovacs L (2019) Reinforcement learning-based control of tumor growth under anti-angiogenic therapy. Comput Methods Programs Biomed 173:15–26
Batmani Y, Khaloozadeh H (2013) Optimal chemotherapy in cancer treatment: state dependent Riccati equation control and extended Kalman filter. Optim Control Appl Methods 34(5):562–577
Babaei N, Salamci MU (2015) Personalized drug administration for cancer treatment using Model Reference Adaptive Control. J Theor Biol 371:24–44
Friston K, Samothrakis S, Montague R (2012) Active inference and agency: optimal control without cost functions. Biol Cybern 106:523–541
Azar AT, El-Said SA (2013) Superior neuro-fuzzy classification systems. Neural Comput Appl 23:55–72
Hou Z, Chi R, Gao H (2017) An overview of dynamic-linearization-based data-driven control and applications. IEEE Trans Ind Electron 64(5):4076–4090
Hyunseong L, Hyung JL, Chattopadhyay A (2021) A data-driven time-series fault prediction framework for dynamically evolving large-scale data streaming systems. Neural Comput Appl 33:3235–3250
Sharma PJ, Patel PL, Jothiprakash V (2021) Data-driven modelling framework for streamflow prediction in a physio-climatically heterogeneous river basin. Soft Comput 25:5951–5978
Zhang M, Gan MG (2019) Data-driven adaptive optimal control for linear systems with structured time-varying uncertainty. IEEE Access 7:9215–9224
Yuliang C, Huaguang Z, Zhang K, Chong L (2020) Fuzzy adaptive dynamic programming-based optimal leader-following consensus for heterogeneous nonlinear multi-agent systems. Neural Comput Appl 32:8763–8781
Qiu R, Sun Y, Fan Z, Sun M (2020) Robust multi-product inventory optimization under support vector clustering-based data-driven demand uncertainty set. Soft Comput 24:6259–6275
Shang C, Chen WH, Stroock AD, You F (2020) Robust model predictive control of irrigation systems with active uncertainty learning and data analytics. IEEE Trans Control Syst Technol 28(4):1493–1504
Hamza MF, Yap HJ, Choudhury IA (2017) Recent advances on the use of meta-heuristic optimization algorithms to optimize the type-2 fuzzy logic systems in intelligent control. Neural Comput Appl 28:979–999
Wieser E, Cheng G (2020) EO-MTRNN: evolutionary optimization of hyperparameters for a neuro-inspired computational model of spatiotemporal learning. Biol Cybern 114:363–387
Cetin O, Temurtas F (2021) A comparative study on classification of magnetoencephalography signals using probabilistic neural network and multilayer neural network. Soft Comput 25:2267–2275
Treesatayapun C (2020) Prescribed performance of discrete-time controller based on the dynamic equivalent data model. Appl Math Model 78:366–382
Danial E, Mehrdad GS, Zhang WJ (2021) A semiempirical model for rate of penetration with application to an offshore gas field. SPE Drill Complet 36:29–46
Cai M, Lin Y, Han B, Liu C, Zhang W (2017) On a simple and efficient approach to probability distribution function aggregation. IEEE Trans Syst Man Cybern Syst 47(9):2444–2453
Nithilasaravanan K, Thakwani N, Mishra P, Kumar V, Rana KPS (2019) Adaptive fuzzy variable structure control of fractional-order nonlinear systems with input nonlinearities. Neural Comput Appl 31:4137–4155
Author information
Authors and Affiliations
Contributions
CT: Conceptualization, Formal analysis, Research, MiFREN methodology, Validation results, Writing, Review Editing. AJM-V: Conceptualization, Formal analysis, Research, Controller design, Simulations, Writing, Editing.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethics approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Informed consent
Written informed consent for publication was obtained from all participants.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Treesatayapun, C., Muñoz-Vázquez, A.J. Reinforcement control with fuzzy-rules emulated network for robust-optimal drug-dosing of cancer dynamics. Neural Comput & Applic 35, 11701–11711 (2023). https://doi.org/10.1007/s00521-023-08312-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-023-08312-7