Abstract
Parameter and initial conditions (factors) estimation is a challenging task in non-linear models. Even if researchers successfully estimate those model factors, they still should estimate their confidence intervals, which could require a high computational cost to turn them into informative values. Some methods in the literature attempt to estimate regions within the search space where factors may jointly exist and fit the experimental data, i.e., confidence contours. However, the estimation of such confidence contours comes up with several issues as the number of factors increases. In this work we propose a method to compute a region within the confidence contour of an inverse problem, we denote this region as the confidence sub-contour box (CSB). To estimate a CSB of an inverse problem, we propose two main algorithms alongside their interpretation and validation, testing their performance with two epidemiological models for different diseases and different kinds of data. Moreover, we exposed some desirable properties of the method through numerical experiments, such as user-defined uncertainty level, asymmetry of the resultant intervals regarding the nominal value, sensitivity assessment related to the interval length for each factor, and identification of true-influential factors. We set available all the algorithms we used in this paper at Mathworks in the GSUA-CSB toolbox.
Similar content being viewed by others
References
Archer GEB, Saltelli A, Sobol IM (1997) Sensitivity measures, ANOVA-like techniques and the use of bootstrap. Stat Comput Simul 58:99–120
Babtie AC, Kirk P, Stumpf MPH (2014) Topological sensitivity analysis for systems biology. Proc Natl Acad Sci 111(52):18507–18512. https://doi.org/10.1073/pnas.1414026112
Barrios J, Piétrus A, Joya G, Marrero A, de Arazoza H (2013) A differential inclusion approach for modeling and analysis of dynamical systems under uncertainty: application to dengue disease transmission. Soft Comput 17(2):239–253. https://doi.org/10.1007/s00500-012-0889-2
Cantó B, Coll C, Sánchez E (2017) Estimation of parameters in a structured SIR model. Adv Differ Equ. https://doi.org/10.1186/s13662-017-1078-5
Cuevas A, Febrero M, Fraiman R (2006) On the use of the bootstrap for estimating functions with functional data. Comput Stat Data Anal 51(2):1063–1074. https://doi.org/10.1016/j.csda.2005.10.012
Draper NR, Smith H (1998) Applied regression analysis. Wiley, England. https://doi.org/10.1198/tech.2005.s303
Duong V, Lambrechts L, Paul RE, Ly S, Lay RS, Long KC, Huy R, Tarantola A, Scott TW, Sakuntabhai A, Buchy P (2015) Asymptomatic humans transmit dengue virus to mosquitoes. Proc Natl Acad Sci 112(47):14688–14693. https://doi.org/10.1073/pnas.1508114112
Geris L, Gomez-Cabrero D (2016) Uncertainty in biology a computational modeling approach. Springer, Switzerland
Inselberg A, Dimsdale B (1990) Parallel coordinates: a tool for visualizing multi-dimensional geometry. In: Proceedings of the first IEEE conference on visualization: visualization, vol 90, pp 361–378
Instituto Nacional de Salud, Manual del usuario del software SIVIGILA (2011). https://goo.gl/j1DpX4
Isermann R, Münchhof M (2011) Identification of dynamic systems. Springer, Berlin. https://doi.org/10.1007/978-3-540-78879-9
James F (2006) Statistical methods in experimental physics. World Scientific, Switzerland. https://doi.org/10.1142/6096
Joshi M, Seidel-Morgenstern A, Kremling A (2006) Exploiting the bootstrap method for quantifying parameter confidence intervals in dynamical systems. Metab Eng 8(5):447–455. https://doi.org/10.1016/j.ymben.2006.04.003
Kreutz C, Raue A, Kaschek D, Timmer J (2013) Profile likelihood in systems biology. FEBS J 280(11):2564–2571. https://doi.org/10.1111/febs.12276
Liu RY, Parelius JM, Singh K (1999) Multivariate analysis by data depth: descriptive statistics, graphics and inference (with discussion and a rejoinder by Liu and Singh). Ann Stat 27(3):783–858
Lizarralde-Bejarano DP, Rojas-Díaz D, Arboleda-Sánchez S, Puerta-Yepes ME (2020) Sensitivity, uncertainty and identifiability analyses to define a dengue transmission model with real data of an endemic municipality of colombia. PLoS ONE 15(3):e0229668. https://doi.org/10.1371/journal.pone.0229668
Loperfido N (2018) Skewness-based projection pursuit: a computational approach. Comput Stat Data Anal 120:42–57. https://doi.org/10.1016/j.csda.2017.11.001
López-Pintado S, Romo J (2009) On the concept of depth for functional data. J Am Stat Assoc 104(486):718–734
Mathworks (2022) https://www.mathworks.com/help/simbio/ref/sbioparameterci.html
Miao H, Xia X, Perelson AS, Wu H (2011) On identifiability of nonlinear ODE models and applications in viral dynamics. SIAM Rev Soc Ind Appl Math 53(1):3–39
Munster VJ, Koopmans M, van Doremalen N, van Riel D, de Wit E (2020) A novel coronavirus emerging in China—key questions for impact assessment. N Engl J Med 382(8):692–694. https://doi.org/10.1056/nejmp2000929
Nikishova A, Hoekstra AG (2019) Semi-intrusive uncertainty propagation for multiscale models. J Comput Sci 35:80–90. https://doi.org/10.1016/j.jocs.2019.06.007
Olsson A, Sandberg G, Dahlblom O (2003) On Latin hypercube sampling for structural reliability analysis. Struct Saf 25(1):47–68. https://doi.org/10.1016/s0167-4730(02)00039-5
Pedrycz W (2008) Handbook of granular computing. John Wiley & Sons, Chichester
Peixoto VR, Nunes C, Abrantes A (2020) Epidemic surveillance of Covid-19: considering uncertainty and under-ascertainment. Port J Public Health 38(1):23–29. https://doi.org/10.1159/000507587
Peña D, Prieto FJ (2007) Combining random and specific directions for outlier detection and robust estimation in high-dimensional multivariate data. J Comput Graph Stat 16(1):228–254
Rojas-Díaz D, Vélez-Sánchez CM (2019) drojasd/gsua-csb: Gsua-csb v1.0. https://doi.org/10.5281/zenodo.3383316
Saltelli A, Ratto M, Andres T, Campolongo F, Cariboni J, Gatelli D, Saisana M, Tarantola S (2008) Global sensitivity analysis: the primer. John Wiley & Sons, England
Saltelli A, Annoni P, Azzini I, Campolongo F, Ratto M, Tarantola S (2010) Variance based sensitivity analysis of model output. Design and estimator for the total sensitivity index. Comput Phys Commun 181:259–270
Seber GAF, Wild CJ (2003) Nonlinear regression. John Wiley & Sons, New Jersey
Sen D, Sen D (2021) Use of a modified SIRD model to analyze COVID-19 data. Ind Eng Chem Res 60(11):4251–4260. https://doi.org/10.1021/acs.iecr.0c04754
Shampine LF, Reichelt MW (1997) The MATLAB ODE suite. SIAM J Sci Comput 18(1):1–22. https://doi.org/10.1137/s1064827594276424
Small NJH (1980) Marginal skewness and kurtosis in testing multivariate normality. Appl Stat 29(1):85. https://doi.org/10.2307/2346414
Sobol IM (1990) On sensitivity estimation for nonlinear mathematical models. Mat Mod 2:112–118
Sobol IM (2001) Global sensitivity indices for nonlinear mathematical models and their Monte Carlo estimates. Math Comput Simul 55:271–280
Stapor P, Fröhlich F, Hasenauer J (2018) Optimization and profile calculation of ODE models using second order adjoint sensitivity analysis. Bioinformatics 34(13):i151–i159. https://doi.org/10.1093/bioinformatics/bty230
Sun Y, Genton MG (2011) Functional boxplots. J Comput Graph Stat 20(2):316–334. https://doi.org/10.1198/jcgs.2011.09224
Takahashi LT, Maidana NA, Ferreira WC, Pulino P, Yang HM (2005) Mathematical models for the Aedes aegypti dispersal dynamics: travelling waves by wing and wind. Bull Math Biol 67:509–528. https://doi.org/10.1016/j.bulm.2004.08.005
Talbi EG (2009) Metaheuristics: from design to implementation. Wiley Online Library, England. https://doi.org/10.1002/9780470496916
Toan NT, Rossi S, Prisco G, Nante N, Viviani S (2015) Dengue epidemiology in selected endemic countries: factors influencing expansion factors as estimates of underreporting. Trop Med Int Health 20(7):840–863
Voss HU, Timmer J, Kurths J (2004) Nonlinear dynamical system identification from uncertain and indirect measurements. Int J Bifurc Chaos 14(6):1905–1933
Vugrin KW, Swiler LP, Roberts RM, Stucky-Mack NJ, Sullivan SP (2007) Confidence region estimation techniques for nonlinear regression in groundwater flow: three case studies. Water Resour Res. https://doi.org/10.1029/2005wr004804
Wegman EJ (1990) Hyperdimensional data analysis using parallel coordinates. J Am Stat Assoc 85(411):664–675
Wood SN (2001) Minimizing model fitting objectives that contain spurious local minima by bootstrap restarting. Biometrics 57(1):240–244. https://doi.org/10.1111/j.0006-341x.2001.00240.x
Xiao S, Lu Z, Wang P (2018) Multivariate global sensitivity analysis based on distance components decomposition. Risk Anal 38(12):2703–2721. https://doi.org/10.1111/risa.13133
Yang HM, da Graca Macoris MD, Galvani KC, Andrighetti MT (2011) Follow up estimation of Aedes aegypti entomological parameters and mathematical modellings. Biosystems 103(3):360–371. https://doi.org/10.1016/j.biosystems.2010.11.002
Acknowledgements
This study was funded by Departamento Administrativo de Ciencia, Tecnología e Innovación - COLCIENCIAS (Grant Number 111572553478) and Universidad EAFIT (Grant Number 954-000002).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix
Appendix 1: Mono-objective example: Dengue
We selected a system of ordinary differential equations with seven states proposed by Lizarralde-Bejarano et al. (2020) to estimate parameters and confidence intervals for a dengue outbreak. This model includes the virus transmission between a population of humans and mosquitoes (see Eq. in 11). We kept the assumptions in Lizarralde-Bejarano et al. (2020) of a total mosquito population variable in time M, where \(M = M_s+M_e+M_i\), and total human population constant in time H, where \(H= H_s+H_e+H_i+H_r\). We list the state variables and parameters of the model in Table 5.
Appendix 2: Multi-objective example: COVID-19
Implemented model for COVID-19 infection (Sen and Sen 2021).
See Table 6.
with \(N = S + C + E + A + I + Q + R + D\)
Appendix 3: Samples of multiple CSB estimations for dengue and COVID-19 models
Appendix 4: Confidence intervals for simpler model
Consider the following non-linear pendulum model with input force \(u=1\) and friction
We selected \(\hat{X}\) as an arbitrary combination of factors within some arbitrary search box and then we proceeded to compute the \(\hat{X}\)-protected CSB intervals with sample size \(N=5000\) alongside the likelihood-based CI. Results are summarized in Table 7 and Fig. 10. We used the same parameterization for this experiment as for the Dengue and the COVID-19 models.
Appendix 5: Confidence intervals: MSE-based
See Fig. 11.
Appendix 6: Algorithms
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Rojas-Diaz, D., Catano-Lopez, A., Vélez, C.M. et al. Confidence sub-contour box: an alternative to traditional confidence intervals. Comput Stat 39, 2821–2858 (2024). https://doi.org/10.1007/s00180-023-01362-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-023-01362-4